Battalion and Company Level Data Bases

Since the collapse of the Soviet Union in 1991, the need and desire to model combat at the division-level has declined. The focus has shifted to lower levels of combat. As such, we have created the Battalion-Level Operations Data Base (BLODB) and the Company-Level Actions Data Base (CLADB).

The challenge for both of these databases is to find actions that have good data for both sides. It is the nature of military organizations that divisions have the staff and record keeping that allows one to model them. These records are often (but not always !!!) preserved. So, it is possible to assemble the data for both sides for an engagement at division level. This is true through at least World War II (up through 1945). After that, getting unit records from both sides is difficult. Usually one or both of the opponents are still keeping their records classified or close hold. This is why we ended up posting on this subject:

The Sad Story Of The Captured Iraqi DESERT STORM Documents

And:

So Why Are Iraqi Records Important?

 

Just to give an example of the difficulty of creating battalion-level engagements, for the southern offensive around Belgorod (Battle of Kursk) from 4-18 July 1943 I was able to created 192 engagements using the unit records for both sides. I have yet to create a single battalion-level engagement from those records. The only detailed description of a battalion-level action offered in the German records are of a mop-up operation done by the 74th Engineer Battalion. We have no idea of who they were facing or what their strength was. We do have strengths at times of various German battalions and we sometimes have strength and losses for some of the Soviet infantry and tank regiments, so it might be possible to work something up with a little estimation, but it certainly can not be done systematically like we have for division-level engagements. As U.S. and British armies (and USMC) tend to have better battalion-level record keeping than most other armies, it is possible to work something up from their records, if you can put together anything on their opponents. So far, our work on battalion-level and company-level combat has been more of a grab-bag and catch-and-catch-can effort that we had done over time.

Our battalion-level data base consists of 127 cases. They cover from 1918 to 1991. It is described here: http://www.dupuyinstitute.org/data/blodb.htm The blurry photo at the start of this blog if from that database.

Our company-level data base is more recent. It has not been set up yet as an Access data base. It consists of 98 cases from 1914 to 2000.

The BLODB was used for the battalion-level validation of the TNDM. This is discussed briefly in Chapter 19 of War by Numbers. These engagements are discussed in depth in four issues of  our International TNDM Newsletter (see Vol. 1, Numbers 2, 4, 5, 6 here: http://www.dupuyinstitute.org/tdipub4.htm )

The CLADB was used for a study done for Boeing on casualty rates compared to unit sizes in combat. This is discussed in depth in Chapter 12: The Nature of Lower Levels of Combat in War by Numbers.

Both databases are in need to expansion. To date, we have not found anyone willing to fund such an effort.

A Genius for War is back in print

While browsing Amazon.com I discovered that Trevor Dupuy’s Genius for War is back in print. It has a new cover.

It was republished last year by a company called Endeavour Media, Ltd. Trevor Dupuy’s works still belong to the family and are managed by one of his sons. The book does have 32 reviews on Amazon.com, a lot of them fairly recent (27 from 2011 and later).

A Genius for War

Also see:

http://www.endeavourmedia.co.uk/our-books/

http://www.endeavourmedia.co.uk/?s=dupuy

They have the following books by Trevor Dupuy for sale:

A Genius of War

The Battle of Austerlitz

Brave Men and Great Captains

Elusive Victory: The Arab-Israeli Wars, 1947-1974

Flawed Victory: How Lebanon became a cauldron of war and hate (co-authored by Paul Martell)

Future Wars: The World’s Most Dangerous Flashpoints

Hitler’s Last Gamble: The Battle of the Bulge

Military Heritage of America

The Compact History of the Revolutionary War

Understanding Defeat: How to Recover from Loss in Battle and Gain Victory in War

World War II: A Combat History (by R. Ernest Dupuy)

 

We also have a number of Trevor Dupuy’s books for sale. See http://www.dupuyinstitute.org/booksfs.htm

The books are:

Understanding War: History and Theory of Combat

Attrition: Forecasting Battles, Casualties and Equipment Losses in Modern War

Elusive Victory: The Arab Israeli Wars, 1947-197

Brave Men and Great Captains

Understanding Defeat

Future Wars

The Harper Encyclopedia of Military Biography

Military Heritage of the America, Volume II (only…but updated)

If War Comes; How to Defeat Saddam Hussein

Hitler’s Last Gamble (two volume Japanese language edition)

The Division Level Engagement Data Base (DLEDB)

The Division Level Engagement Data Base (DLEDB) is one of eight data bases that make up our DuWar suite of databases: See http://www.dupuyinstitute.org/dbases.htm This data base, of 752 engagements, is described in depth at: http://www.dupuyinstitute.org/data/dledb.htm

It now consists of 752 engagements from 1904 to 1991. It was originally created in 2000-2001 by us independent of any government contracts (so as to ensure it was corporate proprietary). We then used it as an instrumental part of the our Enemy Prisoner of War studies and then our three Urban Warfare studies.

Below is a list of wars/campaigns the engagements are pulled from:

Russo-Japanese War (1904-1905): 3 engagements

Balkan Wars (1912-1913): 1 engagement

World War I (1914-1918): 25 engagements

…East Prussia (1914): 1

…Gallipoli (1915): 2

…Mesopotamia (1915): 2

…1st & 2nd Artois (1915): 7

…Loos (1915): 2

…Somme (1916): 2

…Mesopotamia (1917): 1

…Palestine (1917): 2

…Palestine (1918): 1

…US engagements (1918): 5

World War II (1939-1945): 657 engagements

…Western Front: 295

……France (1940): 2

……North Africa (1941): 5

……Crete (1941): 1

……Tunisia (1943): 5

……Italian Campaign (1943-1944): 141

……France (1944): 61

…,,,Aachen (1944): 23

……Ardennes (1944-1945): 57

…Eastern Front: 267

……Eastern Front (1943-1945): 11

…….Kursk (1943): 192

……Kharkov (1943): 64

…Pacific Campaign: 95

…….Manchuria (1938): 1

…….Malayan Campaign (1941): 1

…….Phillipines (1942): 1

…….Islands (1944-1945): 4

…….Okinawa (1945): 27

…….Manila (1945): 61

Arab-Israeli Wars (1956-1973): 51 engagements

…1956: 2

…1967: 16

…1968: 1

…1973: 32

Gulf War (1991): 15 engagements

 

Now our revised version of the earlier Land Warfare Data Base (LWDB) of 605 engagements had more World War I engagements. But some of these engagements had over a hundred of thousand men on a side and some lasted for months. It was based upon how the battles were defined at the time; but was really not relevant for use in a division-level database. So, we shuffled them off to something called the Large Action Data Base (LADB), were 55 engagements have sat, unused, since then. Some actions in the original LWDB were smaller than division-level. These made up the core of our battalion-level and company-level data bases.

The Italian Campaign Engagements were the original core of this database. An earlier version of the data base has only 76 engagements from Italy in them (around year 2000). We then expanded, corrected and revised them. So the database still has 40 of the original engagements, 22 were revised, and the rest (79) are new.

The original LWDB was used for parts of Trevor Dupuy’s book Understanding War. The DLEDB was a major component of my book War by Numbers.

As can be seen, it is possible to use this database for model development and/or validation. One could start by developing/testing the model to the 141 Italian Campaign engagements, and then further develop it by testing it to the 141 campaigns from France and the Battle of the Bulge. And then, to test the human factors elements of your models (which if you are modeling warfare I would hope you would have), one could then test it to the 267 division-level engagements on the Eastern Front. Then move forward in time with the 51 engagements from the Arab-Israeli Wars and the 15 engagements from the Gulf War. There is not a lack of data available for model development or model testing. It is, of course, a lot of work; and lately it seems that the  industry has been more concerned about making sure their models have good graphics.

Just to beat a dead horse, we remind you of this post that annoyed several people over at TRADOC (the U.S. Army’s Training and Doctrine Command):

Wargaming Multi-Domain Battle: The Base Of Sand Problem

Finally, it is possible to examine changes in warfare over time. This is useful to understand it one is looking at changes in warfare in the future. The DLEDB covers 88 years of warfare. We also have the Battles Data Base (BaDB) of 243 battles from 1600-1900. It is described here: http://www.dupuyinstitute.org/data/badb.htm

Next I will describe our battalion-level and company-level databases.

Battles versus Campaigns (for Validation)

So we created three campaign databases. One of the strangest arguments I have heard against doing validations or testing combat models to historical data, is that this is only one outcome from history. So you don’t know if model is in error or if this was a unusual outcome to the historical event. Someone described it as the N=1 argument. There are lots of reasons why I am not too impressed with this argument that I may enumerate in a later blog post. It certainly might apply to testing the model to just one battle (like the Battle of 73 Easting in 1991), but these are weeks-long campaign databases with hundreds of battles. One can test the model to these hundreds of points in particular in addition to testing it to the overall result.

In the case of the Kursk Data Base (KDB), we have actually gone through the data base and created from it 192 division-level engagements. This covers every single combat action by every single division during the two week offensive around Belgorod. Furthermore, I have listed each and every one of these as an “engagement sheet’ in my book on Kursk. The 192 engagement sheets are a half-page or page-long tabulation of the strengths and losses for each engagement for all units involved. Most sheets cover one day of battle. It took considerable work to assemble these. First one had to figure out who was opposing whom (especially as unit boundaries never match) and then work from there. So, if someone wants to test a model or model combat or do historical analysis, one could simply assemble a database from these 192 engagements. If one wanted more details on the engagements, there are detailed breakdowns of the equipment in the Kursk Data Base and detailed descriptions of the engagements in my Kursk book. My new Prokhorovka book (release date 1 June), which only covers the part of the southern offensive around Prokhorovka from the 9th of July, has 76 of those engagements sheets. Needless to say, these Kursk engagements also make up 192 of the 752 engagements in our DLEDB (Division Level Engagement Data Base).  A picture of that database is shown at the top of this post.

So, if you are conducting a validation to the campaign, take a moment and check the results to each division to each day. In the KDB there were 17 divisions on the German side, and 37 rifle divisions and 10 tank and mechanized corps (a division-sized unit) on the Soviet side. The data base covers 15 days of fighting. So….there are around 900 points of daily division level results to check the results to. I drawn your attention to this graph:

There are a number of these charts in Chapter 19 of my book War by Numbers. Also see:

Validating Attrition

The Ardennes database is even bigger. There was one validation done by CAA (Center for Army Analysis) of its CEM model (Concepts Evaluation Model) using the Ardennes Campaign Simulation Data Bases (ACSDB). They did this as an overall comparison to the campaign. So they tracked the front line trace at the end of the battle, and the total tank losses during the battle, ammunition consumption and other events like that. They got a fairly good result. What they did not do was go into the weeds and compare the results of the engagements. CEM relies on inputs from ATCAL (Attrition Calculator) which are created from COSAGE model runs. So while they tested the overall top-level model, they really did not test ATCAL or COSAGE, the models that feed into it. ATCAL and COSAGE I gather are still in use. In the case of Ardennes you have 36 U.S. and UK divisions and 32 German divisions and brigades over 32 days, so over 2,000 division days of combat. That is a lot of data points to test to.

Now we have not systematically gone through the ACSDB and assembled a record for every single engagement there. There would probably be more than 400 such engagements. We have assembled 57 engagements from the Battle of the Bulge for our division-level database (DLEDB). More could be done.

Finally, during our Battle of Britain Data Base effort, we recommended developing an air combat engagement database of 120 air-to-air engagements from the Battle of Britain. We did examine some additional mission specific data for the British side derived from the “Form F” Combat Reports for the period 8-12 August 1940. This was to demonstrate the viability of developing an engagement database from the dataset. So we wanted to do something similar for the air combat that we had done with division-level combat. An air-to-air engagement database would be very useful if you are developing any air campaign wargame. This unfortunately was never done by us as the project (read: funding) ended.

As it is we actually have three air campaign databases to work from, the Battle of Britain data base, the air component of the Kursk Data Base, and the air component of the Ardennes Campaign Simulation Data Base. There is a lot of material to work from. All it takes it a little time and effort.

I will discuss the division-level data base in more depth in my next post.

Reviews of NPW

I was just surfing Amazon.com yesterday and noticed that Trevor Dupuy’s book Numbers, Predictions and Wars had 8 reviews. This is odd as I think the book went out of print before Amazon.com existed. The version they were reviewing was the hardcover from 1979. All the reviews were from 2012-2018, all four and five stars. Interesting. Even I don’t have a copy of the hardback version.

Numbers, Predictions & War

Now, if only people would pay more attention to his greatest work, Understanding War. It only has two reviews.

Understanding War

The Battle of Britain Data Base

The Battle of Britain data base came into existence at the request of OSD PA&E (Office of the Secretary of Defense, Program Analysis and Evaluation). They contacted us. They were working with LMI (Logistics Management Institute, on of a dozen FFRDCs) to develop an air combat model. They felt that the Battle of Britain would be perfect for helping to develop, test and validate their model. The effort was led by a retired Air Force colonel who had the misfortune of spending part of his career in North Vietnam.

The problem with developing any air campaign database is that, unlike the German army, the Luftwaffe actually followed their orders late in the war to destroy their records. I understand from conversations with Trevor Dupuy that Luftwaffe records were stored in a train and had been moved to the German countryside (to get them away from the bombing and/or advancing armies). They then burned all the records there at the rail siding.

So, when HERO (Trevor Dupuy’s Historical Evaluation Research Organization) did their work on the Italian Campaign (which was funded by the Air Force), they had to find records on the German air activity with the Luftwaffe liaison officers of the German armies involved. The same with Kursk, where one of the few air records we had was with the air liaison officer to the German Second Army. This was the army on the tip of the bulge that was simply holding in place during the battle. It was the only source that gave us a daily count of sorties, German losses, etc. Of the eight or so full wings that were involved in the battle from the VIII Air Corps, we had records for one group of He-111s (there were usually three groups to a wing). We did have good records from the Soviet archives. But it hard to assemble a good picture of the German side of the battle with records from only 1/24th of the units involved. So the very limited surviving files of the Luftwaffe air liaison officers was all we had to work with for Italy and Kursk. We did not even have that for the Ardennes. Luckily the German air force simplified things by flying almost no missions until the disastrous Operation Bodenplatte on 1 January 1945. Of course, we had great records from the U.S. and the UK, but….hard to develop a good database without records from both sides. Therefore, one is left with few well-documented air battles anywhere for use in developing, evaluating and validating an air campaign model.

The exception is the Battle of Britain, which has been so well researched, and extensively written about, that it is possible to assemble an accurate and detailed daily account for both sides for every day of the battle. There are also a few surviving records that can be tapped, including the personal kill records of the pilots, the aircraft loss reports of the quartermaster, and the ULTRA reports of intercepted German radio messages. Therefore, we (mostly Richard Anderson) assembled the Battle of Britain data base from British unit records and the surviving records and the extensive secondary sources for the German side. We have already done considerable preliminary research covering 15 August to 19 September 1940 as a result of our work on DACM (Dupuy Air Combat Model)

The Dupuy Air Campaign Model (DACM)

The database covered the period from 8 August to 30 September 1940. It was programmed in Access by Jay Karamales.  From April to July 2004 we did a feasibility study for LMI. We were awarded a contract from OSD PA&E on 1 September to start work on the database. We sent a two-person research team to the British National Archives in Kew Gardens, London. There we examined 249 document files and copied 4,443 pages. The completed database and supporting documentation was delivered to OSD PA&E in August 2005. It was certainly the easiest of our campaign databases to do.

We do not know if OSD PA&E or LMI ever used the data base, but we think not. The database was ordered while they were still working on the model. After we delivered the database to them, we do not know what happened. We suspect the model was never completed and the effort was halted. The database has never been publically available. PA&E became defunct in 2009 and was replaced by CAPE (Cost Assessment and Program Evaluation). We may be the only people who still have (or can find) a copy of this database.

I will provide a more detailed description of this database in a later post.

Dupuy’s Verities: The Requirements For Successful Defense

A Sherman tank of the U.S. Army 9th Armored Division heads into action against the advancing Germans during the Battle of the Bulge. {Warfare History Network]

The eighth of Trevor Dupuy’s Timeless Verities of Combat is:

Successful defense requires depth and reserves.

From Understanding War (1987):

Successful defense requires depth and reserves. It has been asserted that outnumbered military forces cannot afford to withhold valuable firepower from ongoing defensive operations and keep it idle in reserve posture. History demonstrates that this is specious logic, and that linear defense is disastrously vulnerable. Napoleon’s crossing of the Po in his first campaign in 1796 is perhaps the classic demonstration of the fallacy of linear (or cordon) defense.

The defender may have all of his firepower committed to the anticipated operational area, but the attacker’s advantage in having the initiative can always render much of that defensive firepower useless. Anyone who suggests that modern technology will facilitate the shifting of engaged firepower in battle overlooks three considerations: (a) the attacker can inhibit or prevent such movement by both direct and indirect means, (b) a defender engaged in a fruitless firefight against limited attacks by numerically inferior attackers is neither physically nor psychologically attuned to making lateral movements even if the enemy does not prevent or inhibit it, and (c) withdrawal of forces from the line (even if possible) provides an alert attacker with an opportunity for shifting the thrust of his offensive to the newly created gap in the defenses.

Napoleon recognized that hard-fought combat is usually won by the side committing the last reserves. Marengo, Borodino, and Ligny are typical examples of Napoleonic victories that demonstrated the importance of having resources available to tip the scales. His two greatest defeats, Leipzig and Waterloo, were suffered because his enemies still had reserves after his were all committed. The importance of committing the last reserves was demonstrated with particular poignancy at Antietam in the American Civil War. In World War II there is no better example than that of Kursk. [pp. 5-6]

Dupuy’s observations about the need for depth and reserves for a successful defense take on even greater current salience in light of the probably character of the near-future battlefield. Terrain lost by an unsuccessful defense may be extremely difficult to regain under prevailing circumstances.

The interaction of increasing weapon lethality and the operational and human circumstantial variables of combat continue to drive the long-term trend in dispersion of combat forces in frontage and depth.

Long-range precision firepower, ubiquitous battlefield reconnaissance and surveillance, and the effectiveness of cyber and information operations will make massing of forces and operational maneuver risky affairs.

As during the Cold War, the stability of alliances may depend on a willingness to defend forward in the teeth of effective anti-access/area denial (A2/AD) regimes that will make the strategic and operational deployment of reserves risky as well. The successful suppression of A2/AD networks might court a nuclear response, however.

Finding an effective solution for enabling a successful defense-in-depth in the future will be a task of great difficulty.

The Use of the Two Campaign Data Bases

The two large campaign data bases, the Ardennes Campaign Simulation Data Base (ACSDB) and the Kursk Data Base (KDB) were designed to use for validation. Some of the data requirements, like mix of personnel in each division and the types of ammunition used, were set up to match exactly the categories used in the Center for Army Analysis’s (CAA) FORCEM campaign combat model. Dr. Ralph E. Johnson, the program manager for FORCEM was also the initial contract manager for the ACSDB.

FORCEM was never completed. It was intended to be an improvement to CAA’s Concepts Evaluation Model (CEM) which dated back to the early 1970s. So far back that my father had worked with it. CAA ended up reverting back to CEM in the 1990s.

They did validate the CEM using the ACSDB. Some of their reports are here (I do not have the link to the initial report by the industrious Walt Bauman):

https://apps.dtic.mil/dtic/tr/fulltext/u2/a320463.pdf

https://apps.dtic.mil/dtic/tr/fulltext/u2/a489349.pdf

It is one of the few actual validations ever done, outside of TDI’s (The Dupuy Institute) work. CEM is no longer used by CAA. The Kursk Data Base has never used for validation. Instead they tested Lanchester equations to the ACSDB and KDB. They failed.

Lanchester equations have been weighed….

But the KDB became the darling for people working on their master’s thesis for the Naval Post-Graduate School. Much of this was under the direction of Dr. Tom Lucas. Some of their reports are listed here:

http://www.dupuyinstitute.org/links.htm

Both the ACSDB and KDB had a significant air component. The air battle over the just the German offensive around Belgorod to the south of Kursk was larger than the Battle of Britain. The Ardennes data base had 1,705 air files. The Kursk data base had 753. One record, from the old Dbase IV version of the Kursk data base, is the picture that starts this blog post. These files basically track every mission for every day, to whatever level of detail the unit records allowed (which were lacking). The air campaign part of these data bases have never been used for any analytical purpose except our preliminary work on creating the Dupuy Air Campaign Model (DACM).

The Dupuy Air Campaign Model (DACM)

This, of course, leads into our next blog post on the Battle of Britain data base.

Toward An American Approach To Proxy Warfare

U.S.-supported Philippine guerilla fighters led the resistance against the Japanese occupation of Luzon during World War II. [Warfare History Network]

U.S. Army Major Amos Fox has recently published the first two of a set of three articles examining nature of proxy warfare in the early 21st century and suggests some ideas for how the U.S. might better conduct it.

In “In Pursuit of a General Theory of Proxy Warfare,” published in February 2019 by the The Institute of Land Warfare at the Association of the U.S. Army, and “Time, Power, and Principal-Agent Problems: Why the U.S. Army is Ill-Suited for Proxy Warfare Hotspots,” published in the March-April 2019 edition of Military Review, Fox argues,

Proxy environments dominate modern war… It is not just a Russian, Iranian or American approach to war, but one in which many nations and polities engage. However, the U.S. Army lacks a paradigm for proxy warfare, which disrupts its ability to understand the environment or develop useful tactics, operations and strategies for those environments.

His examination of the basic elements of proxy warfare leads him to conclude that “it is dominated by a principal actor dynamic, power relationships and the tyranny of time.” From this premise, Fox outlines two basic models of proxy warfare: exploitative and transactional.

The exploitative model…is characterized by a proxy force being completely dependent on its principal for survival… [It] is usually the result of a stronger actor looking for a tool—a proxy force—to pursue an objective. As a result, the proxy is only as useful to the principal as its ability to make progress toward the principal’s ends. Once the principal’s ends have been achieved or the proxy is unable to maintain momentum toward the principal’s ends, then the principal discontinues the relationship or distances itself from the proxy.

The transactional model is…more often like a business deal. An exchange of services and goods that benefits all parties—defeat of a mutual threat, training of the agent’s force, foreign military sales and finance—is at the heart of the transactional model. However, this model is a paradox because the proxy is the powerbroker in the relationship. In many cases, the proxy government is independent but looking for assistance in defeating an adversary; it is not interested in political or military subjugation by the principal. Moreover, the proxy possesses the power in the relationship because its association with the principal is wholly transactional…the clock starts ticking on the duration of the bond as soon as the first combined shot is fired. As a result, as the common goal is gradually achieved, the agent’s interest in the principal recedes at a comparable rate.

With this concept in hand, Fox makes that case that

[T]he U.S. Army is ill-suited for warfare in the proxy environment because it mismanages the fixed time and the finite power it possesses over a proxy force in pursuit of waning mutual interests. Fundamentally, the salient features of proxy environments—available time, power over a proxy force, and mutual interests—are fleeting due to the fact that proxy relationships are transactional in nature; they are marriages of convenience in which a given force works through another in pursuit of provisionally aligned political or military ends… In order to better position itself to succeed in the proxy environment, the U.S. Army must clearly understand the background and components of proxy warfare.

These two articles provide an excellent basis for a wider discussion for thinking about and shaping not just a more coherent U.S. Army doctrine, but a common policy/strategic/operational framework for understanding and successfully operating in the proxy warfare environments that will only loom larger in 21st century international affairs. It will be interesting to see how Fox’s third article rounds out his discussion.

Validation Data Bases Available (Kursk)

The second large campaign validation database created was the Kursk Data Base (KDB), done 1993-1996. I was also the program manager for this one and it ran a lot smoother than the first database. There was something learned in the process. This database involved about a dozen people, including a Russian research team led by Col. (Dr.) Fyodor Sverdlov, WWII veteran, author and Frunze Military academy; and ably assisted by Col. (Dr.) Anatoli Vainer, ditto. It also involved was the author Dr. Richard Harrison, and of course, Richard Anderson and Jay Karamales. Col. David Glantz helped with the initial order of battle as a consultant.

The unique aspect of the database is that we obtained access to the Soviet archives and was able to pull from it the unit records at the division, corps and army-level for every Soviet unit involved. This was a degree of access and research never done before for an Eastern Front battle. We were not able to access the Voronezh Front files and other higher command files as they were still classified.

The KDB tracked the actions of all divisions and division-sized units on both sides for every day of the German offensive in the south for 4 July 1943 to 18 July 1943. Kursk is a huge battle (largest battle of WWII) and consists of four separate portions. This database covered only one of the four parts, and that part was similar in size to the Battle of the Bulge and the air battle was larger than the Battle of Britain. On the German side were 17 panzer, panzer grenadier and infantry divisions while on the Soviet side were 37 rifle divisions and 10 tank and mechanized corps. There were 9 attacking German armored divisions versus 10 Soviet tank and mechanized corps at the Belgorod Offensive at Kursk. At the Battle of the Bulge there were 8 attacking (engaged) German armored divisions versus 9 U.S. armored divisions. The database design and what data was tracked was almost the same as the Ardennes Campaign Simulation Data Base (ACSDB). The stats on the data are here: http://www.dupuyinstitute.org/data/kursk.htm

The database was programmed in Dbase IV and is DOS based. Dbase IV has the advantage that it allowed text fields. Dbase III did not, so we were limited to something like 256 characters for our remarks fields. With Dbase IV, the remarks fields sometimes grew to a page or two as we explained what data was available and how they were used to assemble daily counts of strengths and losses. Sometimes they were periodic (vice daily) reports and sometimes contradictory reports. It was nice to be able to fully explain for each and every case how we analyzed the data. The Dbase IV version of the KDB is publicly available through NTIS (National  Technical Information Service). The pictures in this blog post are screen shots from the Dbase IV version.

We also re-programmed the data base into Access and rather extensively and systematically updated it. This was in part because we took every single unit for every single day of the battle and assembled it into 192 different division-on-division engagements for use in our Division Level Engagement Data Base (DLEDB). This was done over a period of 11 years. We did the first 49 engagements in 1998-99 to support the Enemy Prisoner of War (EPW) Capture Rate Study for CAA (Center for Army Analysis), report E-4 (see http://www.dupuyinstitute.org/tdipub3.htm). Some of the other engagements were done later to support the study on Measuring the Value of Situational Awareness for OSD Net Assessment (Andy Marshall’s shop), reports SA-1. We (meaning me) then finished up the rest of the engagements in 2004 and 2009. In the end we had assembled an engagement record for every single division-on-division level engagement for the Belgorod Offensive. Added to that, in 1999 I began working on my Kursk book, which I had mostly finished in 2003 (but was not published until 2015). So over time, we rather systematically reviewed and revised the data in the database. This is not something we were able to do to the same extent for the ACSDB. The 192 engagements in DLEDB were then summarized as 192 separate “engagement sheets” in my Kursk book. There are also 76 of these engagement sheets available in my new Kursk book coming out in June: The Battle of Prokhorovka. This new book covers the part of the Belgorod offensive centered around the Battle of Prokhorovka.