This presentation will explore one such method--a text-mining method called topic modeling--and illustrate its potential through a comparative analysis of two Civil War newspapers: the New York Times and the Richmond Daily Dispatch. These are both relatively large corpora, each consisting of more than 100,000 articles and advertisements. Topic modeling enables us to identify major topics in such larger corpora and quantify and chart their relative frequency over time, allowing us to analyze some broad historical patterns.
Many of these patterns are surprising, prompting new questions and suggesting new insights. To illustrate the potential of topic modeling, this presentation will present some initial conclusions from this research. I will consider how graphs of fugitive slave ads and of hiring ads in the Dispatch suggest that increases in runaways compromised the local slave market in Richmond, albeit temporarily. I will also analyze the topic model to explore the relationship between florid, patriotic paeans about God, honor, and country and vitriolic editorials in each paper condemning the immorality of the other section's society. Graphs of these two topics are remarkably similar; patriotic poetry and splenetic attacks were always two sides of the same coin. Together these graphs provide a cardiogram registering the deployment of patriotism and sectionalism in two major newspapers during the course of the war.
See more of: AHA Sessions