My presentation focuses on the work that our team of historians, librarians, and computer scientists has done to develop a pan-institutional (Alberta, Dalhousie, Victoria, Toronto, Winnipeg, and Simon Fraser University) web archiving portal in Canada. Having ingested 16 TB of web archival data, we have attempted to develop transparent search algorithms, as well as other forms of supporting data, to make decisions and discovery more transparent. The presentation speaks to the decisions and challenges we have faced in addressing the two problems above, both as historians and as a team conceptualizing and executing a large-scale project. It also discusses the skills students will need to work with this material, both to explore it on its own merits and to be able to work in an interdisciplinary context.
See more of: Primary Sources and the Historical Profession in the Age of Text Search
See more of: AHA Sessions