Below are specific resources, databases and archives that are particularly accessible for text and data mining work.
Text and data mining the Times Digital Archive
We are currently running a project to explore the issues involved in mounting databases of digitised texts on the BEAR infrastructure, so that researchers across campus can access and work with the data securely. As part of this, Library Services and IT Services have worked together to make copies of The Times Digital Archive available for text and data mining by staff and doctoral researchers, who can gain access by requesting space within the BEAR infrastructure.
To make a request, please complete the form via the IT Service Desk. The Times Digital Archive is also available for text and data mining to any member of the University via the Gale Digital Scholar Lab.
Text and data mining using the Gale Digital Scholar Lab
The Gale Digital Scholar Lab allows users to build a content set from the Gale archives the University has access to. It then provides options to clean the data and analyse the corpus using a range of tools. The archives we have access to using this tool are:
- Archives of Sexuality and Gender
- Archives Unbound
- British Library Newspapers
- Daily Mail Historical Archive
- Eighteenth Century Collections Online
- Mirror Historical Archive, 1903-2000
- Nineteenth Century Collections Online
- Nineteenth Century UK Periodicals
- Picture Post Historical Archive, 1938-1957
- Political Extremism and Radicalism
- Refugees, Relief, and Resettlement: Forced Migration and World War II
- Seventeenth and Eighteenth Century Burney Newspapers Collection
- Slavery and Anti-Slavery: A Transnational Archive
- The Economist Historical Archive
- The Illustrated London News Historical Archive, 1842-2003
- The Making of the Modern World
- The Sunday Times Historical Archive
- The Telegraph Historical Archive
- The Times Digital Archive
- The Times Literary Supplement Historical Archive
- U.S. Declassified Documents Online
- Women's Studies Archive
The Gale Digital Scholar Lab can be accessed via FindIt@Bham; ensure you are logged into FindIt@Bham first. Once the Gale Digital Scholar Lab page has launched, click the ‘Login/create account’ button. You can then use the ‘Institutional Login’ option to log in with your University account.
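To give a sense of what the cleaning and analysis steps described above involve, the following is a minimal Python sketch of one common approach: normalise the text, drop very common words, and count term frequencies. It is an illustration only and not the Lab's own code. The stop-word list, the function names and the sample sentences are all hypothetical, and it assumes the documents are already available as plain-text strings.

```python
import re
from collections import Counter

# Hypothetical, deliberately tiny stop-word list for illustration only.
STOP_WORDS = {"the", "a", "an", "of", "and", "in", "to", "is", "was"}


def clean(text: str) -> list[str]:
    """Lowercase the text, keep runs of letters as tokens, drop stop words."""
    tokens = re.findall(r"[a-z]+", text.lower())
    return [t for t in tokens if t not in STOP_WORDS]


def term_frequencies(documents: list[str]) -> Counter:
    """Count how often each cleaned token occurs across all documents."""
    counts = Counter()
    for doc in documents:
        counts.update(clean(doc))
    return counts


# Made-up sample documents standing in for a small content set.
sample_docs = [
    "The Parliament met in London.",
    "Parliament was dissolved, and the election followed.",
]
frequencies = term_frequencies(sample_docs)
print(frequencies.most_common(3))
```

Real digitised archives usually need further cleaning, for example to handle optical character recognition (OCR) errors, so treat this as a starting point rather than a complete workflow.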
Further archives potentially available for text and data mining
As part of the TDM Project, we are looking at archives that Library Services has previously purchased on hard drives and at how researchers could use them for text and data mining. These archives are currently hosted on a range of hardware that is not networked, and they would need to be loaded onto BEAR before they could be used.
If you are a member of staff or a postgraduate researcher (PGR) and need to use any of the archives listed below, please contact copyright@contacts.bham.ac.uk.
- The Economist Historical Archive 1843-2003
- Picture Post Historical Archive
- The Making of the Modern World Digital Collection
- The Making of the Modern World Part II: 1851-1914
- The Financial Times Historical Archive
- The Illustrated London News Historical Archive: 1842-2003
- State Papers Online Part I The Tudors, 1509-1603. State Papers Domestic
- State Papers Online Part II The Tudors, 1509-1603. State Papers Foreign
- State Papers Online Part III The Stuarts, 1603-1714. State Papers Domestic
- State Papers Online Part IV The Stuarts, 1603-1714. State Papers Foreign
- State Papers Online 1509-1714 Updated 2013 Part II, III and IV
- TLS Historical Archive
- DNSA The Berlin Crisis
- DNSA CIA Covert Operations 2
- Vogue Archive
- Documents on British Policy Overseas
- DNSA China and the US, 1960-1998
- DNSA CIA Covert Operations 1
- DNSA The Cuban Missile Crisis, 1962
- DNSA Guatemala
- DNSA Electronic Surveillance
- DNSA Iran: The Making of U.S. Policy 1977-1980
- DNSA Iraqgate: Saddam Hussein, U.S. Policy and the Prelude to the Persian Gulf War 1980-1994
- DNSA Presidential Directives Part I
- DNSA Presidential Directives Part II
- DNSA The Soviet Estimate: US Analysis of the Soviet Union 1947-1991
- DNSA U.S. Intelligence Community: Organization, Operations and Management, 1947-1989
- EEBO
- The Guardian and The Observer
- Cecil Papers