About the texts

The Text Creation Partnership (TCP) partnered with three major commercial providers of digitally imaged historical books. Rather than start from scratch, the project was able to build on these enormous existing databases of page images and focus its energy and funds on transcription and markup.

The three text corpora were keyed from the three databases in question:

  • Early English Books Online (published by ProQuest)
  • Eighteenth Century Collections Online (published by Gale Cengage)
  • Evans Early American Imprints (published by the Readex division of Newsbank)

The result is a corpus of more than 70,000 transcribed and encoded historical texts, totaling more than a billion words, all of which can now be searched online. The scope of the effort is unprecedented and unmatched among digitization and text encoding projects of its kind, and the corpus represents a significant contribution both to primary-source history and to the documentation of the language itself.

Read more about each project

Early English Books Online (EEBO) text creation work

Eighteenth Century Collections Online (ECCO) text creation work 

Evans Early American Imprints (Evans) text creation work

Explore the three digital collections

Early English Books Online TCP (EEBO-TCP)
Phase 1: 25,000 texts, made available to everyone in 2015
Phase 2: 35,000 texts, made available to everyone in 2020

Eighteenth Century Collections Online TCP (ECCO-TCP)
Full text of about 3,000 books, freely available to everyone

Evans Early American Imprints TCP (Evans-TCP)
Full text of about 5,000 books available to everyone