Using our content

The output of the TCP consists of transcribed and encoded text, originally made available as SGML files and now mostly as XML. The wonderful thing about this is that you can do almost anything with the data. The challenge is that, to do so, you will very likely need additional tools, software, or other resources to process or render the texts in a way that’s useful to you.
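As a rough illustration of what that processing can look like, here is a minimal Python sketch that pulls the plain text of each paragraph out of an XML file using only the standard library. The sample document, the element names (`TEI`, `div`, `head`, `p`, `hi`), and the `paragraphs` helper are assumptions made for the example, not a description of any particular TCP file. Real files are much larger, use a richer tag set, and may declare an XML namespace, in which case the element lookup would need to include it.

```python
import xml.etree.ElementTree as ET

# Hypothetical sample, invented for illustration; not taken from a TCP file.
SAMPLE = """<TEI>
  <text>
    <body>
      <div type="chapter">
        <head>Chapter 1</head>
        <p>It was a <hi>dark</hi> night.</p>
        <p>The end.</p>
      </div>
    </body>
  </text>
</TEI>"""


def paragraphs(xml_string):
    """Return the plain text of each <p> element, flattening inline markup."""
    root = ET.fromstring(xml_string)
    result = []
    for p in root.iter("p"):  # assumes un-namespaced <p> elements
        # itertext() yields the text of the element and all its descendants
        text = "".join(p.itertext())
        # collapse runs of whitespace left over from line breaks and indentation
        result.append(" ".join(text.split()))
    return result


if __name__ == "__main__":
    for line in paragraphs(SAMPLE):
        print(line)
```

To run the same function on a file from disk, read the file's contents into a string first and pass that string to `paragraphs`.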

You can always search, browse, and read the texts in pre-defined ways using the web interfaces provided by ProQuest and the University of Michigan, but we hope that users will take advantage of the flexibility and richness of the TCP data to go beyond what these interfaces provide, building new resources and developing new scholarship out of the source text files.

This area contains links to supporting files, examples, and other material that might be useful. Please let us know if you have suggestions for additions!

Citing the TCP


Projects and Publications using TCP Texts

The Results of Keying Instead of OCR

Why Markup?

Mapping TCP Texts to MARC Records