Text analysis & Labelling
AntConc is a freeware corpus analysis toolkit for concordancing and text analysis.
Callimachus is a regest of Greek and Latin papyri (and Coptic papyri containing Greek words).
CorpusSearch 2 supports corpus linguistics research. It is useful both for constructing syntactically annotated (parsed) corpora and for searching them. By running CorpusSearch on an appropriately annotated corpus, a user can automatically find and count lexical and syntactic configurations of any complexity, correct systematic errors, and code the linguistic features of corpus sentences for later statistical analysis.
ediarum (ed) is a solution consisting of several software components that allows scholars to edit transcriptions of manuscripts and prints in TEI-compliant XML, to provide them with a text and subject apparatus as well as indexes, and to publish them on the web and in print.
EpiDoc provides guidelines and tools for encoding scholarly and educational editions of ancient documents. It uses a subset of the Text Encoding Initiative's standard for the representation of texts in digital form and was developed initially for the publication of digital editions of ancient inscriptions. Its domain has expanded to include the publication of papyri and manuscripts.
Hypothesis is an online tool for annotating the web.
Lexos is a web-based tool to help you explore your favorite corpus of digitized texts.
MALLET is a Java-based package for statistical natural language processing, document classification, clustering, topic modelling, information extraction, and other machine learning applications to text. Topic models are useful for analyzing large collections of unlabeled text and the MALLET topic modelling package is used frequently in digital humanities textual analysis.
OxGarage is a web and RESTful service to manage the transformation of documents between a variety of formats.
oXygen is a suite of XML authoring, editing, and development tools.
Recogito is an online platform for collaborative document annotation. Recogito provides a personal workspace where you can upload, collect and organize your source materials - texts, images and tabular data - and collaborate in their annotation and interpretation.
Roma is a tool for working with TEI customizations.
Tapas (TEI) provides TEI publishing and repository services at low cost to those who lack institutional resources: faculty, students, librarians, archivists, teachers, and anyone else with TEI data who wants to store, share, and publish it.
TEI (Text Encoding Initiative) offers tools for creating, editing, transforming, and publishing TEI documents and schemas using the TEI P5 Guidelines.
TEITOK is a web-based platform for viewing, creating, and editing corpora with both rich textual mark-up and linguistic annotation.
TextGrid provides services and tools to create, manage, and edit your XML-based research data.
Transkribus is a platform for text recognition, image analysis, and structure recognition of historical documents.
Voyant is a web-based reading and analysis environment for digital texts.
XML Copy Editor is free software for editing XML and its associated technologies.