Every research project on early Europe may benefit from exploring a well-informed research question computationally in an appropriate and representative collection of digital books. To make representative claims about findings from a computational analysis, the collection of books used in the analysis needs to be of a representative scope as well. Whether a researcher puts together a collection of books on their own, or whether they use a pre-existing collection, they have to be able to describe the scope of the collection used for analysis in relation to the total output of printed books from the relevant time period, region, language, or subject.
The following concepts in bibliography are important for this work:
The concepts are defined below.
Incunabula
From the Latin word cunae, meaning "cradle." Books, pamphlets, calendars, and indulgences printed from movable type in Europe prior to 1501, during the earliest years (infancy) of printing. The earliest example is the Gutenberg Bible believed to have been printed before 1456 in Mainz, Germany, by Johann Gutenberg, who is credited with the invention of modern printing. For other examples, see the Canon Missae (1458) of Johann Fust and Peter Schöffer (Columbia University Libraries). See also the online exhibition Printing in England from William Caxton to Christopher Barker (Glasgow University Library) and Hypnerotomachia Polyphili by Francisco Colonna printed by Aldus Manutius of Venice in 1499 (Royal Library of Denmark). Like medieval manuscripts, incunabula may contain hand-decorated initial letters and borders (see this copy of the first edition printed in France, courtesy of the Bibliothèque Nationale de France). For more information about incunabula, see The Infancy of Printing (Univ. of Wisconsin-Milwaukie Libraries) and Incunabula: Dawn of Western Printing (National Diet Library, Japan). The Incunabula Short-Title Catalogue developed the British Library is now available as a searchable online database. See also UC Berkeley's Incunabula Database. Singular: incunabulum. Synonymous with cradle books and incunables. See also: xylograph.
From: Online Dictionary of Library and Information Science (ODLIS)
Incunabula, printing from movable type prior to 1501, often do not follow conventions about title pages, author and publication date information, etc. For this reason they are cataloged in separate bibliographies, where they are described based on watermarks, quality of paper, likely printer, likely city, etc. and incunabula, printed before 1501 are often not included in retrospective bibliographies.
National Bibliography
As noted in the International Encyclopedia of Information and Library Science (Routledge, 2003), current national bibliography was originally undertaken by the book trade to facilitate commerce (as exemplified by the Cumulative Book Index in the United States) but since the early 1950s, the regular listing of new publications has been regarded as the proper function of a national agency, usually operating within the national library. Retrospective national bibliography has been accomplished in part by publication of the catalogs of the national library, based on collections established by copyright deposit. Projects such as the English Short Title Catalogue (ESTC) have extended retrospective bibliography beyond the holdings of national libraries.
From: Online Dictionary of Library and Information Science (ODLIS)
The International Federation of Library Associations (IFLA) has created a registry that gives quick factual information about when a country started with a concerted contemporary effort to identify publish yearly lists of publication. The registry is not complete, but allows a quick overview for selected European countries:
Andorra | Austria | Canada | Croatia | Cyprus | Czech Republic |
Denmark | Finland | France | Germany | Greece | Hungary | Italy |
Latvia | Lithuania | Malta | Norway | Serbia | Slovenia | Spain |
Sweden | Switzerland | United Kingdom
retrospective bibliography
The Bibliographie der nationalen Bibliographien = Bibliographie mondiale des bibliographies nationales = A world bibliography of national bibliographies, by Friedrich Domay, from 1987, which covers both retrospective and current projects up to 1980. While this work is out of date for modern national bibliographies, it is a key work for sources used in retrospective national bibliographies.
public domain
subscription
Also refers to the right of a library or library system to provide access to a bibliographic database, or other online resource, to its patrons under licensing agreement with a vendor, upon payment of an annual subscription fee and subject to renewal.
From: Online Dictionary of Library and Information Science (ODLIS)
A database to which Duke University Libraries subscribe must be accessed from the Research Databases A-Z list.
The very succinct definition from From: Niladri S, Dash, and S Arulmozi, "Definition of Corpus," in: History, Features, and Typology of Language Corpora. Singapore: Springer Singapore, 2018, 1-15 demonstrates that a text corpus is not simply the plain text behind a scan. Building a corpus implies a series of decisions that can change research outcomes:
Compatible to Computer
Operational in research and application
Representative of the source language
Processed by both man and machine
Unlimited in amount of language data
Systematic in the formation and text representation
The definition from the The Oxford Dictionary of English Grammar (2 ed.), 2014 also indicates the methodological approaches to working with a corpus
corpus (Plural corpuses, corpora.)
A collection of authentic spoken and/or written texts.
The study of the English language has been transformed in recent decades by the collection of large quantities of authentic texts in corpora on which grammatical, pragmatic, lexicographic, historical, etc. analyses can be based.
•• corpus-based: Research that is corpus-based is deductive in outlook in that it uses (annotated) corpora to test hypotheses about language.
•• corpus-driven: Research that is corpus-driven is inductive in outlook and takes unannotated corpus data as the starting point for investigation.
•• corpus linguistics: a methodological approach to the study of language by means of corpora, now usually in computerized form.