~ PhiloBiblon (sm)(tm) Home Page

Welcome to PhiloBiblon (sm)(tm)

PhiloBiblon image

Volume 2008,
Number 2 (August)
[ISSN 1096-6609]


From Abraham Ortelius. Theatrum orbis terrarum. Antwerp: Apud Ant. Coppenium Diesth, 1574, betw. pp. 10, 11. Courtesy of The Bancroft Library, University of California, Berkeley [G113.08.1574]

You have reached the Home Page of

PhiloBiblon

a project to construct a bio-bibliographical database of early texts produced in the Iberian Peninsula and to provide access to this information through an online service on the WorldWideWeb / WWW.

PhiloBiblon consists of three separate bibliographies, each of which has its own Home Page:

--oOo--

BETA Image

BETA / Bibliografía Española de Textos Antiguos

Texts in Spanish.
BITAGAP Image

BITAGAP / Bibliografia de Textos Antigos Galegos e Portugueses

Texts in Portuguese or Galician.
BITECA Image

BITECA / Bibliografia de Textos Catalans Antics

Texts in Catalan.

--oOo--

Research in all three of the PhiloBiblon bibliographies may be done through the

PhiloBiblon Search Engine

Máquina de búsqueda de PhiloBiblon (en español)

Help: Please Read Before Beginning Initial Search

The compilers of PhiloBiblon are very pleased to announce that the National Endowment for the Humanities has awarded the project an eighteen-month grant (1 June 2007 - 31 December 2008) to enhance web access to the data in its component bibliographies. In addition to the present access to descriptions of manuscripts and printed editions, the enhanced web site will provide access to texts and witnesses, to biographical data, and to secondary bibliography, among many other improvements.

Resource Lists providing bibliographies and live hypertext links -- to sources of use to researchers working in this area -- may be found on each of the three separate Home Pages shown above or, for information about PhiloBiblon itself and general information on Iberian Medieval Studies, on the

PhiloBiblon Resource List

--oOo--

PhiloBiblon

OUTLINE

I. Introduction
II. PhiloBiblon: Advanced Revelation database version in comparison with the Web version
III. PhiloBiblon: The Advanced Revelation Version
IV. PhiloBiblon: The Web-based Version
V. PhiloBiblon's Component Bibliographies, BETA, BITAGAP, BITECA
VI. Language
VII. Acknowledgments

I. Introduction

PhiloBiblon is a bio-bibliographical database management system designed to catalog the primary sources, both manuscript and printed, of a given corpus of texts, along with relevant secondary bibliography and authority files for persons, places, and institutions mentioned in the database. The web-based version gives partial access to three of the bibliographies that have used PhiloBiblon to catalog the vernacular sources of medieval Iberian culture: BETA (Bibliografía Española de Textos Antiguos), BITECA (Bibliografia de Textos Antics Catalans), BITAGAP (Bibliografia de Textos Antigos Galegos e Portugueses).

I.1. History of PhiloBiblon

Return to Outline

I.1.1. Madison

PhiloBiblon is based upon and derived from BOOST (the Bibliography of Old Spanish Texts, which was originally established in 1974 as part of a larger project, the computer-based Dictionary of the Old Spanish Language, at the Medieval Spanish Seminary of the University of Wisconsin. The purpose of BOOST was to aid in the selection of the corpus upon which that dictionary was to be based; and the first two editions (1975, 1977), produced in-house by John Nitti, Jean Gilkison, and Anthony Cárdenas, focussed almost exclusively on that purpose.

The first edition of BOOST, containing 966 entries, made use of a standard text-oriented database management system, FAMULUS, running on the Univac 1110 at the Madison Academic Computing Center of the U. of Wisconsin. FAMULUS, originally written at Berkeley in 1967 for the Pacific South West Forest and Range Experiment Station (USDA) for management of personal bibliographical files, takes as input standard ASCII text files created with any editor, and then is used to batch sort and index the entire file. The limitations it places on the organization of more complex materials are severe: It allows only ten fields (variable length), and a maximum of 4000 characters per record. In order to compensate for these limitations, related but in fact dissimilar data elements were grouped in one field. Thus AUTH included the author's title as well as his name and lumped together authors with editors and translators; while SPRL (Specific PRoduction Location) contained both the city of origin and the name of the printer or scribe.

FAMULUS allows sorting on any and all fields in any order and indexing of any field, although index terms are limited to a maximum of forty characters. The order of fields within the record and of records within the file can be changed and the entire data base resorted and re-indexed. The final printed version of BOOST consisted of the complete file ordered hierarchically (author / title / date / present location) followed by indices of each field and vocabulary indices of the author and title fields. It was printed as photo-offset copy of computer printout, all in upper-case letters.

At the same time that editorial control of BOOST passed to a team of outside scholars (Charles Faulhaber, Angel Gómez Moreno, David Mackenzie, Brian Dutton), the Medieval Spanish Seminary at Madison under John Nitti continued to provide technical support, re-writing FAMULUS into a customized data base management system closely resembling it, but without FAMULUS's limitations on the number of fields, their length, and the number of index terms. Thus the third edition of BOOST (1984) contained fourteen data elements and allowed index terms of up to 159 characters.

Other technical changes involved the separation or collapsing of individual data fields: the addition of two new fields:

Thus the date formulas in OPDT and SPDT, Original and Specific PRoduction Dates (e.g., 15th mid, 15th end) were not only ambiguous but were sorted alphanumerically rather than chronologically. To avoid the latter problem, in particular, they were changed to absolute numerical values (e.g., "1440-1460" instead of "15th mid").

With the change in the sorting hierarchy (from Author / Title to a topographical arrangement [City / Library / Shelfmark]), it became quite difficult to control the relationship between corresponding entries in BOOST2 and BOOST3. The Control NUMber (CNUM), the record identification number, was pressed into use as the linking element. All bibliographical information was collapsed together from the NOTE and BIBS fields of BOOST2 (which had been used to differentiate between bibliographical sources cited directly from those cited second- or third-hand) and incorporated into a single data element, BIBL. Unfortunately, this made it difficult to control the use of secondary bibliography.

The fusion of GTIT, the General or normalized TITle field, and STIT, the Specific or variant TITle field, was a mistake of a similar sort made for the same reason, the desire to create a single title index. In two other instances, however, data elements were divided in order to reflect differences in the data they contained. Thus translators were separated from the AUTH field and given one of their own. Similarly, printers and scribes were removed from SPRL and placed in a separate field. Both of these changes were correct tactically and strategically (although it would have been even better further to separate printers from scribes.)

I.1.2. Berkeley

The limitations of a flat file database management system like FAMULUS were thrown increasingly into relief by the availability of more sophisticated systems. The next technical change was the 1985 port of BOOST from the Madison system to another main frame database management system, SPIRES (Stanford Public Information Retrieval System). SPIRES's capabilities made it an attractive solution in its own right: it was interactive, text-oriented, and allowed repeatable variable-length data elements. Ordering, searching, and report generation were reasonably simple operations. Data entry and correction were more difficult than in older system, however.

Finally, in 1987, thanks to a grant from IBM, BOOST was ported into Revelation (now Advanced Revelation), a high-end, DOS-based relational database management system with variable length, repeatable fields that can be linked together into repeatable data structures, no limitation on the number of records or number of fields, and very little limitation on their size. Records are limited to 64K, while any single field within a record is also limited to 64K. A primary consideration in the selection of a relational system was the desire to make the database easier to maintain and update, by allowing changes to be made in one record and then propagated to others related to it rather than by making the same change in multiple records.

The original 1987 database design, the work of Charles B. Faulhaber and programmer John May, has evolved constantly since then, thanks to three major NEH grants as well as to smaller ones from other sources. In the process, the database management system itself was divorced from BOOST and made available for use with other corpora. It was baptized PhiloBiblon, from Richard de Bury's 14th-c. description of an ideal library. PhiloBiblon's structure now consists of ten tables with over 650 separate data fields.

II. PhiloBiblon: Advanced Revelation database version in comparison with the Web version

Return to Outline

In 1993 the Advanced Revelation version of PhiloBiblon and its three component bibliographies became available on CD-ROM disk as part of disk 0 of ADMYTE (Archivo Digital de Manuscritos y Textos Españoles), produced by Micronet, S.A. (Madrid). A new, revised and expanded edition of the Advanced Revelation version of PhiloBiblon was published on CD-ROM in 1999 by The Bancroft Library. It is now out of print.

The CD-ROM offers both advantages and disadvantages vis-à-vis the Web version: While the data on the CD-ROM version are more complete, they are less current: the CD-ROM database was closed in March of 1999. Thus the numerous materials added since then are not yet available on a stand-alone system. The CD-ROM version has much more sophisticated search and sorting capabilities than the Web-based version; however, it is significantly more difficult to learn how to use, which is a serious obstacle for the casual user. Moreover, the CD-ROM version can be updated only infrequently (for financial, not technical reasons), while the Web-based version can be updated regularly.

III. PhiloBiblon: The Advanced Revelation Version

Return to Outline

PhiloBiblon is designed to record data concerning texts in any given field or discipline (thematic, chronological, language, etc.): the primary printed or manuscript sources themselves, later attestations, the individuals involved with their production and/or transmission, and the institutions holding the materials, supported by a wide range of search possibilities and tailored report opportunities. It is structured on the basis of ten interrelated tables, each of which serves as an authority file for the other nine. Four are central to the system:

  1. UNIFORM TITLE: normalized version of author's name, text title (with frequently seen variants), and incipit/explicit, as well as dates (composition, translation, copying, etc.), language(s) of the text, associated individuals, relationships with other texts.

  2. MANUSCRIPT/EDITION: primary source manuscripts and editions (present location, shelf mark or call number, codicological or bibliographical description, previous owners and other related individuals, relationships with other manuscripts or editions)

  3. COPIES: adjunct to MANUSCRIPT/EDITION, for the recording of additional copies of a particular printed edition (present location, call number, condition, and previous owners)

  4. ANALYTIC: description of a given text found in a specific manuscript or printed source (exact form of title, incipit, explicit), together with specification of its folio or page location within the source and identifying characteristics in comparison with other transcriptions or editions. So called because it analyzes the contents of a given manuscript or printed edition.

Six other tables complement and serve as authority files for the primary tables:

  1. BIOGRAPHY: biographical information on individuals related in any way with the cataloged texts and primary sources (authors, translators, scribes, printers, patrons, binders, illuminators, current and previous owners, etc.)

  2. GEOGRAPHY: place, region, political or ecclesiastical division (county, duchy, diocese), kingdom, and nation names associated in various manners with any given text, manuscript, edition, individual, or institution

  3. INSTITUTIONS: institutions (e.g., convents, monasteries, libraries, religious orders, universities) related in any way with the cataloged texts or individuals in BIOGRAPHY (including previous owners or publishers)

  4. LIBRARY: libraries which hold the primary sources or copies of items in the secondary bibliography (address, telephone, hours, facilities, contact individuals)

  5. SUBJECT: authority files for subject headings, with a thesaurus of related headings, broader headings, narrower headings, and rejected headings

  6. BIBLIOGRAPHY: Secondary bibliography (editions, descriptions, studies, and critical appreciations) for all of the above tables

Upon entering PhiloBiblon, one of the extant bibliographies is chosen for default display. Access is then provided to popup menus which allow immediate access

Each of the interrelated authority files ("tables" in relational database nomenclature) is displayed in a series of linked "windows" containing fields for recording information appropriate to the table in question. For example, the Ms/Ed table (MANUSCRIPT/EDITION) records data concerning primary source manuscripts or editions in eight windows. A unique code number or record key (manid) is established automatically when the record is created and serves to link it to records in any other related table.

Many of the fields are associated and multi-valued. For example date, location, printer, and publisher are all linked in a repeatable data structure. This allows for the recording of differing opinions when various sources offer conflicting information. Thus authorities may differ on whether a given edition was printed in Sevilla or Burgos, and it is important to be able to record both places as well as their supporting authorities. Similarly, a complete list of the various watermarks that are found in a given manuscript can be linked to the leaves on which they are found and the authorities (e.g., Briquet) that record similar watermarks. In each such case, a qualifier field allows for the indication of the degree of certainty, and a source or basis field shows the authority that provides the data.

Many fields are filled in on-the-fly with data from related authority files (keyed to the corresponding code number in the appropriate file. Thus in order to link a given record to a secondary bibliographical source, it suffices to type the author's name in a pop-up index window and select the relevant record from BIBLIOGRAPHY, where the full bibliographic citation is maintained. Other fields supply information about data recorded in other files. Thus NO_CNUM shows the number of individual texts found in the particular Ms/Ed as recorded in the ANALYTIC file.

Navigation among the various related files (e.g., from MS/Ed to ANALYTIC to UNIFORM.TITLE to BIOGRAPHY) is accomplished by use of various keyboard commands or mouse clicks. Searches for specific types of data are carried out by through three different mechanisms:

The command line, in addition to defining and sorting data, also allows for the creation of ad hoc reports, for example, of all illuminated parchment manuscripts between given dates, of incipits or explicits containing a specific word, or of all works written or translated by a given author ordered by title or by date. In addition there are default formats for generating reports on information in Ms/Ed, UNIFORM.TITLE, ANALYTIC, and BIBLIOGRAPHY.

One of the things we have learned in the course of designing PhiloBiblon is that a program like this is never finished: the underlying database management software is upgraded, thereby requiring modifications in the application in order for it to remain current. As one gains experience, modifications in the data structure suggest themselves. Thus we initially believed that only a few fields would need qualifiers and bases to indicate the degree of certainty and the authority for the data. We now believe that qualifiers and bases are needed across the board and hope, eventually, to include them for all data fields.

PhiloBiblon, like the bio-bibliographies that it has enabled, is still a work in progress. Nevertheless, following the well-known proverb, "Lo mejor es enemigo de lo bueno," we believe that the Web-based version can provide a useful service to Hispano-medievalists.

IV. PhiloBiblon: The Web-based Version

Return to Outline

The version of PhiloBiblon currently available on the Web differs in several important aspects from the Advanced Revelation database version. The information available in the Web version represents only a portion of that currently available in the Advanced Revelation database version. The Web version is based on a complete data dump of the Ms/Ed table, augmented with information drawn from the UNIFORM.TITLE, ANALYTIC, BIBLIOGRAPHY, and COPIES tables. This means that Web users can query the data for information about a given text, a given manuscript, or the specific copy of a given text in a given manuscript.

Because of the way the information in the Web version of PhiloBiblon is currently structured, it is much easier to locate and print or save descriptions of a given manuscript or printed edition than of a given text. We hope to remedy that situation, eventually, by creating a data dump on the basis of the UNIFORM.TITLE table. Such a data dump would list the invariant information about a given text from UNIFORM.TITLE, followed by a list of the copies of that text (taken from ANALYTIC) with an abbreviated description of the manuscript or printed edition containing each copy of the text (from Ms/Ed).

The Web version will be updated periodically, no more frequently than once a month nor less frequently than once a quarter.

What is missing from the Web version is the entire range of information in the related tables, which serve essentially as authority files for the primary tables: GEOGRAPHY, BIOGRAPHY, INSTITUTIONS, LIBRARY, and SUBJECT. We hope to be able to add access to them at a later date.

V. PhiloBiblon's Component Bibliographies, BETA, BITAGAP, BITECA

Return to Outline

Three teams of scholars, working on medieval Spanish, Catalan, and Galician and Portuguese literature, have been using PhiloBiblon to create bio-bibliographical databases of the respective literatures:

While the basic parameters of each project are similar -- a bio-bibliographical listing of the medieval literary texts produced in their respective languages -- the specific details vary from one to another. Thus in BETA and BITAGAP the general cut-off date of composition for listing texts is 1500, while for BITECA it is the death of Ferdinand the Catholic (Ferran II) in 1516. For more information see the specific introduction to each bibliography.

There has been little concerted attempt to coordinate data among the three teams. Thus discrepancies will be found, for example, in the uniform titles of texts originally written in Latin. In the case of translations from one Iberian language into another, however, the team describing the translated text tends to defer to the expertise of the team dealing with the original. There has there been no systematic attempt to copy all of the information from the authority files of one bibliography into those of another. Thus the extensive information on Iñigo López de Mendoza, Marqués de Santillana, found in BETA (bioid 1031) has not been reproduced in BITECA (bioid 1543) nor in BITAGAP (bioid 1543).

VI. Language

Return to Outline

Each of the three bibliographies uses its target language (Catalan, Spanish, Portuguese) not only for data but also for descriptive text. Similarly, each of the three bibliographies tends to follow the scholarly and bibliographical conventions peculiar to its own scholarly tradition.

VII. Acknowledgments

Return to Outline

PhiloBiblon is a collaborative project involving the joint efforts of a number of individuals and institutions.

Principal developers for PhiloBiblon are:

Institutions which have contributed significantly to the development of PhiloBiblon include:

--oOo--

WWW access to and use of PhiloBiblon are free of charge. Reproduction of any materials found here is subject to the restrictions found in our Copyright Statement .

Best results will result from using latest - version WWW browsers which both render tables well and support JavaTM, but every effort will be made to present a generic service, accessible to any WWW viewing tool.

Readers are encouraged to send questions, comments, and any suggestions which they might have for the improvement of this site to Charles B. Faulhaber, email: cfaulhab@library.berkeley.edu.


Charles Faulhaber
University of California, Berkeley
10 November 1997

Decor


Copyright © 1997- by The Regents of the University of California, all rights reserved.
WWW site maintained at http://sunsite.berkeley.edu by manager@sunsite.berkeley.edu
Document designed by Jack Kessler, kessler@well.sf.ca.us
Document maintained by: Charles B. Faulhaber, cfaulhab@library.berkeley.edu.
Last dataload: 5 April 2008 (BITECA), 9 August 2008 (BITAGAP), 12 July 2002 (BETA).
Last HTML update: 9 August 2008.