Universal Bibliographic Control and International MARC Core ProgrammeUNIMARC: An IntroductionUnderstanding the UNIMARC format1. What is MARC ?MARC is an acronym for Machine Readable Catalogue or Cataloguing. This general description, however, is rather misleading as MARC is neither a kind of catalogue nor a method of cataloguing. In fact, MARC is a short and convenient term for assigning labels to each part of a catalogue record so that it can be handled by computers. While the MARC format was primarily designed to serve the needs of libraries, the concept has since been embraced by the wider information community as a convenient way of storing and exchanging bibliographic data.The original MARC format was developed at the Library of Congress in 1965-6 leading to a pilot project, known as MARC I, which had the aim of investigating the feasibility of producing catalogue data in machine-readable form. Similar work was in progress in the United Kingdom where the Council of the British National Bibliography had set up the BNB MARC Project with the remit of examining the use of machine-readable data in producing the printed British National Bibliography (BNB). These parallel developments led to Anglo-American cooperation on the MARC II project which was initiated in 1968. MARC II was to prove instrumental in defining the concept of MARC as a communication format. MARC II established certain principles which have been followed consistently over the years. In general terms, the MARC communication format is intended to be:
Despite cooperation there emerged several versions, e.g. UKMARC, INTERMARC and USMARC, whose paths diverged owing to different national cataloguing practices and requirements. Since the early 1970s an extended family of more than 20 MARC formats has grown up. Differences in data content means that editing is required before records can be exchanged. One solution to the problem of incompatibility was to create an international MARC format (UNIMARC) which would accept records created in any MARC format. So records in one MARC format could be converted into UNIMARC and then be converted into another MARC format. The intention was that each national agency would need to write only two programs - one to convert into UNIMARC and one to convert from UNIMARC - instead of one program for each other MARC format, e.g. INTERMARC to UKMARC, USMARC to UKMARC etc. So in 1977 the International Federation of Library Associations and Institutes (IFLA) published UNIMARC : Universal MARC format, stating that "The primary purpose of UNIMARC is to facilitate the international exchange of data in machine-readable form between national bibliographic agencies". This was followed by a second edition in 1980 and a UNIMARC Handbook in 1983. All focussed primarily on the cataloguing of monographs and serials and took advantage of international progress towards the standardisation of bibliographic information reflected in the International Standard Bibliographic Descriptions (ISBDs). In the mid-1980s it was seen necessary to expand UNIMARC to cover documents other than monographs and serials. So a new description of the format - the UNIMARC Manual - was produced in 1987. By this time UNIMARC had been adopted by several bibliographic agencies as their in-house format. So the statement of purpose was amended to include "UNIMARC may also be used as a model for the development of new machine-readable bibliographic formats". Developments did not stop there. Increasingly a new kind of format - an authorities format - was used. Previously agencies had entered an author's name into the bibliographic format as many times as there were documents associated with him or her. With the new system they created a single authoritative form of the name (with references) in the authorities file; the record control number for this name was the only item included in the bibliographic file. The user would still see the name in the bibliographic record, however, as the computer could import it from the authorities file at a convenient time. So in 1991 UNIMARC/Authorities was published. By that year users of UNIMARC realised that the occasional rewriting of manuals was not enough. What was needed was continuous maintenance. The Permanent UNIMARC Committee came into being that year, charged with regularly supervising the development of the format. In maintaining the format, care is taken to make changes upwardly compatible, i.e. no records created before a change would be invalid after it. The latest development in the format has come about because of the requirement of European Community countries to produce unified specialised catalogues of their records. In order to produce such unified catalogues they had to adopt a common format for them - UNIMARC.
2. The UNIMARC formatThe UNIMARC format, like any other version of MARC, involves three elements of the bibliographic record:
Record structureThe record structure is designed to control the representation of data by storing it in the form of strings of characters known as fields. All data in the record must be stored using one or more character sets. Since computers can store and manipulate only numbers, each symbol, alphabetical character etc. is assigned a number following the rules of a particular character set. For example, one character set assigns the number '75' to 'K'. UNIMARC allows the use of certain character sets, approved by the International Organization for Standardization (ISO). The record structure established by UNIMARC is an implementation of the relevant standard: Format for bibliographic information interchange on magnetic tape (ISO 2709-1981). This structure utilises record labels and directories. As few users need concern themselves with such items, the description below covers the way a cataloguer sees the record.
Content designationCertain conventions are followed in order to identify the data elements within records. Such elements which include author, title and subject access are further characterised where necessary. This supports the manipulation of the data for a variety of purposes:
For an example of such manipulation, see the "Displaying citations" section later in this document. In addition, UNIMARC records may be formatted for visual display on a VDU, for output on CD-ROM or fiche and for printing out as hard copy. In general, UNIMARC provides content designation only for data which is applicable to all copies of a work. However, information which applies only to some copies (or even a single copy) of a work may be of interest beyond the holding institution. In such cases UNIMARC assigns specific fields for such details. These fields are also available for cases where the information is for in-house purposes only.
Data contentThe content is the data which is stored in the fields within the record. Data can be coded data or bibliographic data.
3. The role of UNIMARCInitially, UNIMARC was used for the exchange of records on magnetic tape but has since been adapted for use in a variety of exchange and processing environments.The UNIMARC format is available to all agencies concerned with the exchange of bibliographic information. In practice, though, UNIMARC is orientated towards the requirements of libraries. The fields, which are identified by three-character numeric tags, are arranged in functional blocks. These blocks organise the data according to its function in a traditional catalogue record. In the table below, fields 0-- - 1-- hold the coded data while fields 2-- - 8-- contain the bibliographic data:
In addition to the 9-- block any other tag containing a 9 is available for local implementation. The fields defined by UNIMARC provide for different kinds and levels of information. This can be shown by looking at a typical record in the UNIMARC format. 4. Anatomy of a UNIMARC recordExample: Alain-Fournier's novel "Le Grand Meaulnes", translated into English as "The lost domain".
001 0192122622@ Before looking at the MARC fields in detail, it is important to understand how the coding defines the data content. This is done by means of field enumerators which are composed of the following elements:
The role of the field enumerators is explained with reference to the preceding record.
Details001 0192122622@ 001 (the record identifier) is a unique number or combination of letters and numbers that serves to identify the record in a file. It is almost the only field not to have indicators.
010##$a0-19-212262-2$d£12.95@
020##$aUS$b59-12784@
1 2 3 012345678901234567890123456789012345 100## 19590202d1959####|||y0engy0103####ba@ This is a fixed-length field where the meaning of a character is dependent on its position. Hence the transcription above is preceded by numbers showing the character positions (cp).
1011#$aeng$cfre@
102##$aGB$ben@
105##$aac######000ay@
2001#$a{NSB}The {NSE}lost domain$fAlain-Fournier$gtranslated from the French by Frank
Davison$gafterword by John Fowles$gillustrated by Ian Beck@
210##$aOxford$cOxford University Press$d1959@
215##$aix,298p,10 leaves of plates$cill, col.port$d23cm@
311##$aTranslation of: Le Grand Meaulnes. Paris : Emile-Paul, 1913@
454#1$1001db140203$150010$a{NSB}Le {NSE}Grand Meaulnes$1700#0$aAlain-Fournier$f1886-
1914$1210##$aParis$cEmile-Paul$d1913@
50010$a{NSB}Le {NSE}Grand Meaulnes$mEnglish@
606##$aFrench fiction$2lc
676##$a843/.912$v19@
680##$aPQ2611.O85@
700#0$aAlain-Fournier,$f1886-1914@
702#1$aDavison,$bFrank@
801#0$aGB$bWE/N0A$c19590202$gAACR2@
98700$aNov.1959/209@ For an example of this record without the fields, subfields etc., see Displaying citations below.
5. Putting UNIMARC to workBibliographic records in the UNIMARC format are designed for use in automated library systems. Depending on the versatility of the system a range of related functions can be supported by manipulating the data. Two such functions are information retrieval and displaying citations.
Information retrievalIn the UNIMARC format each data element is identified for the purposes of information retrieval. Using computer software, it is possible to search on most of the MARC fields and subfields in the record. For example:
While each record in the UNIMARC format is a discrete entity, a catalogue consisting of many such records becomes a database enhanced with the capacity to respond to highly specific or comprehensive search strategies. The range of search options will, of course, depend on the kind of software employed.
Displaying citationsUNIMARC offers a choice of formats for displaying records. Naturally, readers will not want to consult the full MARC record simply because the format is intended not for human perusal but for processing by computer. A sympathetic display for use by readers is the Catalogue card format: 843.912 (DC19)
Alain Fournier, 1886-1914 [Le Grand Meaulnes. English]. The lost domain / Alain-Fournier; translated from the French by Frank Davison; afterword by John Fowles; illustrated by Ian Beck. - Oxford: Oxford University Press, 1959. - ix,298p,10 leaves of plates; ill, col.port; 23cm Translation of: Le Grand Meaulnes. Paris : Emile-Paul, 1913 ISBN 0-19-212262-2: £12.95 1.Ti 2.The lost domain 3.Davison, Frank 4.French fiction B59-20618 Pressmark: Nov.1959/209
This citation represents a card in the classified sequence, which will be filed under 843.912. The
second to last line shows the other headings under which the record will appear in a library catalogue, and the national bibliography number. The first tracing is an abbreviation for "Title". In this particular layout the author's name appears on a separate line above the title etc. With the exception of 7-- fields (which present problems and so need the cataloguer to put in the punctuation) most of the punctuation is supplied by the computer as it translates subfield codes into punctuation and typeface.
6. Maintaining UNIMARCThe interests of users of UNIMARC records are represented by the Permanent UNIMARC Committee (PUC), which plays an important role by acting as a focus for user views and reactions when amendments to UNIMARC are proposed. It does this on behalf of IFLA UBCIM, which is ultimately responsible for UNIMARC.7. UNIMARC AuthoritiesThe UNIMARC Authorities format is designed to allow an agency to hold in one place the authoritative form of name of an author, corporate body name etc., together with references from other forms of name. Such data is linked to a bibliographic record by subfield $3 (Authority record number) in fields in the 7-- block of the bibliographic format.The data can be embodied in the bibliographic record either at the time of creation or when a user views that record. There are three types of authority record, coded in the record label as "x" (authority entry record), "y" (reference entry record) and "z" (general explanatory entry record).
Structure of the UNIMARC Authorities format
Anatomy of UNIMARC authorities recordsThe following are two typical examples of simple records:
As both are similar, only the second will be explained: 001 B329638@ 001 is the record identifier 100## $a19810716aengy0103####ba@ The general processing data field has the same sort of structure as the bibliographic 100 field. It gives the date entered on the file (16th July 1981). The record is "a" established (i.e. not provisional). The language of cataloguing is English. The code "y" shows that no transliteration system was used. In the eight-position character set part "0103" shows that the basic Latin and the extended Latin sets were used; the four blanks show that no others were used. The script of cataloguing is the Latin alphabet ("ba"). 152## $aAACR2@ 152 is the Rules field. The record follows the Anglo-American cataloguing rules, 2nd edition. Such information is held in 801 $g in the bibliographic format. 200#1 $aInnes,$bMichael@ Since the field is for a personal author, the indicators and subfield codes follow field 700 in the bibliographic format. 500#1 $0For works written under his pseudonym see $aStewart,$bJ.I.M. $3A369875@ This is a "See also" reference for a personal author; so the indicators and subfield codes follow field 700 in the bibliographic format. This includes the $3, which holds the record number for the J.I.M Stewart heading. There is the addition of $0 (zero) for "Instruction phrase". 801#0 $aUK$bBL$c19810629@ Like the same field in the bibliographic format, this gives the country, institution and date of latest transaction for an originating agency (second indicator 0). 810## $aWho's who@ This field gives the source in which the data was found - in this case a biographical dictionary.
Other Authorities format fields
The other equivalents of the bibliographic 7-- fields are 210 Corporate or Meeting Name (as 71-), 215 Territorial or Geographic Name (as 71-), 220 Family Name (as 72-). Field 676 contains the Dewey Decimal classification number, as in the bibliographic format, but with the addition of subfield $c Explanatory terms.
250## $aParsley@ 250 is used for Topical subjects as headings (like 606 in the bibliographic format). When a document on the herb is about parsley as a plant, the class number should be 583.48; when it is about parsley as food, the class number should be 641.655. The 7-- block is used to hold a form of name in a different language or script.
In a library's German language catalogue the authoritative form for the geographic name "Switzerland" is the German one (A234566). But this entry is linked to similar ones in French and Italian. A reader searching for books on "Svizzera" will be shown those where the subject is "Schweiz". In a library where one language predominates, the authoritative form will be in that language, with "See" references (415 fields) from the name in other languages 8. Short bibliography8 Short bibliographyISBD(G) : General International Standard for Bibliographic Description .... - Revised ed. ; prepared by the ISBD Review Committee Working Group set up by the IFLA Committee on Cataloguing.- München, London, New York, Paris : K G Saur, 1992.ISO 1001-1986. File structure and labelling of magnetic tape for information interchange. ISO 2709-1981. Format for bibliographic information interchange on magnetic tape. UNIBASE : UNIMARC Demonstration Database. - Frankfurt : IFLA Universal Bibliographic Control and International MARC Programme, 1994. UNIMARC in Theory and Practice : Proceedings of the Workshop Held in Sydney, Australia, 1988. - London : IFLA Universal Bibliographic Control and International MARC Programme, 1989. (Available from K G Saur). UNIMARC/Authorities. - München, London, New York, Paris : K G Saur, 1991. UNIMARC/CCF : Proceedings of the Workshop Held in Florence, 5-7 June 1991. - München, London, New York, Paris : K G Saur, 1993. UNIMARC and CDS/ISIS: Proceedings of the Workshops Held in Budapest, 21-22 June 1993 and Barcelona, 26 August 1993.- München, London, New Providence, Paris : K G Saur, 1994. UNIMARC Manual : Bibliographic Format. - 2nd ed. - München, London, New Providence, Paris : K G Saur, 1994.
9. Glossary
| |||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
![]() |
|||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
| |||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Latest Revision: March 3, 1999 |
Copyright © 1995-2000
International Federation of Library Associations and Institutions www.ifla.org |