high
medium
low
unknown
(\-?[\d]+/\-?[\d]+)
unknown
inapplicable
[\-+]?\d+(\.\d+)?(%|cm|mm|in|pt|pc|px|em|ex|gd|rem|vw|vh|vm)
[\d]+(\.[\d]+){0,2}
(\p{L}|\p{N}|\p{P}|\p{S})+
egXML
indicates the person, or group of people, to whom the element content is ascribed.
provides an externally-defined means of identifying the entity (or entities) being
named, using a coded value of some kind.
(reference) provides an explicit means of locating a full definition for the entity being named by
means of one or more URIs.
gives a minimum estimated value for the approximate measurement.
gives a maximum estimated value for the approximate measurement.
where the measurement summarizes more than one observation
or a range, supplies the minimum value
observed.
where the measurement summarizes more than one observation
or a range, supplies the maximum value
observed.
names the unit used for the measurement
Suggested values include: 1] cm(centimetres) ; 2] mm(millimetres) ; 3] in(inches) ; 4] lines; 5] chars(characters)
cm
(centimetres)
mm
(millimetres)
in
(inches)
lines
lines of text
chars
(characters) characters of text
specifies the length in the units specified
indicates the size of the object concerned using a project-specific vocabulary combining
quantity and units in a single string of words.
characterizes the precision of the values specified by the other attributes.
where the measurement summarizes more than one observation, specifies the applicability
of this measurement.
Sample values include: 1] all; 2] most; 3] range
indicates whether or not the element
bearing this attribute should be considered to mark the end of
an orthographic token in the same way as whitespace.
supplies a pointer to some location defining a named
period of time within which the datable item is understood to
have occurred.
supplies the value of the date or time in a standard form,
e.g. yyyy-mm-dd.
specifies the earliest possible date for the event in
standard form, e.g. yyyy-mm-dd.
specifies the latest possible date for the event in
standard form, e.g. yyyy-mm-dd.
indicates the starting point of the period in standard form, e.g. yyyy-mm-dd.
indicates the ending point of the period in standard
form, e.g. yyyy-mm-dd.
indicates whether or not this element is selected by default when
its parent is selected.
true
This element is selected if its parent is selected
false
This element can only be selected explicitly, unless it is the
only one of its kind, in which case it is selected if its parent is selected.
identifies one or more declarable elements within the
header, which are understood to apply to the element bearing this
attribute and its content.
(organization) specifies how the content of the division is organized.
composite
composite content: i.e. no claim is made about the
sequence in which the immediate contents of this division
are to be processed, or their inter-relationships.
uniform
uniform content: i.e. the immediate contents of this
element are regarded as forming a logical unit, to be
processed in sequence.
indicates whether this division is a sample of the
original source and if so, from which part.
initial
division lacks material present at end in source.
medial
division lacks material at start and end.
final
division lacks material at start.
unknown
position of sampled material within original unknown.
complete
division is not a sample.
specifies whether or not the division is fragmented by
some other structural element, for example a speech which is
divided between two or more verse stanzas.
Y
(yes) the division is incomplete in some respect
N
(no) either the division is complete, or no claim is made as to its completeness.
I
(initial) the initial part of an incomplete division
M
(medial) a medial part of an incomplete division
F
(final) the final part of an incomplete division
describes the status of a document either currently or, when
associated with a dated element, at the time indicated.
Sample values include: 1] approved; 2] candidate; 3] cleared; 4] deprecated; 5] draft; 6] embargoed; 7] expired; 8] frozen; 9] galley; 10] proposed; 11] published; 12] recommendation; 13] submitted; 14] unfinished; 15] withdrawn
(certainty) signifies the degree of certainty associated with the intervention or interpretation.
(responsible party) indicates the agency responsible for the intervention or interpretation, for example an
editor or transcriber.
indicates the nature of the evidence supporting the reliability or accuracy of the
intervention or interpretation.
Suggested values include: 1] internal; 2] external; 3] conjecture
internal
there is internal evidence to support the intervention.
external
there is external evidence to support the intervention.
conjecture
the intervention or interpretation has been made by the editor, cataloguer, or
scholar on the basis of their expertise.
internal
there is internal evidence to support the intervention.
external
there is external evidence to support the intervention.
conjecture
the intervention or interpretation has been made by the editor, cataloguer, or
scholar on the basis of their expertise.
contains a list of one or more pointers indicating sources
supporting the given intervention or interpretation.
(identifier) provides a unique identifier for the element bearing the attribute.
(number) gives a number (or other label) for an element, which is not necessarily unique within
the document.
(language) indicates the language of the element content using a tag generated
according to BCP 47
(rendition) indicates how the element in question was rendered or presented in the source text.
points to a description of the rendering or presentation used for this element in the
source text.
provides a base URI reference with which applications can resolve relative URI
references into absolute URI references.
signals an intention about how white space should be
managed by applications.
default
the processor should treat white space according to the
default XML white space handling rules
preserve
the processor should preserve unchanged any and all
white space in the source
gives a name or other identifier for the scribe
believed to be responsible for this hand.
points to a full description of the scribe concerned, typically supplied by a person element
elsewhere in the description.
characterizes the particular script or writing style used by
this hand, for example secretary, copperplate, Chancery, Italian, etc.
points to a full description of the script or writing style used by
this hand, typically supplied by a scriptNote element
elsewhere in the description.
describes the tint or type of ink, e.g. brown, or other
writing medium, e.g. pencil
specifies how widely this hand is used in the manuscript.
sole
only this hand is used throughout the manuscript
major
this hand is used through most of the manuscript
minor
this hand is used occasionally in the manuscript
(MIME media type) specifies the applicable multimedia internet mail extension (MIME) media type
indicates the units used for the measurement, usually
using the standard symbol for the desired units.
Suggested values include: 1] m(metre) ; 2] kg(kilogram) ; 3] s(second) ; 4] Hz(hertz) ; 5] Pa(pascal) ; 6] Ω(ohm) ; 7] L(litre) ; 8] t(tonne) ; 9] ha(hectare) ; 10] Å(ångström) ; 11] mL(millilitre) ; 12] cm(centimetre) ; 13] dB(decibel) ; 14] kbit(kilobit) ; 15] Kibit(kibibit) ; 16] kB(kilobyte) ; 17] KiB(kibibyte) ; 18] MB(megabyte) ; 19] MiB(mebibyte)
m
(metre) SI base unit of length
kg
(kilogram) SI base unit of mass
s
(second) SI base unit of time
Hz
(hertz) SI unit of frequency
Pa
(pascal) SI unit of pressure or stress
Ω
(ohm) SI unit of electric resistance
L
(litre) 1 dm³
t
(tonne) 10³ kg
ha
(hectare) 1 hm²
Å
(ångström) 10⁻¹⁰ m
mL
(millilitre)
cm
(centimetre)
dB
(decibel) see remarks, below
kbit
(kilobit) 10³ or 1000 bits
Kibit
(kibibit) 2¹⁰ or 1024 bits
kB
(kilobyte) 10³ or 1000 bytes
KiB
(kibibyte) 2¹⁰ or 1024 bytes
MB
(megabyte) 10⁶ or 1 000 000 bytes
MiB
(mebibyte) 2²⁰ or 1 048 576 bytes
specifies the number of the specified units that
comprise the measurement
indicates the substance that is being measured
may be used to specify further information about the entity referenced by
this name, for example the occupation of a person, or the status of a place.
(reference to the canonical name) provides a means of locating the canonical form
(nym) of the names associated with the object
named by
the element bearing it.
Suggested values include: 1] below; 2] bottom; 3] margin; 4] top; 5] opposite; 6] overleaf; 7] above; 8] end; 9] inline; 10] inspace
below
below the line
bottom
at the foot of the page
margin
in the margin (left, right, or both)
top
at the top of the page
opposite
on the opposite, i.e. facing, page
overleaf
on the other side of the leaf
above
above the line
end
at the end of e.g. chapter or volume.
inline
within the body of the text.
inspace
in a predefined space, for example left by an earlier scribe.
below
below the line
bottom
at the foot of the page
margin
in the margin (left, right, or both)
top
at the top of the page
opposite
on the opposite, i.e. facing, page
overleaf
on the other side of the leaf
above
above the line
end
at the end of e.g. chapter or volume.
inline
within the body of the text.
inspace
in a predefined space, for example left by an earlier scribe.
characterizes the element in some sense, using any convenient
classification scheme or typology.
provides a sub-categorization of the element, if needed
specifies the destination of the reference by supplying one or more URI References
specifies the intended meaning when the target of a
pointer is itself a pointer.
all
if the element pointed to is itself a pointer, then
the target of that pointer will be taken, and so on, until
an element is found which is not a pointer.
one
if the element pointed to is itself a pointer, then
its target (whether a pointer or not) is taken as the target
of this pointer.
none
no further evaluation of targets is carried out
beyond that needed to find the element specified in the
pointer's target.
(edition) supplies an arbitrary identifier for the source edition in which
the associated feature (for example, a page, column, or line
break) occurs at this point in the text.
indicates the end of a span initiated by the element
bearing this attribute.
indicates the kind of information held in this cell or
in each cell of this row.
Suggested values include: 1] label; 2] data
label
labelling or descriptive information only.
data
data values.
indicates the number of rows occupied by this cell or row.
(columns) indicates the number of columns occupied by this cell or
row.
signifies the hand of the agent which made the intervention.
indicates the effect of the intervention, for example in
the case of a deletion, strikeouts
which include too much or too little text, or in the case of an
addition, an insertion which duplicates some of the text
already present.
Sample values include: 1] duplicate; 2] duplicate-partial; 3] excessStart; 4] excessEnd; 5] shortStart; 6] shortEnd; 7] partial; 8] unremarkable
(sequence) assigns a sequence number related to the order in which
the encoded features carrying this attribute are believed to have occurred.
specifies the version name or number of the source from
which the translated version was derived
(paragraph) marks paragraphs in prose.
(foreign) identifies a word or phrase as belonging to some language other than that of the
surrounding text.
(emphasized) marks words or phrases which are stressed or emphasized for
linguistic or rhetorical effect.
(highlighted) marks a word or phrase as graphically distinct from the
surrounding text, for reasons concerning which no claim is
made.
identifies any word or phrase which is regarded as linguistically distinct, for example as
archaic, technical, dialectal, non-preferred, etc., or as forming part of a sublanguage.
specifies the sublanguage or register to which the word or phrase is being
assigned
specifies how the phrase is distinct diachronically
specifies how the phrase is distinct diatopically
specifies how the phrase is distinct diastatically
(speech or thought) indicates passages thought or spoken aloud, whether explicitly indicated in the source or
not, whether directly or indirectly reported, whether by real people or fictional characters.
may be used to indicate whether the quoted matter is regarded as having been vocalized
or signed.
may be used to indicate whether the quoted matter is regarded as direct or indirect
speech.
(quotation) contains a phrase or passage attributed by the narrator or author to some agency external
to the text.
(separated from the surrounding text with quotation marks) contains material which is marked as (ostensibly) being somehow different than the
surrounding text, for any one of a variety of reasons including, but not limited to: direct
speech or thought, technical terms or jargon, authorial distance, quotations from elsewhere, and
passages that are mentioned but not used.
may be used to indicate whether the offset passage is spoken or thought, or to
characterize it more finely.
Suggested values include: 1] spoken; 2] thought; 3] written; 4] soCalled; 5] foreign(foreign words) ; 6] distinct(linguistically distinct) ; 7] term(technical term) ; 8] emph(rhetorically emphasized) ; 9] mentioned
spoken
representation of speech
thought
representation of thought, e.g. internal monologue
written
quotation from a written source
soCalled
authorial distance
foreign
(foreign words)
distinct
(linguistically distinct)
term
(technical term)
emph
(rhetorically emphasized)
mentioned
refering to itself, not its normal referant
(cited quotation) contains a quotation from some other document, together with a bibliographic reference to
its source. In a dictionary it may contain an example text with at least one occurrence of the
word form, used in the sense being described, or a translation of the headword, or an example.
marks words or phrases mentioned, not used.
contains a word or phrase for which the author or narrator indicates a disclaiming of
responsibility, for example by the use of scare quotes or italics.
(description) contains a brief description of the object documented by its parent element, including its
intended usage, purpose, or application where this is appropriate.
identifies a phrase or word used to provide a gloss or definition for some other word or
phrase.
(canonical reference) identifies the associated term element using a canonical reference from a
scheme defined in a refsDecl element in the TEI header
contains a single-word, multi-word, or symbolic designation which is regarded as a technical
term.
identifies the associated gloss element using a canonical reference from a
scheme defined in a refsDecl element in the TEI header
supplies the sort key for this term in an index.
(latin for thus or so
) contains text reproduced although apparently incorrect or inaccurate.
(correction) contains the correct form of a passage apparently erroneous in the copy text.
groups a number of alternative encodings for the same point in
a text.
(regularization) contains a reading which has been regularized or normalized in some sense.
(original form) contains a reading which is marked as following the original, rather than being normalized
or corrected.
(gap) indicates a point where material has been omitted in a transcription, whether for editorial
reasons described in the TEI header, as part of sampling practice, or because the material is
illegible, invisible, or inaudible.
gives the reason for omission. Sample values include sampling,
inaudible, irrelevant, cancelled.
in the case of text omitted from the transcription because of deliberate deletion by an
identifiable hand, signifies the hand which made the deletion.
In the case of text omitted because of damage, categorizes the cause of the damage, if
it can be identified.
Sample values include: 1] rubbing; 2] mildew; 3] smoke
(addition) contains letters, words, or phrases inserted in the text by an
author, scribe, annotator, or corrector.
(deletion) contains a letter, word, or passage deleted, marked as deleted, or otherwise indicated as
superfluous or spurious in the copy text by an author, scribe, annotator, or corrector.
contains a word, phrase, or passage which cannot be transcribed with certainty because it
is illegible or inaudible in the source.
indicates why the material is hard to transcribe.
Where the difficulty in transcription arises from action (partial deletion, etc.)
assignable to an identifiable hand, signifies the hand responsible for the action.
Where the difficulty in transcription arises from damage, categorizes the cause of
the damage, if it can be identified.
Sample values include: 1] rubbing; 2] mildew; 3] smoke
(name, proper noun) contains a proper noun or noun phrase.
(referencing string) contains a general purpose name or referring string.
indicates more specifically the object referred to by the referencing string.
Values might include person, place, ship,
element etc.
(electronic mail address) contains an e-mail address identifying a location to which
e-mail messages can be delivered.
contains a postal address, for example of a
publisher, an organization, or an individual.
(address line) contains one line of a postal address.
a full street address including any name or number identifying a
building as well as the name of the street or route on which it is
located.
(postal code) contains a numerical or alphanumeric code used as part of a postal address to simplify
sorting or delivery of mail.
(postal box or post office box) contains a number or other identifier for some postal delivery point other than a street
address.
(number) contains a number, written in any form.
indicates the type of numeric value.
Suggested values include: 1] cardinal; 2] ordinal; 3] fraction; 4] percentage
cardinal
absolute number, e.g. 21, 21.5
ordinal
ordinal number, e.g. 21st
fraction
fraction, e.g. one half or three-quarters
percentage
a percentage
supplies the value of the number in standard form.
contains a word or phrase referring to some quantity of an object or commodity, usually
comprising a number, a unit, and a commodity name.
specifies the type of measurement in any convenient typology.
(measure group) contains a group of dimensional specifications which relate to the same object, for example
the height and width of a manuscript page.
contains a date in any format.
indicates the system or calendar to which the date represented by the content of this
element belongs.
Suggested values include: 1] Gregorian; 2] Julian; 3] Islamic; 4] Hebrew; 5] Revolutionary; 6] Iranian; 7] Coptic; 8] Chinese
Gregorian
Gregorian calendar
Julian
Julian calendar
Islamic
Islamic or Muslim (hijri) lunar calendar
Hebrew
Hebrew or Jewish lunisolar calendar
Revolutionary
French Revolutionary calendar
Iranian
Iranian or Persian (Jalaali) solar calendar
Coptic
Coptic or Alexandrian calendar
Chinese
Chinese lunisolar calendar
contains a phrase defining a time of day in any format.
(abbreviation) contains an abbreviation of any sort.
allows the encoder to classify the abbreviation according to some convenient
typology.
Sample values include: 1] suspension; 2] contraction; 3] brevigraph; 4] superscription; 5] acronym; 6] title; 7] organization; 8] geographic
(expansion) contains the expansion of an abbreviation.
(pointer) defines a pointer to another location.
Only one of the
attributes 'target' and 'cRef' may be supplied.
(canonical reference) specifies the destination of the pointer by supplying a canonical reference from a
scheme defined in a refsDecl element in the TEI header
(reference) defines a reference to another location, possibly modified by additional text or comment.
Only one of the
attributes 'target' and 'cRef' may be supplied.
(canonical reference) specifies the destination of the reference by supplying a canonical reference from a
scheme defined in a refsDecl element in the TEI header
(list) contains any sequence of items organized as a list.
describes the form of the list.
Suggested values include: 1] ordered; 2] bulleted; 3] simple; 4] gloss
ordered
list items are numbered or lettered.
bulleted
list items are marked with a bullet or other typographic device.
simple
list items are not numbered or bulleted.
gloss
each list item glosses some term or concept, which is given by a label element
preceding the list item.
contains one component of a list.
contains the label associated with an item in a list; in glossaries, marks the term being
defined.
(heading) contains any type of heading, for example the title of a section, or the heading of a list,
glossary, manuscript description, etc.
(heading for list labels) contains the heading for the label or term column in a glossary list or similar structured
list.
(heading for list items) contains the heading for the item or gloss column in a glossary list or similar structured
list.
contains a note or annotation.
indicates whether the copy text shows the exact place of reference for the note.
points to the end of the span to which the note is attached, if the note is not embedded
in the text at that point.
(index entry) marks a location to be indexed for whatever purpose.
supplies a name to specify which index (of several) the index entry belongs to.
indicates the location of an inline graphic, illustration, or figure.
The display width of the image
The display height of the image
A scale factor to be applied to the image to make it the desired display size
(uniform resource locator) A URL which refers to the image itself.
provides encoded binary data representing an inline graphic or other object.
The display width of the object
The display height of the object
A scale factor to be applied to the object to make it the desired display size
The encoding used to encode the binary data. If not specified, this is assumed to be
Base64.
marks a boundary point separating any kind of section of a text, typically but not
necessarily indicating a point at which some part of a standard reference system changes, where
the change is not represented by a structural element.
provides a conventional name for the kind of section changing at this milestone.
Suggested values include: 1] page; 2] column; 3] line; 4] book; 5] poem; 6] canto; 7] speaker; 8] stanza; 9] act; 10] scene; 11] section; 12] absent; 13] unnumbered
page
physical page breaks (synonymous with the pb element).
column
column breaks.
line
line breaks (synonymous with the lb element).
book
any units termed book, liber, etc.
poem
individual poems in a collection.
canto
cantos or other major sections of a poem.
speaker
changes of speaker or narrator.
stanza
stanzas within a poem, book, or canto.
act
acts within a play.
scene
scenes within a play or act.
section
sections of any kind.
absent
passages not present in the reference edition.
unnumbered
passages present in the text, but not to be included as part of the reference.
(page break) marks the boundary between one page of a text and the next in a standard reference system.
(line break) marks the start of a new (typographic) line in some edition or version of a text.
(column break) marks the boundary between one column of a text and the next
in a standard reference system.
(analytic level) contains bibliographic elements describing an item (e.g. an article or poem) published
within a monograph or journal and not as an independent publication.
(monographic level) contains bibliographic elements describing an item (e.g. a book or journal) published as an
independent item (i.e. as a separate physical object).
(series information) contains information about the series in which a book or other bibliographic item has
appeared.
in a bibliographic reference, contains the name(s) of the
author(s), personal or corporate, of a work; for example in the same
form as that provided by a recognized bibliographic name authority.
secondary statement of responsibility for a bibliographic item, for example the name of an
individual, institution or organization, (or of several such) acting as editor, compiler,
translator, etc.
(statement of responsibility) supplies a statement of responsibility for the intellectual content of a text, edition,
recording, or series, where the specialized elements for authors, editors, etc. do not suffice
or do not apply.
(responsibility) contains a phrase describing the nature of a person's intellectual responsibility.
contains a title for any kind of work.
indicates the bibliographic level for a title, that is, whether
it identifies an article, book, journal, series, or
unpublished material.
a
(analytic) analytic title (article, poem, or other item
published as part of a larger item)
m
(monographic) monographic title (book, collection, or
other item published as a distinct item,
including single volumes of multi-volume
works)
j
(journal) journal title
s
(series) series title
u
(unpublished) title of unpublished material (including
theses and dissertations unless
published by a commercial press)
classifies the title according to some convenient typology.
Sample values include: 1] main; 2] sub(subordinate) ; 3] alt(alternate) ; 4] short; 5] desc(descriptive)
contains the formalized descriptive title for a meeting or conference, for use in a
bibliographic description for an item derived from such a meeting, or as a heading or preamble
to publications emanating from it.
groups information relating to the publication or distribution
of a bibliographic item.
provides the name of the organization responsible for the publication or distribution of a
bibliographic item.
(scope of citation) defines the scope of a bibliographic reference, for example as a
list of page numbers, or a named subdivision of a larger work.
identifies the type of information conveyed by the element, e.g.
columns, pages, volume.
Suggested values include: 1] vol(volume) ; 2] issue; 3] pp(pages) ; 4] ll (lines) ; 5] chap(chapter) ; 6] part
vol
(volume) the element contains a volume number.
issue
the element contains an issue number, or volume and
issue numbers.
pp
(pages) the element contains a page number or page range.
ll
(lines) the element contains a line number or line range.
chap
(chapter) the element contains a chapter indication (number
and/or title)
part
the element identifies a part of a book or collection.
specifies the starting point of the range of units indicated by the type attribute.
specifies the end-point of the range of units indicated by the type attribute.
(publication place) contains the name of the place where a bibliographic item was published.
(bibliographic citation) contains a loosely-structured bibliographic citation of which the sub-components may or may
not be explicitly tagged.
(structured bibliographic citation) contains a structured bibliographic citation, in which only bibliographic sub-elements
appear and in a specified order.
(citation list) contains a list of bibliographic citations of any kind.
contains or references some other bibliographic item which is related to the present one in
some specified manner, for example as a constituent or alternative version of it.
If the 'target' attribute is used, the
relatedItem element must be empty
A relatedItem element should have either a 'target' attribute
or a child element to indicate the related bibliographic item
points to the related bibliographic element by means of an
absolute or relative URI reference
(verse line) contains a single, possibly incomplete, line of verse.
specifies whether or not the line is metrically complete.
Y
(yes) the line is metrically incomplete
N
(no) either the line is complete, or no claim is made as to its completeness
I
(initial) the initial part of an incomplete line
M
(medial) a medial part of an incomplete line
F
(final) the final part of an incomplete line
(line group) contains a group of verse lines functioning as a formal unit, e.g. a stanza, refrain,
verse paragraph, etc.
(speech) An individual speech in a performance text, or a passage presented as such in a prose or
verse text.
A specialized form of heading or label, giving the name of one or more speakers in a
dramatic text or fragment.
(stage direction) contains any kind of stage direction within a dramatic text or fragment.
indicates the kind of stage direction.
Suggested values include: 1] setting; 2] entrance; 3] exit; 4] business; 5] novelistic; 6] delivery; 7] modifier; 8] location; 9] mixed
setting
describes a setting.
entrance
describes an entrance.
exit
describes an exit.
business
describes stage business.
novelistic
is a narrative, motivating stage direction.
delivery
describes how a character speaks.
modifier
gives some detail about a character.
location
describes a location.
mixed
more than one of the above
(automatically generated text division) indicates the location at which a textual division generated
automatically by a text-processing application is to appear.
specifies what type of generated text division (e.g. index,
table of contents, etc.) is to appear.
Sample values include: 1] index; 2] toc; 3] figlist; 4] tablist
(TEI Header) supplies the descriptive and declarative information making up an electronic title page
prefixed to every TEI-conformant text.
specifies the kind of document to which the header is attached, for example whether it
is a corpus or individual text.
Sample values include: 1] text; 2] corpus
(file description) contains a full bibliographic description of an electronic file.
(title statement) groups information about the title of a work and those responsible for its intellectual
content.
specifies the name of a sponsoring organization or institution.
(funding body) specifies the name of an individual, institution, or organization responsible for the
funding of a project or text.
(principal researcher) supplies the name of the principal researcher responsible for the
creation of an electronic text.
(edition statement) groups information relating to one edition of a text.
(edition) describes the particularities of one edition of a text.
describes the approximate size of a text as stored on some carrier medium, whether digital
or non-digital, specified in any convenient units.
(publication statement) groups information concerning the publication or distribution of an electronic or other
text.
supplies the name of a person or other agency responsible for the
distribution of a text.
(release authority) supplies the name of a person or other agency responsible for
making an electronic file available, other than a publisher or
distributor.
(identifier) supplies any form of identifier used to identify some object,
such as a bibliographic item, a person, a title, an organization,
etc. in a standardized way.
categorizes the identifier, for example as an ISBN, Social
Security number, etc.
supplies information about the availability of a text, for example any restrictions on its
use or distribution, its copyright status, etc.
supplies a code identifying the current availability of the text.
free
the text is freely available.
unknown
the status of the text is unknown.
restricted
the text is not freely available.
(series statement) groups information about the series, if any, to which a publication belongs.
(notes statement) collects together any notes providing information about a text additional to that recorded
in other parts of the bibliographic description.
(source description) describes the source from which an electronic text was derived or generated, typically a
bibliographic description in the case of a digitized text, or a phrase such as "born digital"
for a text which has no previous existence.
(fully-structured bibliographic citation) contains a fully-structured bibliographic citation, in which all components of the TEI file
description are present.
(encoding description) documents the relationship between an electronic text and the
source or sources from which it was derived.
(project description) describes in detail the aim or purpose for which an electronic file was encoded, together
with any other relevant information concerning the process by which it was assembled or
collected.
(sampling declaration) contains a prose description of the rationale and methods used in sampling texts in the
creation of a corpus or collection.
(editorial practice declaration) provides details of editorial principles and practices applied
during the encoding of a text.
(correction principles) states how and under what circumstances corrections have been made in the text.
indicates the degree of correction applied to the text.
high
the text has been thoroughly checked and proofread.
medium
the text has been checked at least once.
low
the text has not been checked.
unknown
the correction status of the text is unknown.
indicates the method adopted to indicate corrections within the text.
silent
corrections have been made silently
markup
corrections have been represented using markup
indicates the extent of normalization or regularization of the original source carried out
in converting it to electronic form.
indicates the authority for any normalization carried out.
indicates the method adopted to indicate normalizations within the text.
silent
normalization made silently
markup
normalization represented using markup
specifies editorial practice adopted with respect to quotation marks in the original.
(quotation marks) indicates whether or not quotation marks have been retained as content within the text.
none
no quotation marks have been retained
some
some quotation marks have been retained
all
all quotation marks have been retained
specifies how quotation marks are indicated within the text.
summarizes the way in which hyphenation in a source text has been treated in an encoded
version of it.
(end-of-line) indicates whether or not end-of-line hyphenation has been retained in a text.
all
all end-of-line hyphenation has been retained, even though the lineation of the
original may not have been.
some
end-of-line hyphenation has been retained in some cases.
hard
all soft end-of-line hyphenation has been removed: any remaining end-od-line
hyphenation should be retained.
none
all end-of-line hyphenation has been removed: any remaining hyphenation occurred
within the line.
describes the principles according to which the text has been segmented, for example into
sentences, tone-units, graphemic strata, etc.
(standard values) specifies the format used when standardized date or number values are supplied.
describes the scope of any analytic or interpretive information added to the text in
addition to the transcription.
(tagging declaration) provides detailed information about the tagging applied to a document.
supplies information about the usage of a specific element within a text.
The values of gramGrp/ must belong to the set identified
in tagUsage for this element.
The values of cit[@type='trans']/gramGrp/ must belong to the set identified in
tagUsage for this element, in the 'equiv' section.
The values of gramGrp[@type='agr']/ must belong to the set identified in
tagUsage for this element, in the 'agr' section.
The @ana attributes should identify a resource fragment in the FreeDict ontology interface.
(element name) the name (generic identifier) of the element indicated by the tag.
specifies the number of occurrences of this element within the text.
(with unique identifier) specifies the number of occurrences of this element within the text which bear a
distinct value for the global xml:id attribute.
specifies the identifier of a rendition element which defines how this element
is to be rendered.
supplies the formal name of the namespace to which the elements documented by its children
belong.
the full formal name of the namespace concerned.
supplies information about the rendition or appearance of one or more elements in the source
text.
identifies the language used to describe the rendition.
css
Cascading Stylesheet Language
xslfo
Extensible Stylesheet Language Formatting Objects
free
Informal free text description
other
A user-defined rendition description language
where CSS is used, provides a way of defining
pseudo-elements, that is, styling rules
applicable to specific sub-portions of an element.
(references declaration) specifies how canonical references are constructed for this
text.
(canonical reference pattern) specifies an expression and replacement pattern for transforming a canonical reference into
a URI.
specifies a regular expression against which the values of cRef attributes
can be matched.
specifies a replacement pattern which, once subpattern substitution
has been performed, provides a URI.
(reference state) specifies one component of a canonical reference defined by the milestone method.
indicates what kind of state is changing at this milestone.
Suggested values include: 1] page; 2] column; 3] line; 4] book; 5] poem; 6] canto; 7] stanza; 8] act; 9] scene; 10] section; 11] absent
page
page breaks in the reference edition.
column
column breaks.
line
line breaks.
book
any units termed book, liber, etc.
poem
individual poems in a collection.
canto
cantos or other major sections of a poem.
stanza
stanzas within a poem, book, or canto.
act
acts within a play.
scene
scenes within a play or act.
section
sections of any kind.
absent
passages not present in the reference edition.
specifies the fixed length of the reference component.
(delimiter) supplies a delimiting string following the reference component.
(classification declarations) contains one or more taxonomies defining any classificatory
codes used elsewhere in the text.
defines a typology used to classify texts either implicitly, by means of a bibliographic
citation, or explicitly by a structured taxonomy.
contains an individual descriptive category, possibly nested within a superordinate
category, within a user-defined taxonomy.
(category description) describes some category within a taxonomy or text typology, either in the form of a brief
prose description or in terms of the situational parameters used by the TEI formal textDesc.
(application information) records information about an application which has
edited the TEI file.
provides information about an application which has acted upon the document.
Supplies an identifier for the application, independent of its version number or display
name.
Supplies a version number for the application, independent of its identifier or display
name.
[\d]+[a-z]*[\d]*(\.[\d]+[a-z]*[\d]*){0,3}
(text-profile description) provides a detailed description of non-bibliographic aspects of a text, specifically the
languages and sublanguages used, the situation in which it was produced, the participants and
their setting.
contains information about the creation of a text.
(language usage) describes the languages, sublanguages, registers, dialects, etc.
represented within a text.
characterizes a single language or sublanguage used within a text.
(identifier) Supplies a language code constructed as defined in BCP 47 which is used to identify the
language documented by this element, and which is referenced by the global
xml:lang attribute.
specifies the approximate percentage (by volume) of the text which uses this language.
100
(text classification) groups information which describes the nature or topic of a text in terms of a standard
classification scheme, thesaurus, etc.
contains a list of keywords or phrases identifying the topic or nature of a text.
identifies the controlled vocabulary within which the set of keywords concerned is
defined.
(classification code) contains the classification code used for this text in some standard classification system.
identifies the classification system or taxonomy in use.
(category reference) specifies one or more defined categories within some taxonomy or text typology.
identifies the classification scheme within which the set of categories concerned is
defined
(revision description) summarizes the revision history for a file.
summarizes a particular change or correction made to a particular version of an electronic
text which is shared between several researchers.
(geographic coordinates declaration) documents the notation and the datum used for geographic coordinates expressed as content of
the geo element elsewhere within the document.
supplies a commonly used code name for the datum employed.
Suggested values include: 1] WGS84(World Geodetic System) ; 2] MGRS(Military Grid Reference System) ; 3] OSGB36(ordnance survey great britain) ; 4] ED50(European Datum coordinate system)
WGS84
(World Geodetic System) a pair of numbers to be interpreted as latitude followed by longitude according to
the World Geodetic System.
MGRS
(Military Grid Reference System) the values supplied are geospatial entity object codes, based on
OSGB36
(ordnance survey great britain) the value supplied is to be interpreted as a British National Grid Reference.
ED50
(European Datum coordinate system) the value supplied is to be interpreted as latitude followed by longitude according
to the European Datum coordinate system.
(TEI document) contains a single TEI-conformant document,
comprising a TEI header and a text, either in isolation or as part of a
teiCorpus element.
specifies the version number of the TEI Guidelines against
which this document is valid.
contains a single text of any kind, whether unitary or composite, for example a poem or
drama, a collection of essays, a novel, a dictionary, or a corpus sample.
(text body) contains the whole body of a single unitary text, excluding any front or back matter.
contains the body of a composite text, grouping together a sequence of distinct texts (or
groups of such texts) which are regarded as a unit for some purpose, for example the collected
works of an author, a sequence of prose essays, etc.
contains a single text of any kind, whether unitary or composite, which interrupts the text
containing it at any point and after which the surrounding text resumes.
(text division) contains a subdivision of the front, body, or back of a text.
contains a closing title or footer appearing at the end of a division of a text.
contains the primary statement of responsibility given for a work
on its title page or at the head or end of the work.
contains a brief description of the place, date, time, etc. of production of a letter,
newspaper story, or other work, prefixed or suffixed to it as a kind of heading or trailer.
A formal list or prose description of the topics addressed by
a subdivision of a text.
contains a quotation, anonymous or attributed, appearing at the start of a section or
chapter, or on a title page.
groups together dateline, byline, salutation, and similar phrases appearing as a preliminary
group at the start of a division, especially of a letter.
groups together salutations, datelines, and similar phrases appearing as a final group at
the end of a division, especially of a letter.
(salutation) contains a salutation or greeting prefixed to a foreword, dedicatory epistle, or other
division of a text, or the salutation in the closing of a letter, preface, etc.
(signature) contains the closing salutation, etc., appended to a foreword,
dedicatory epistle, or other division of a text.
contains a postscript, e.g. to a letter.
(title page) contains the title page of a text, appearing within the front or back matter.
classifies the title page according to any convenient typology.
(document title) contains the title of a document, including all its
constituents, as given on a title page.
contains a subsection or division of the title of a work, as
indicated on a title page.
specifies the role of this subdivision of the title.
Suggested values include: 1] main; 2] sub(subordinate) ; 3] alt(alternate) ; 4] short; 5] desc(descriptive)
main
main title of the work
sub
(subordinate) subtitle of the work
alt
(alternate) alternative title of the work
short
abbreviated form of title
desc
(descriptive) descriptive paraphrase of the work
(document author) contains the name of the author of the document, as given on the
title page (often but not always contained in a byline).
contains a formal statement authorizing the publication of a work, sometimes required to
appear on a title page or its verso.
(document edition) contains an edition statement as presented on a title page of a
document.
(document imprint) contains the imprint statement (place and date of publication,
publisher name), as given
(usually) at the foot of a title page.
(document date) contains the date of a document, as given
(usually) on a title page.
gives the value of the date in standard form, i.e. YYYY-MM-DD.
(front matter) contains any prefatory matter (headers,
title page, prefaces, dedications, etc.)
found at the start of a document, before the main body.
(back matter) contains any appendixes, etc. following the main part of a text.
indicates type of entry, in dictionaries with multiple types.
Suggested values include: 1] main; 2] hom(homograph) ; 3] xref(cross reference) ; 4] affix; 5] abbr(abbreviation) ; 6] supplemental; 7] foreign
main
a main entry (default).
hom
(homograph) groups information relating to one homograph within an entry.
xref
(cross reference) a reduced entry whose only function is to point to another main
entry (e.g. for forms of an irregular verb or for variant spellings:
was pointing to be, or
esthete to aesthete).
affix
an entry for a prefix, infix, or suffix.
abbr
(abbreviation) an entry for an abbreviation.
supplemental
a supplemental entry (for use in dictionaries which issue supplements to their main
work in which they include updated information about entries).
foreign
an entry for a foreign word in a monolingual dictionary.
contains a (sortable) character sequence reflecting the entry's alphabetical position in
the printed dictionary.
gives an expanded form of information presented more concisely in the dictionary
(normalized) gives a normalized form of information given by the source text in a
non-normalized form
gives the list of split values for a merged form
gives a value which lacks any realization in the printed source text.
(original) gives the original string or is the empty string when the element does not appear
in the source text.
provides a reference to an anchor element elsewhere in the document
indicating the original location of this component.
gives a reference to another element, where the original appears as a merged form.
(optional) indicates whether the element is optional or not
groups successive entries for a set of homographs.
contains a reasonably well-structured dictionary entry.
(homograph) groups information relating to one homograph within an entry.
groups together all information relating to one word sense in a dictionary entry, for
example definitions, examples, and translation equivalents.
gives the nesting depth of this sense.
(form information group) groups all the information on the written and spoken forms of one headword.
classifies form as simple, compound, etc.
Suggested values include: 1] simple; 2] lemma; 3] variant; 4] compound; 5] derivative; 6] inflected; 7] phrase
simple
single free lexical item
lemma
the headword itself
variant
a variant form
compound
word formed from simple lexical items
derivative
word derived from headword
inflected
word in other than usual dictionary form
phrase
multiple-word lexical item
(orthographic form) gives the orthographic form of a dictionary headword.
gives the type of spelling.
gives the extent of the orthographic information provided.
Sample values include: 1] full(full form) ; 2] pref(prefix) ; 3] suff(suffix) ; 4] part(partial)
(pronunciation) contains the pronunciation(s) of the word.
indicates whether the pronunciation is for whole word or part.
Sample values include: 1] full(full form) ; 2] pref(prefix) ; 3] suff(suffix) ; 4] part(partial)
indicates what notation is used for the pronunciation, if more than one occurs in
the machine-readable dictionary.
(hyphenation) contains a hyphenated form of a dictionary headword, or hyphenation information in some
other form.
(syllabification) contains the syllabification of the headword.
contains the stress pattern for a dictionary headword, if given separately.
(grammatical information) within an entry in a dictionary or a terminological data file, contains grammatical
information relating to a term, word, or form.
classifies the grammatical information given according to some convenient typology — in
the case of terminological information, preferably the dictionary of data element types
specified in ISO WD 12 620.
Sample values include: 1] pos(part of speech) ; 2] gen(gender) ; 3] num(number) ; 4] animate; 5] proper
(gender) identifies the morphological gender of a lexical item, as given in the dictionary.
indicates grammatical number associated with a form, as given in a dictionary.
contains grammatical case information given by a dictionary for a given form.
(person) contains an indication of the grammatical person (1st, 2nd, 3rd, etc.) associated with a
given inflected form in a dictionary.
(tense) indicates the grammatical tense associated with a given inflected form in a dictionary.
contains information about the grammatical mood of verbs (e.g. indicative, subjunctive,
imperative).
(inflectional class) indicates the inflectional class associated with a lexical item.
indicates the type of indicator used to specify the inflection class, when it is
necessary to distinguish between the usual abbreviated indications (e.g.
inv) and other kinds of indicators, such as special codes referring
to conjugation patterns, etc.
Sample values include: 1] abbrev; 2] verbTable
(grammatical information group) groups morpho-syntactic information about a lexical item, e.g. pos, gen, number, case, or
iType (inflectional class).
(part of speech) indicates the part of speech assigned to a dictionary
headword such as noun, verb, or adjective.
(subcategorization) contains subcategorization information (transitive/intransitive, countable/non-countable,
etc.)
(collocate) contains a collocate of the headword.
(definition) contains definition text in a dictionary entry.
(etymology) encloses the etymological information in a dictionary entry.
(language name) name of a language mentioned in etymological or other linguistic discussion.
(usage) contains usage information in a dictionary entry.
classifies the usage information using any convenient typology.
Sample values include: 1] geo(geographic) ; 2] time; 3] dom(domain) ; 4] reg(register) ; 5] style; 6] plev(preference level) ; 7] lang(language) ; 8] gram(grammatical) ; 9] syn(synonym) ; 10] hyper(hypernym) ; 11] colloc(collocation) ; 12] comp(complement) ; 13] obj(object) ; 14] subj(subject) ; 15] verb; 16] hint
(label) contains a label for a form, example, translation, or other piece of information, e.g.
abbreviation for, contraction of, literally, approximately, synonyms:, etc.
classifies the label using any convenient typology.
(cross-reference phrase) contains a phrase, sentence, or icon referring the reader to some other location in this or
another text.
indicates the type of cross reference, using any convenient typology.
Sample values include: 1] syn(synonym) ; 2] etym(etymological) ; 3] cf(compare or consult) ; 4] illus(illustration)
(related entry) contains a dictionary entry for a lexical item related to the headword, such as a compound
phrase or derived form, embedded inside a larger entry.
(orthographic-form reference) in a dictionary example, indicates a reference to the orthographic form(s) of the headword.
indicates the kind of typographic modification made to the headword in the reference.
Sample values include: 1] cap(capital) ; 2] noHyph(no hyphen)
(orthographic-variant reference) in a dictionary example, indicates a reference to variant orthographic form(s) of the
headword.
indicates the kind of variant involved.
Sample values include: 1] pt(past tense) ; 2] pp(past participle) ; 3] prp(present participle) ; 4] f(feminine) ; 5] pl(plural)
(pronunciation reference) in a dictionary example, indicates a reference to the pronunciation(s) of the headword.
(pronunciation-variant reference) in a dictionary example, indicates a reference to variant pronunciation(s) of the headword.
contains one cell of a table.
contains one row of a table.
contains text displayed in tabular form, in rows and columns.
indicates the number of rows in the table.
(columns) indicates the number of columns in each row of the table.
(analysis) indicates one or more elements containing interpretations of the
element on which the ana attribute appears.
(corresponds) points to elements that correspond to the current
element in some way.
The W3C XInclude element
pointer to the resource being included
xml
text