Text Encoding Initiative

The XML Version of the TEI Guidelines

<certainty>


<certainty> indicates the degree of certainty or uncertainty associated with some aspect of the text markup.
Attributes (In addition to global attributes and those inherited from metadata)
target points at the elements whose markup is uncertain.
Datatype: IDREFS
Values: one or more valid identifiers, separated by white space.
Default: #REQUIRED
Example:
Elizabeth went to <persName id="P1">Essex</persName>
   <certainty target="P1" locus="gi" degree="0.6"/>
Note

If more than one identifier is given, the <certainty> element is interpreted as applying to all. If no identifier is present on the element being annotated, the attribute should give the identifier of a <ptr> element which points at the element being annotated; for further discussion of this indirect pointing mechanism, see chapter 14 Linking, Segmentation, and Alignment.

locus indicates the precise location of the uncertainty in the markup: applicability of the element, precise position of the start- or end-tag, value of a specific attribute, etc.
Datatype: CDATA
Suggested values include:
#gi uncertain whether the element used actually applies to the passage.
#startloc start-tag may not be correctly located.
#endloc end-tag may not be correctly located.
#location both the start-tag and the end-tag may not be correctly located.
name the value given for the attribute name is uncertain.
#transcribedContent the content of the element may not be a correct transcription of the source text.
#suppliedContent the content of the element may not have been correctly supplied by the reader, e.g. as in the cases of corr and abbrev elements.
Default: #REQUIRED
Note

The ‘#’ distinguishes the terms of the controlled vocabulary from possible collisions with attribute names. Extensions to this vocabulary should also use this prefix.

assertedValue provides an alternative value for the aspect of the markup in question—an alternative generic identifier, transcription, or attribute value, or the identifier of an <anchor> element (to indicate an alternative starting or ending location). If an assertedValue is given, the confidence level specified by degree applies to the alternative markup specified by assertedValue; if none is given, it applies to the markup in the text.
Datatype: CDATA
Values: generic identifier, attribute value, location (e.g. indicated by a reference to an <anchor> element or to an <xptr> element), or other appropriate alternative value.
Default: #IMPLIED
Example:
Elizabeth went to <persName id="p1">Essex</persName>
   <certainty target="p1" locus="#gi" assertedValue="place" 
degree="0.2"/> 
Note

This attribute makes it possible to indicate the degree of confidence in a specific alternative to some aspect of the markup. In the example above the encoder is expressing the likelihood (.2) that the generic identifier should be <place> rather than <persName>, which is the coded element.

desc further describes the uncertainty in prose, perhaps indicating its nature, cause, or the justification for the degree of confidence asserted.
Datatype: CDATA
Values: a prose description of how and why the markup is uncertain.
Default: #IMPLIED
Example:
Elizabeth went to 
<persName id="p1">Essex</persName>
 <certainty target="p1" locus="#gi" degree="0.2"
  desc="Time of writing indicates the Earl rather than the town" /> 
Note

In a given project, it may be possible to enumerate a finite list of recognized types and causes of uncertainty; in such cases, it will be useful to control the vocabulary used in this attribute, to aid later mechanical manipulation. It is not possible to suggest such a controlled vocabulary for general use.

given indicates conditions assumed in the assignment of a degree of confidence.
Datatype: CDATA
Values: a characterization of the conditions which are assumed in the assignment of a degree of confidence. This may be in prose.
Default: #IMPLIED
Example:
<!-- in the header, hand H1 is identified as that of MSM  -->
  <hand id="H1" scribe="MSM"/>
<!-- ... -->
<!-- in the text, the scribe has corrected 'Wessex' to 'Essex'  -->
  Elizabeth went to <corr id="C1" sic="Wessex" resp="MSM">Essex</corr>. 
<!-- we are 60%; certain that hand H1 is MSM,
     and 90%; certain that if H1 is MSM, then
     it is MSM who corrected 'Wessex' into 'Essex'.  -->
<certainty target="H1" locus="scribe" degree="0.6" id="P1"/>
<certainty target="C1" locus="resp" given="P1" degree="0.9"/>
Note

A project may wish to control the vocabulary used in this attribute.

The envisioned typical value of this attribute would be the identifier of another <certainty> element or a list of such identifiers. It may thus be possible to construct probability networks by chaining <certainty> elements together. Such networks would ultimately be grounded in unconditional <certainty> elements (with no value for given). The semantics of this chaining would be understood in this way: if a <certainty> element is specified, via a reference, as the assumption, then it is not the attribution of uncertainty that is the assumption, but rather the assertion itself. For instance, in the example above, the first <certainty> element indicates that the confidence in the identification of the new scribe as ‘msm’. The second indicates the degree of confidence that ‘Essex’ is a personal name, given that the new scribe is ‘msm’. Note that the given in the second <certainty> element is not the assertion that the likelihood that msm is the new scribe is 0.6, but simply the assertion that msm is the new scribe; this is a recommended convention to facilitate building networks.

The ambitious encoder may wish to attempt complex networks or probability assertions, experimenting with references to other elements or prose assertions, and deploying feature structure connectives such as <alt>, <join>, and <not>. However, we do not believe that the <certainty> element gives, at this time, a comprehensive ambiguity-free system for indicating certainty.

degree indicates the degree of confidence assigned to the aspect of the markup named by the locus attribute.
Datatype: CDATA
Values: Values of degree might be yes or no, the reals between 0 and 1, or traditional characterizations such as ‘doubtful’, ‘circa’, etc. Generally we recommend decimal numbers between 0 and 1, where larger numbers denote a greater degree of confidence in the assertions; 0 representing ‘certainly false’ and 1 representing ‘certainly true’.
Default: #IMPLIED
Example

(For discussion of this example, see section 17.1.2 Structured Indications of Uncertainty)

Earnest went to <anchor id="a1"/> old
<persName id="p1">Saybrook</persName>.
<certainty id="c1" target="p1" locus="#gi" degree="0.6"/>
<certainty target="p1" locus="startloc" given="c1" degree="0.9"/>
<certainty id="c2" target="p1" locus="#gi" 
   assertedValue="persName" degree="0.4"/>
<certainty target="p1" locus="startloc" given="c2" degree="0.5"/>
<certainty id="c3" target="p1" locus="startloc" 
   assertedValue="a1" given="c1" degree="0.5"/>

Module Declared in file teicert2; Additional tag set for certainty: enabled by TEI.certainty
Class metadata
Data Description Empty.
May occur within ab abbr activity actor add addName addrLine address admin affiliation analytic app argument author authority back bibl biblFull biblScope biblStruct birth bloc body byline camera caption case castGroup castItem castList cell channel cit cl classCode closer colloc constitution corr country creation damage date dateRange dateStruct dateline def del derivation descrip dictScrap distance distinct distributor div div0 div1 div2 div3 div4 div5 div6 div7 docAuthor docDate docEdition docImprint docTitle domain edition editor education emph entry entryFree epigraph epilogue etym expan extent factuality figDesc figure firstLang foreName foreign form front funder fw gen genName geogName gloss gram gramGrp graph group head headItem headLabel hi hyph imprimatur imprint interaction item itype l label lang langKnown language lbl lem lg lg1 lg2 lg3 lg4 lg5 list listBibl locale m measure meeting mentioned metDecl monogr mood name nameLink note num number occasion occupation offset ofig opener orgDivn orgName orgTitle orgType orig orth otherForm p per performance persName phr placeName pos preparedness principal prologue pron pubPlace publicationStmt publisher purpose q quote rdg rdgGrp re ref reg region rendition residence resp respStmt restore role roleDesc roleName row rs s salute seg sense series set settlement sic signed soCalled socecStatus sound sp speaker sponsor stage street stress subc supplied surname syll symbol table tagUsage tech term termEntry text tig time timeRange timeStruct title titlePage titlePart tns tr trailer trans u unclear usg view w wit witDetail witList witness writing xr xref
Declaration
<!ELEMENT certainty %om.RO;  EMPTY> 
<!ATTLIST certainty  
      %a.global;
      target IDREFS #REQUIRED
      locus CDATA #REQUIRED
      assertedValue CDATA #IMPLIED
      desc CDATA #IMPLIED
      given CDATA #IMPLIED
      degree CDATA #IMPLIED>
See further 17.1.2 Structured Indications of Uncertainty

Up: 35 Elements