Uni Ntikuain

<?xml version="1.0" encoding="UTF-8"?>
<?xml-model href="http://www.tei-c.org/release/xml/tei/custom/schema/relaxng/tei_all.rng" type="application/xml" schematypens="http://relaxng.org/ns/structure/1.0"?>
<?xml-model href="http://www.tei-c.org/release/xml/tei/custom/schema/relaxng/tei_all.rng" type="application/xml"
	schematypens="http://purl.oclc.org/dsdl/schematron"?>
<TEI xmlns="http://www.tei-c.org/ns/1.0" xml:id="L103-tok">
   <teiHeader>
      <fileDesc>
         <titleStmt>
            <title xml:lang="mix"><seg xml:id="L103-00-01"><seg xml:id="L103-00-01-01">Uni</seg> <seg xml:id="L103-00-01-02">Ntikuain</seg></seg></title>
            <title xml:lang="esp"><seg xml:id="L103-00-02"><seg xml:id="L103-00-02-01">Las</seg> <seg xml:id="L103-00-02-02">tres</seg> <seg xml:id="L103-00-03">Ardillas</seg></seg></title>
            <author>María Gómez Hernández</author>
            <editor>Gisela Beckmann</editor>
            <editor>María M. Nieves</editor>
            <respStmt>
               <resp>TEI Encoding</resp>
               <name>Jack T. Bowers</name>  
            </respStmt>
         </titleStmt>
         <publicationStmt><p>This is an XML-TEI markup of the original publications by SIL Mexico, prepared for archival
            and for the creation of a body of Mixtepec-Mixtec language resources (LR) (ISO: mix).
            The encoded documents are used with the permission of the original authors.</p></publicationStmt>
         <notesStmt><note xml:id="L103-resumen" xml:lang="esp">Tres ardillas se metieron en problemas con un perro y un toro.</note></notesStmt>
         <sourceDesc>
            <bibl xml:id="bibl.L103"><title>Uni ntikuain</title> <author>Gómez Hernández, María</author>; <editor>Beckmann, Gisela</editor>; <editor>Nieves, María M.</editor>. <date>2007</date>. <publisher>Instituto Lingüístico de Verano, A.C.</publisher> <pubPlace>Tlalpan, D.F., México</pubPlace> Obtained from:<ptr target="http://www-01.sil.org/mexico/mixteca/mixtepec/L103-Ardillas-mix.htm"/> </bibl>
         </sourceDesc>
      </fileDesc>
      <profileDesc>
         <textDesc>
            <channel mode="w">@mode:'w' means that the primary means of utilising this document is by reading its written form.</channel>
            <constitution type="single"/>
            <derivation type="original">This document is labeled "original" because the XML-TEI markup of its contents has not altered the original content.</derivation>
            <domain type="squirrels"/>
            <factuality type="fiction"/>
            <interaction type="none"/>
            <preparedness type="NA">not applicable</preparedness>
            <purpose type="entertain"/>
         </textDesc>
      </profileDesc>
      <encodingDesc>
         <p>
            <list>
               <item><label>segment</label>
                  <elementSpec ident="seg">
                     <desc>In this document (and in all others in the SIL collection) the 'seg' element is used to encapsulate:
                        <list>
                           <item>i) entire sentences from the original document (first level);</item>
                           <item>ii) the individual lexical items contained within those sentences (second level).</item>
                        </list>
                     </desc>
                  </elementSpec>
               </item>
               <item><label>character</label>
                  <elementSpec ident="c">
                     <desc>Within the sentence-level "seg" (segment) elements, all punctuation characters (".", ";", ":", "¿", "?", "!", etc.) are encapsulated in the "c" (character) element.
                        In future stages of the project, each of these punctuation characters will be formally linked with the linguistic categories with which it is associated. For example:<eg>a sentence that begins with "¿"
                           (and ends with "?") can be immediately recognized as sentence type "question/interrogative".</eg>
                     </desc>
                  </elementSpec>
               </item>
               <item><label>division</label>
                  <elementSpec ident="div">
                     <desc>Page divisions in the original documents are encoded as "div" (division) elements; this separates the linguistic and knowledge data from formatting.</desc>
                  </elementSpec>
                  
               </item>
            </list>
         </p>
         <classDecl>
            <taxonomy xml:id="tax.sil-mix">
               <category xml:id="pedagogical">
                  <catDesc>PEDAGOGICAL</catDesc>
                  <category xml:id="pedagogical-inter">
                     <catDesc>PEDAGOGICAL:INTERACTIVE</catDesc>
                  </category>
                  <category xml:id="pedagogical-ref">
                     <catDesc>PEDAGOGICAL:REFERENCE</catDesc>
                  </category>
               </category>
               <category xml:id="fiction">
                  <catDesc>FICTION</catDesc>
                  <category xml:id="fiction-fantasy">
                     <catDesc>FICTION:FANTASY</catDesc>
                  </category>
                  <category xml:id="fiction-realistic">
                     <catDesc>FICTION:REALISTIC</catDesc>
                  </category>
               </category>
               <category xml:id="folklore">
                  <catDesc>FOLKLORE</catDesc>
               </category>
            </taxonomy>
         </classDecl>
         <projectDesc>
            <p>
               This portion of the lexicography project deals with the only known body of literature in the Mixtepec-Mixtec language, which comprises a series of 34* orthographic children's texts published by SIL Mexico (Summer Institute of Linguistics). These documents are aimed primarily at young MIX speakers and are designed for use as classroom handouts and/or lessons at the primary/elementary school level. The publications cover both culturally specific and non-culturally specific subject matter, supplemented by illustrations. Topics covered include vocabulary, mathematics, telling time, geography, seasons, weather, local agricultural practices, fables, and local Mixtec legends, among others. These publications make up the second most important collection in this project, and they remain the only source of reference for examples of how the working orthography of the Mixtec language is applied to the Mixtepec variety.
               The encoding of this portion of the MIX language resources can be divided into the following components:
               <list>
                  <item>i) TEI-XML markup of documents that supports reuse and extension within this project and/or for other interested parties;</item>
                  
                  <item>ii) Annotation and glossing of lexical information and structures in documents; tasks include:
                     
                     <list>
                        <item>glossing of orthographical lexical items;</item>
                        <item>assignment of lexical properties to sentences, phrases and lexical items (morpho-/syntactic; pragmatic; semantic etc.);</item>
                        <item>linking corresponding bilingual items (Spanish-Mixtec) within each document;</item>
                        <item>entry of analyzed, glossed and categorized lexical information into TEI-Dictionary;</item>
                        <item>linking orthographical forms of lexical items with phonetic transcriptions of spoken forms;</item></list></item>
                  
                  <item>iii) Classification of Document Types and Ontology Linking;
                     <list>
                        <item>replacing the current SIL document classification model <ref target="#tax.sil-mix"/> with an established and extensible document classification system that can be used to classify any new document or literature produced in the language in the future;</item>
                        <item>in line with the goal of the overall project, replacing language-specific glosses (including 'Interlingua') by grounding the semantic profile of each lexical item in references to ontological entities (from existing ontologies such as DBpedia, SKOS, etc.).</item>
                     </list>
                  </item>
               </list>
            </p>
         </projectDesc>
      </encodingDesc>
   </teiHeader>
   <text decls="#fiction-fantasy" xml:lang="mix">
      <body>
         <div xml:id="L103-01">
            <p><graphic url="L103-01.png"/>
               <seg xml:id="L103-01-01"><seg xml:id="L103-01-01-01">Iin</seg> <seg xml:id="L103-01-01-02">kii</seg><c>,</c> <seg xml:id="L103-01-01-03">uni</seg> <seg xml:id="L103-01-01-04">ntikuain</seg> <seg xml:id="L103-01-01-05">ntakita'in</seg> <seg xml:id="L103-01-01-06">yuku</seg> <seg xml:id="L103-01-01-07">ka</seg><c>.</c></seg>
               <seg xml:id="L103-01-02"><seg xml:id="L103-01-02-01">Sara</seg> <seg xml:id="L103-01-02-02">nikachia</seg> <seg xml:id="L103-01-02-03">tsi</seg> <seg xml:id="L103-01-02-04">inka</seg> <seg xml:id="L103-01-02-05">tuku</seg> <seg xml:id="L103-01-02-06">ntikuain</seg> <seg xml:id="L103-01-02-07">ka</seg><c>:</c></seg>
               <seg xml:id="L103-01-03"><c>—</c><seg xml:id="L103-01-03-01">Nako'on</seg> <seg xml:id="L103-01-03-02">nuu</seg> <seg xml:id="L103-01-03-03">itu</seg> <seg xml:id="L103-01-03-04">na</seg><c>,</c> <seg xml:id="L103-01-03-05">katso</seg> <seg xml:id="L103-01-03-06">ntixi</seg> <c>—</c><seg xml:id="L103-01-03-07">nikachia</seg><c>.</c></seg>
               <seg xml:id="L103-01-04"><seg xml:id="L103-01-04-01">Sara</seg> <seg xml:id="L103-01-04-02">ntatu'un</seg> <seg xml:id="L103-01-04-03">kueti</seg> <seg xml:id="L103-01-04-04">ra</seg> <seg xml:id="L103-01-04-05">kua'in</seg><c>.</c></seg>
               <seg xml:id="L103-01-05"><seg xml:id="L103-01-05-01">Ta</seg> <seg xml:id="L103-01-05-02">ntsai</seg> <seg xml:id="L103-01-05-03">nuu</seg> <seg xml:id="L103-01-05-04">itu</seg> <seg xml:id="L103-01-05-05">na</seg> <seg xml:id="L103-01-05-06">ka</seg><c>,</c> <seg xml:id="L103-01-05-07">ntakita'in</seg> <seg xml:id="L103-01-05-08">tsi</seg> <seg xml:id="L103-01-05-09">iin</seg> <seg xml:id="L103-01-05-10">tina</seg> <seg xml:id="L103-01-05-11">tsi'i</seg> <seg xml:id="L103-01-05-12">soko</seg><c>.</c> </seg>
               <seg xml:id="L103-01-06"><seg xml:id="L103-01-06-01">Sara</seg> <seg xml:id="L103-01-06-02">nikachi</seg> <seg xml:id="L103-01-06-03">tina</seg> <seg xml:id="L103-01-06-04">ka</seg> <seg xml:id="L103-01-06-05">saan</seg><c>:</c></seg>
               <seg xml:id="L103-01-07"><c>—</c><seg xml:id="L103-01-07-01">Vichi</seg> <seg xml:id="L103-01-07-02">ra</seg> <seg xml:id="L103-01-07-03">kunkatsi</seg> <seg xml:id="L103-01-07-04">yo</seg> <seg xml:id="L103-01-07-05">ntikuain</seg><c>,</c> <seg xml:id="L103-01-07-06">vari</seg> <seg xml:id="L103-01-07-07">tsi'i</seg> <seg xml:id="L103-01-07-08">ta</seg> <seg xml:id="L103-01-07-09">yu</seg> <seg xml:id="L103-01-07-10">soko</seg><c>,</c> <seg xml:id="L103-01-07-11">koo</seg> <seg xml:id="L103-01-07-12">staa</seg> <seg xml:id="L103-01-07-13">katsi</seg><c>.</c></seg>
               <graphic url="L103-02.png"/>
            </p>
         </div>
         <div xml:id="L103-02">
            <p><graphic url="L103-03.png"/>
               <seg xml:id="L103-02-01"><seg xml:id="L103-02-01-01">Sara</seg> <seg xml:id="L103-02-01-02">ntikuain</seg> <seg xml:id="L103-02-01-03">ka</seg> <seg xml:id="L103-02-01-04">niyu'u</seg> <seg xml:id="L103-02-01-05">xiin</seg> <seg xml:id="L103-02-01-06">vari</seg> <seg xml:id="L103-02-01-07">kunkatsi</seg> <seg xml:id="L103-02-01-08">tina</seg> <seg xml:id="L103-02-01-09">ka</seg> <seg xml:id="L103-02-01-10">ña</seg><c>,</c> <seg xml:id="L103-02-01-11">sketi</seg> <seg xml:id="L103-02-01-12">kua'in</seg> <seg xml:id="L103-02-01-13">ntantuki</seg> <seg xml:id="L103-02-01-14">na</seg> <seg xml:id="L103-02-01-15">ka'an</seg> <seg xml:id="L103-02-01-16">tsa'a</seg> <seg xml:id="L103-02-01-17">ti</seg><c>.</c></seg>
               <seg xml:id="L103-02-02"><seg xml:id="L103-02-02-01">Ntsaa</seg> <seg xml:id="L103-02-02-02">ti</seg> <seg xml:id="L103-02-02-03">nuu</seg> <seg xml:id="L103-02-02-04">kixi</seg> <seg xml:id="L103-02-02-05">iin</seg> <seg xml:id="L103-02-02-06">stiki</seg> <seg xml:id="L103-02-02-07">luu</seg> <seg xml:id="L103-02-02-08">nikachi</seg> <seg xml:id="L103-02-02-09">ti</seg> <seg xml:id="L103-02-02-10">saan</seg><c>:</c></seg>
               <seg xml:id="L103-02-03"><c>—</c><seg xml:id="L103-02-03-01">Na'a</seg><c>,</c> <seg xml:id="L103-02-03-02">ka'un</seg> <seg xml:id="L103-02-03-03">tsa'a</seg> <seg xml:id="L103-02-03-04">vari</seg> <seg xml:id="L103-02-03-05">kunkatsi</seg> <seg xml:id="L103-02-03-06">tina</seg> <seg xml:id="L103-02-03-07">ka</seg> <seg xml:id="L103-02-03-08">yu</seg><c>.</c></seg>
               <seg xml:id="L103-02-04"><seg xml:id="L103-02-04-01">Stiki</seg> <seg xml:id="L103-02-04-02">ka</seg> <seg xml:id="L103-02-04-03">nikachi</seg> <seg xml:id="L103-02-04-04">ti</seg><c>:</c></seg>
               <seg xml:id="L103-02-05"><c>—</c><seg xml:id="L103-02-05-01">Ma</seg> <seg xml:id="L103-02-05-02">sta'un</seg> <seg xml:id="L103-02-05-03">yu</seg> <seg xml:id="L103-02-05-04">vari</seg> <seg xml:id="L103-02-05-05">kixi</seg> <seg xml:id="L103-02-05-06">yu</seg><c>.</c></seg>
            </p>
         </div>
         <div xml:id="L103-03">
            <p>
               <seg xml:id="L103-03-01"><seg xml:id="L103-03-01-01">Ntakoi</seg> <seg xml:id="L103-03-01-02">ra</seg> <seg xml:id="L103-03-01-03">nikachia</seg> <seg xml:id="L103-03-01-04">ntikuain</seg> <seg xml:id="L103-03-01-05">ka</seg> <seg xml:id="L103-03-01-06">ntsani</seg> <seg xml:id="L103-03-01-07">kuia</seg> <seg xml:id="L103-03-01-08">ntikuain</seg> <seg xml:id="L103-03-01-09">ka</seg><c>.</c></seg>
               <seg xml:id="L103-03-02"><seg xml:id="L103-03-02-01">Sara</seg> <seg xml:id="L103-03-02-02">nikitsaa</seg> <seg xml:id="L103-03-02-03">kue</seg> <seg xml:id="L103-03-02-04">amigu</seg> <seg xml:id="L103-03-02-05">ntikuain</seg> <seg xml:id="L103-03-02-06">ka</seg> <seg xml:id="L103-03-02-07">ra</seg> <seg xml:id="L103-03-02-08">nikachi</seg> <seg xml:id="L103-03-02-09">kueti</seg><c>:</c></seg>
               <seg xml:id="L103-03-03"><c>—</c><c>¿</c><seg xml:id="L103-03-03-01">Nchii</seg> <seg xml:id="L103-03-03-02">kua'an</seg> <seg xml:id="L103-03-03-03">amigu</seg> <seg xml:id="L103-03-03-04">ko</seg><c>?</c> <c>—</c><seg xml:id="L103-03-03-05">nikachi</seg> <seg xml:id="L103-03-03-06">kueti</seg><c>.</c></seg>
            </p>
            <p><graphic url="L103-04.png"/>
               <seg xml:id="L103-03-04"><seg xml:id="L103-03-04-01">Sara</seg> <seg xml:id="L103-03-04-02">ntsaa</seg> <seg xml:id="L103-03-04-03">kueti</seg> <seg xml:id="L103-03-04-04">ika</seg> <seg xml:id="L103-03-04-05">tsa</seg> <seg xml:id="L103-03-04-06">ntsutsi</seg> <seg xml:id="L103-03-04-07">stiki</seg> <seg xml:id="L103-03-04-08">ka</seg> <seg xml:id="L103-03-04-09">amigu</seg> <seg xml:id="L103-03-04-10">ña</seg><c>.</c></seg>
               <seg xml:id="L103-03-05"><seg xml:id="L103-03-05-01">Sara</seg> <seg xml:id="L103-03-05-02">nikitsaa</seg> <seg xml:id="L103-03-05-03">kuu</seg> <seg xml:id="L103-03-05-04">kuaka</seg> <seg xml:id="L103-03-05-05">xeen</seg> <seg xml:id="L103-03-05-06">ini</seg> <seg xml:id="L103-03-05-07">kueti</seg> <seg xml:id="L103-03-05-08">ñantu'u</seg> <seg xml:id="L103-03-05-09">kueti</seg> <seg xml:id="L103-03-05-10">vari</seg> <seg xml:id="L103-03-05-11">ntsi'i</seg> <seg xml:id="L103-03-05-12">amigu</seg> <seg xml:id="L103-03-05-13">ña</seg><c>.</c></seg>
            </p>
         </div>
      </body>
   </text>
</TEI>
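
The two-level "seg" convention and the "c" punctuation elements described in the encoding description can be read back with a short script. The following is a minimal sketch in Python using only the standard library, not part of the original encoding project. The names `SAMPLE`, `sentences` and `is_question` are hypothetical. The sample combines two sentences copied from this document (L103-02-04 and L103-03-03) into a single paragraph for illustration, and writes the dialogue dash as the character reference `&#8212;`. The question test is a simplifying assumption: it only checks that both "¿" and "?" occur among the punctuation of a sentence, since the formal linking of punctuation to sentence types is described as future work.

```python
import xml.etree.ElementTree as ET

TEI = "http://www.tei-c.org/ns/1.0"
NS = {"tei": TEI}
XML_ID = "{http://www.w3.org/XML/1998/namespace}id"

# Two sentences copied from the body of this document, wrapped in one <p>
# for illustration only (in the source they sit in different <div>s).
SAMPLE = """<body xmlns="http://www.tei-c.org/ns/1.0">
  <p>
    <seg xml:id="L103-02-04"><seg xml:id="L103-02-04-01">Stiki</seg> <seg xml:id="L103-02-04-02">ka</seg> <seg xml:id="L103-02-04-03">nikachi</seg> <seg xml:id="L103-02-04-04">ti</seg><c>:</c></seg>
    <seg xml:id="L103-03-03"><c>&#8212;</c><c>¿</c><seg xml:id="L103-03-03-01">Nchii</seg> <seg xml:id="L103-03-03-02">kua'an</seg> <seg xml:id="L103-03-03-03">amigu</seg> <seg xml:id="L103-03-03-04">ko</seg><c>?</c> <c>&#8212;</c><seg xml:id="L103-03-03-05">nikachi</seg> <seg xml:id="L103-03-03-06">kueti</seg><c>.</c></seg>
  </p>
</body>"""


def sentences(root):
    """Yield (xml:id, lexical items, punctuation marks) per sentence.

    First-level <seg> elements (direct children of <p>) are sentences;
    their direct <seg> children are lexical items and their direct <c>
    children are punctuation characters.
    """
    for p in root.iter(f"{{{TEI}}}p"):
        for sent in p.findall("tei:seg", NS):
            tokens = [tok.text for tok in sent.findall("tei:seg", NS)]
            marks = [c.text for c in sent.findall("tei:c", NS)]
            yield sent.get(XML_ID), tokens, marks


def is_question(marks):
    """Assumed heuristic: both Spanish-style question marks are present."""
    return "¿" in marks and "?" in marks


if __name__ == "__main__":
    for sent_id, tokens, marks in sentences(ET.fromstring(SAMPLE)):
        kind = "interrogative" if is_question(marks) else "other"
        print(sent_id, kind, " ".join(tokens))
```

To run the same functions over the full file, `ET.parse(path).getroot()` can replace `ET.fromstring(SAMPLE)`. Only "seg" elements inside "p" are visited, so the segmented titles in the header are skipped.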