Universal Dependencies

Universal Dependencies, frequently abbreviated as UD, is an international cooperative project to create treebanks of the world's languages.[1] These treebanks are openly available. Core applications are automated text processing in the field of natural language processing (NLP) and research into natural language syntax and grammar, especially within linguistic typology. The project's primary aim is to achieve cross-linguistic consistency of annotation, while still permitting language-specific extensions when necessary. The annotation scheme has its roots in three related projects: Stanford Dependencies,[2] Google universal part-of-speech tags,[3] and the Interset interlingua[4] for morphosyntactic tagsets. The UD annotation scheme represents sentences as dependency trees, as opposed to phrase structure trees. As of January 2022, there are just over 200 treebanks of more than 100 languages available in the UD inventory.

Dependency structures

The UD annotation scheme produces syntactic analyses of sentences in terms of the dependencies of dependency grammar. Each dependency is characterized in terms of a syntactic function, which is shown using a label on the dependency edge. For example:[5]

First UD picture

This analysis shows that she, him, and a note are dependents of the verb left. The pronoun she is identified as a nominal subject (nsubj), the pronoun him as an indirect object (iobj), and the noun phrase a note as a direct object (obj). There is a further dependency that connects a to note, although it is not shown. A second example:

UD picture 2

This analysis identifies it as the subject (nsubj), is as the copula (cop), and for as a case marker (case), all of which are shown as dependents of the root word her, which is a pronoun. The next example includes an expletive and an oblique object:

UD picture 3

This analysis identifies there as an expletive (expl), food as a nominal subject (nsubj), kitchen as an oblique object (obl), and in as a case marker (case). There is also a dependency connecting the to kitchen, but it is not shown. In this case the copula is is positioned as the root of the sentence, contrary to how the copula is analyzed in the second example just above, where it is positioned as a dependent of the root.
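The first analysis above can be written down as a small data structure. The following Python sketch is only an illustration, not the project's own file format. It assumes the word order "She left him a note" (reconstructed from the words named in the text), the label det for the unshown dependency from note to a, and the label root for the topmost word. The names Token, dependents_of, and root_of are hypothetical.

```python
from typing import List, NamedTuple, Tuple


class Token(NamedTuple):
    index: int      # 1-based position in the sentence
    form: str       # the word itself
    head: int       # index of the head word; 0 marks the root
    relation: str   # label on the dependency edge


# Assumed sentence: "She left him a note" (word order is an assumption).
SENTENCE = [
    Token(1, "She", 2, "nsubj"),   # nominal subject of "left"
    Token(2, "left", 0, "root"),   # root of the tree (label assumed)
    Token(3, "him", 2, "iobj"),    # indirect object of "left"
    Token(4, "a", 5, "det"),       # dependency not shown in the figure (label assumed)
    Token(5, "note", 2, "obj"),    # direct object of "left"
]


def dependents_of(tokens: List[Token], head_form: str) -> List[Tuple[str, str]]:
    """Return (form, relation) for every direct dependent of the given word."""
    head = next(t for t in tokens if t.form == head_form)
    return [(t.form, t.relation) for t in tokens if t.head == head.index]


def root_of(tokens: List[Token]) -> str:
    """Return the form of the word whose head index is 0."""
    return next(t.form for t in tokens if t.head == 0)


if __name__ == "__main__":
    print(root_of(SENTENCE))
    print(dependents_of(SENTENCE, "left"))
```

Storing the head as an index rather than as a word keeps the structure unambiguous when the same word form occurs twice in a sentence.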

The examples of UD annotation just provided can of course give only an impression of the nature of the UD project and its annotation scheme. The emphasis for UD is on producing cross-linguistically consistent dependency analyses in order to facilitate structural parallelism across diverse languages. To this end, UD uses a universal POS tagset for all languages, although a given language does not have to make use of each tag. More specific information can be added to each word by means of a free morpho-syntactic feature set. The universal labels of dependency links can be further specified with secondary relations, which are written as a secondary label after a colon, e.g. nsubj:pass, following the "universal:extension" format.
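The "universal:extension" format is simple enough to take apart mechanically. The following Python sketch shows one way to separate a label such as nsubj:pass into its universal part and its optional extension. The function name split_relation is hypothetical, and the sketch assumes that a label contains at most one colon.

```python
from typing import Optional, Tuple


def split_relation(label: str) -> Tuple[str, Optional[str]]:
    """Split a relation label into (universal, extension).

    The extension is None when the label has no colon.
    """
    universal, separator, extension = label.partition(":")
    return universal, (extension if separator else None)


if __name__ == "__main__":
    print(split_relation("nsubj:pass"))
    print(split_relation("obj"))
```

Comparing only the universal part allows labels from different treebanks to be matched even when one of them uses a language-specific extension.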

Function words

Within the dependency grammar community, the UD annotation scheme is controversial. The main bone of contention concerns the analysis of function words. UD chooses to subordinate function words to content words,[6] a practice that is contrary to most works in the tradition of dependency grammar.[7] To briefly illustrate this controversy, UD would produce the following structural analysis of the sentence given:

Fourth UD picture, illustrates analysis of function words

This example is taken from Osborne & Gerdes (2019).[8] It uses a different convention for showing dependencies than the examples above. Since the syntactic functions are not important for the point at hand, they are excluded from this structural analysis. What is important is the manner in which this UD analysis subordinates the auxiliary verb will to the content verb say, the preposition to to the pronoun you, the subordinator that to the content verb likes, and the particle to to the content verb swim.

A more traditional dependency grammar analysis of this sentence, one that is motivated more by syntactic considerations than by semantic ones, looks like this:[9]

UD picture 5

This traditional analysis subordinates the content verb say to the auxiliary verb will, the pronoun you to the preposition to, the content verb likes to the subordinator that, and the content verb swim to the particle to.
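The difference between the two analyses comes down to the direction of four edges. The following Python sketch lists the function word and content word pairs named in the text and derives the (head, dependent) edges under each convention. It covers only these four pairs, not the full trees, and the names PAIRS, ud_edges, and traditional_edges are hypothetical.

```python
from typing import List, Tuple

# (function word, content word) pairs named in the text.
PAIRS: List[Tuple[str, str]] = [
    ("will", "say"),    # auxiliary verb and content verb
    ("to", "you"),      # preposition and pronoun
    ("that", "likes"),  # subordinator and content verb
    ("to", "swim"),     # particle and content verb
]


def ud_edges(pairs: List[Tuple[str, str]]) -> List[Tuple[str, str]]:
    """UD convention: the content word is the head of the function word."""
    return [(content, function) for function, content in pairs]


def traditional_edges(pairs: List[Tuple[str, str]]) -> List[Tuple[str, str]]:
    """Traditional convention: the function word is the head of the content word."""
    return [(function, content) for function, content in pairs]


if __name__ == "__main__":
    for ud, trad in zip(ud_edges(PAIRS), traditional_edges(PAIRS)):
        print(f"UD: {ud[0]} -> {ud[1]}    traditional: {trad[0]} -> {trad[1]}")
```

Each edge is written as (head, dependent), so the two conventions yield the same pairs with the order reversed.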

Notes

  1. ^ de Marneffe, Marie-Catherine; Manning, Christopher D.; Nivre, Joakim; Zeman, Daniel (13 July 2021). "Universal Dependencies". Computational Linguistics. 47 (2): 255–308. doi:10.1162/coli_a_00402. S2CID 219304854.
  2. ^ "Stanford Dependencies". nlp.stanford.edu. The Stanford Natural Language Processing Group. Retrieved 8 May 2020.
  3. ^ Petrov, Slav (11 Apr 2011). "A Universal Part-of-Speech Tagset". arXiv:1104.2086 [cs.CL].
  4. ^ "Interset". cuni.cz. Institute of Formal and Applied Linguistics (Czech Republic). Retrieved 8 May 2020.
  5. ^ The three example analyses that appear in this section have been taken from the UD webpage here, examples 3, 21, and 23.
  6. ^ On this choice, see Nivre (2015).
  7. ^ The controversy surrounding UD and the status of function words in dependency grammar in general is discussed at length in Osborne & Gerdes (2019).
  8. ^ This structure is (1b) in Osborne & Gerdes (2019).
  9. ^ This structure is (1c) in Osborne & Gerdes (2019).

References

  • de Marneffe, Marie-Catherine, Christopher D. Manning, Joakim Nivre and Daniel Zeman. 2021. Universal Dependencies. In Computational Linguistics 47(2), 255–308. doi:10.1162/coli_a_00402
  • de Marneffe, Marie-Catherine, Bill MacCartney and Christopher D. Manning. 2006. Generating Typed Dependency Parses from Phrase Structure Parses. In the Proceedings of the Language Resources and Evaluation Conference (LREC) 2006, 449–454. Genoa.
  • de Marneffe, Marie-Catherine and Christopher D. Manning. 2008. The Stanford typed dependency representation. Proceedings of the COLING Workshop on Cross-Framework and Cross-Domain Parser Evaluation, 92–97. Sofia. doi:10.3115/1608858.1608859
  • de Marneffe, Marie-Catherine, Timothy Dozat, Natalia Silveira, Katrin Haverinen, Filip Ginter, Joakim Nivre and Christopher D. Manning. 2014. Universal Stanford Dependencies: A cross-linguistic typology. In The International Conference on Language Resources and Evaluation (LREC) 2014, 4585–4592.
  • Nivre, Joakim. 2015. Towards a Universal Grammar for Natural Language Processing. CICLING 2015: 16th International Conference on Intelligent Text Processing and Computational Linguistics, 3-16. doi:10.1007/978-3-319-18111-0_1
  • Osborne, Timothy & Kim Gerdes. 2019. The status of function words in dependency grammar: A critique of Universal Dependencies (UD). Glossa: A Journal of General Linguistics 4(1), 17. doi:10.5334/gjgl.537.
  • Petrov, Slav, Dipanjan Das and Ryan McDonald. 2012. A universal part-of-speech tagset. In The International Conference on Language Resources and Evaluation (LREC) 2012, 2089–2096. Istanbul.
  • Zeman, Daniel. 2008. Reusable tagset conversion using tagset drivers. In The International Conference on Language Resources and Evaluation (LREC) 2008, 213–218. Marrakech.
