Share to: share facebook share twitter share wa share telegram print page

DBpedia

DBpedia
Developer(s)
Initial release10 January 2007 (17 years ago) (2007-01-10)
Stable release
DBpedia 2016-10 / 4 July 2017
Repository
Written in
Type
LicenseGNU General Public License
Websitedbpedia.org

DBpedia (from "DB" for "database") is a project aiming to extract structured content from the information created in the Wikipedia project. This structured information is made available on the World Wide Web using OpenLink Virtuoso.[1][2] DBpedia allows users to semantically query relationships and properties of Wikipedia resources, including links to other related datasets.[3]

The project was heralded as "one of the more famous pieces" of the decentralized Linked Data effort by Tim Berners-Lee, one of the Internet's pioneers.[4] As of June 2021, DBPedia contained over 850 million triples.

Background

The project was started by people at the Free University of Berlin and Leipzig University[5] in collaboration with OpenLink Software, and is now maintained by people at the University of Mannheim and Leipzig University.[6][7] The first publicly available dataset was published in 2007.[5] The data is made available under free licenses (CC BY-SA), allowing others to reuse the dataset; it therefore doesn't however use an open data license to waive the sui generis database rights.

Wikipedia articles consist mostly of free text, but also include structured information embedded in the articles, such as "infobox" tables (the pull-out panels that appear in the top right of the default view of many Wikipedia articles, or at the start of the mobile versions), categorization information, images, geo-coordinates and links to external Web pages. This structured information is extracted and put in a uniform dataset which can be queried.

Dataset

The 2016-04 release of the DBpedia data set describes 6.0 million entities, out of which 5.2 million are classified in a consistent ontology, including 1.5 million persons, 810,000 places, 135,000 music albums, 106,000 films, 20,000 video games, 275,000 organizations, 301,000 species and 5,000 diseases.[8] DBpedia uses the Resource Description Framework (RDF) to represent extracted information and consists of 9.5 billion RDF triples, of which 1.3 billion were extracted from the English edition of Wikipedia and 5.0 billion from other language editions.[8]

From this data set, information spread across multiple pages can be extracted. For example, book authorship can be put together from pages about the work, or the author.[further explanation needed]

One of the challenges in extracting information from Wikipedia is that the same concepts can be expressed using different parameters in infobox and other templates, such as |birthplace= and |placeofbirth=. Because of this, queries about where people were born would have to search for both of these properties in order to get more complete results. As a result, the DBpedia Mapping Language has been developed to help in mapping these properties to an ontology while reducing the number of synonyms. Due to the large diversity of infoboxes and properties in use on Wikipedia, the process of developing and improving these mappings has been opened to public contributions.[9]

Version 2014 was released in September 2014.[10] A main change since previous versions was the way abstract texts were extracted. Specifically, running a local mirror of Wikipedia and retrieving rendered abstracts from it made extracted texts considerably cleaner. Also, a new data set extracted from Wikimedia Commons was introduced.

As of June 2021, DBPedia contains over 850 million triples.[11]

Examples

DBpedia extracts factual information from Wikipedia pages, allowing users to find answers to questions where the information is spread across multiple Wikipedia articles. Data is accessed using an SQL-like query language for RDF called SPARQL.

For example, if one were interested in the Japanese shōjo manga series Tokyo Mew Mew, and wanted to find the genres of other works written by its illustrator Mia Ikumi. DBpedia combines information from Wikipedia's entries on Tokyo Mew Mew, Mia Ikumi and on this author's works such as Super Doll Licca-chan and Koi Cupid. Since DBpedia normalises information into a single database, the following query can be asked without needing to know exactly which entry carries each fragment of information, and will list related genres:

PREFIX dbprop: <http://dbpedia.org/ontology/>
PREFIX db: <http://dbpedia.org/resource/>
SELECT ?who, ?WORK, ?genre WHERE {
 db:Tokyo_Mew_Mew dbprop:author ?who .
 ?WORK  dbprop:author ?who .
 OPTIONAL { ?WORK dbprop:genre ?genre } .
}

Use cases

DBpedia has a broad scope of entities covering different areas of human knowledge. This makes it a natural hub for connecting datasets, where external datasets could link to its concepts.[12] The DBpedia dataset is interlinked on the RDF level with various other Open Data datasets on the Web. This enables applications to enrich DBpedia data with data from these datasets. As of September 2013, there are more than 45 million interlinks between DBpedia and external datasets including: Freebase, OpenCyc, UMBEL, GeoNames, MusicBrainz, CIA World Fact Book, DBLP, Project Gutenberg, DBtune Jamendo, Eurostat, UniProt, Bio2RDF, and US Census data.[13][14] The Thomson Reuters initiative OpenCalais, the Linked Open Data project of The New York Times, the Zemanta API[15] and DBpedia Spotlight also include links to DBpedia.[16][17][18] The BBC uses DBpedia to help organize its content.[19][20] Faviki uses DBpedia for semantic tagging.[21] Samsung also includes DBpedia in its "Knowledge Sharing Platform".

Such a rich source of structured cross-domain knowledge is fertile ground for artificial intelligence systems. DBpedia was used as one of the knowledge sources in IBM Watson's Jeopardy! winning system[22]

Amazon provides a DBpedia Public Data Set that can be integrated into Amazon Web Services applications.[23]

Data about creators from DBpedia can be used for enriching artworks' sales observations.[24]

The crowdsourcing software company, Ushahidi, built a prototype of its software that leveraged DBpedia to perform semantic annotations on citizen-generated reports. The prototype incorporated the "YODIE" (Yet another Open Data Information Extraction system) service[25] developed by the University of Sheffield, which uses DBpedia to perform the annotations. The goal for Ushahidi was to improve the speed and facility with which incoming reports could be validated managed.[26]

DBpedia Spotlight

DBpedia Spotlight is a tool for annotating mentions of DBpedia resources in text. This allows linking unstructured information sources to the Linked Open Data cloud through DBpedia. DBpedia Spotlight performs named entity extraction, including entity detection and name resolution (in other words, disambiguation). It can also be used for named entity recognition, and other information extraction tasks. DBpedia Spotlight aims to be customizable for many use cases. Instead of focusing on a few entity types, the project strives to support the annotation of all 3.5 million entities and concepts from more than 320 classes in DBpedia. The project started in June 2010 at the Web Based Systems Group at the Free University of Berlin.

DBpedia Spotlight is publicly available as a web service for testing and a Java/Scala API licensed via the Apache License. The DBpedia Spotlight distribution includes a jQuery plugin that allows developers to annotate pages anywhere on the Web by adding one line to their page.[27] Clients are also available in Java or PHP.[28] The tool handles various languages through its demo page[29] and web services. Internationalization is supported for any language that has a Wikipedia edition.[30]

Archivo ontology database

From 2020, the DBpedia project provides a regularly updated database of web‑accessible ontologies written in the OWL ontology language.[31] Archivo also provides a four star rating scheme for the ontologies it scrapes, based on accessibility, quality, and related fitness‑for‑use criteria. For instance, SHACL compliance for graph‑based data is evaluated when appropriate. Ontologies should also contain metadata about their characteristics and specify a public license describing their terms‑of‑use.[32][33] As of June 2021 the Archivo database contains 1368 entries.

History

DBpedia was initiated in 2007 by Sören Auer, Christian Bizer, Georgi Kobilarov, Jens Lehmann, Richard Cyganiak and Zachary Ives.[5]

See also

References

  1. ^ Bizer, Christian; Lehmann, Jens; Kobilarov, Georgi; Auer, Soren; Becker, Christian; Cyganiak, Richard; Hellmann, Sebastian (September 2009). "DBpedia - A crystallization point for the Web of Data" (PDF). Web Semantics: Science, Services and Agents on the World Wide Web. 7 (3): 154–165. CiteSeerX 10.1.1.150.4898. doi:10.1016/j.websem.2009.07.002. ISSN 1570-8268. Archived from the original (PDF) on 10 August 2017. Retrieved 11 December 2015.
  2. ^ "About DBpedia". DBpedia. Retrieved 14 January 2024.
  3. ^ "Komplett verlinkt — Linked Data" (in German). 3sat. 19 June 2009. Archived from the original on 6 January 2013. Retrieved 10 November 2009.
  4. ^ "Sir Tim Berners-Lee Talks with Talis about the Semantic Web". Talis. 7 February 2008. Archived from the original on 10 May 2013.
  5. ^ a b c DBpedia: A Nucleus for a Web of Open Data, available at [1], [2], or [3]
  6. ^ "Credits". DBpedia. Archived from the original on 21 September 2014. Retrieved 9 September 2014.
  7. ^ "Home". March 2024.
  8. ^ a b "YEAH! We did it again ;) – New 2016-04 DBpedia release". DBpedia. 19 October 2016. Retrieved 9 January 2019.
  9. ^ "DBpedia Mappings". mappings.dbpedia.org. Retrieved 3 April 2010.
  10. ^ "Changelog". DBpedia. September 2014. Retrieved 9 September 2014.
  11. ^ Holze, Julia (23 July 2021). "Announcement: DBpedia Snapshot 2021-06 Release". DBpedia Association. Retrieved 28 July 2021.
  12. ^ E. Curry, A. Freitas, and S. O'Riáin, "The Role of Community-Driven Data Curation for Enterprises", Archived 23 January 2012 at the Wayback Machine in Linking Enterprise Data, D. Wood, Ed. Boston, MA: Springer US, 2010, pp. 25-47.
  13. ^ "Statistics on links between Data sets", SWEO Community Project: Linking Open Data on the Semantic Web, W3C, retrieved 24 November 2009
  14. ^ "Statistics on Data sets", SWEO Community Project: Linking Open Data on the Semantic Web, W3C, retrieved 24 November 2009
  15. ^ "Zemanta API". dev.zemanta.com. Retrieved 26 July 2021.
  16. ^ Sandhaus, Evan; Larson, Rob (29 October 2009). "First 5,000 Tags Released to the Linked Data Cloud". The New York Times Blogs. Retrieved 10 November 2009.
  17. ^ "Life in the Linked Data Cloud". opencalais.com. Archived from the original on 24 November 2009. Retrieved 10 November 2009. Wikipedia has a Linked Data twin called DBpedia. DBpedia has the same structured information as Wikipedia – but translated into a machine-readable format.
  18. ^ "Zemanta talks Linked Data with SDK and commercial API". ZDNet. Archived from the original on 28 February 2010. Retrieved 10 November 2009. Zemanta fully supports the Linking Open Data initiative. It is the first API that returns disambiguated entities linked to dbPedia, Freebase, MusicBrainz, and Semantic Crunchbase.
  19. ^ "European Semantic Web Conference 2009 - Georgi Kobilarov, Tom Scott, Yves Raimond, Silver Oliver, Chris Sizemore, Michael Smethurst, Christian Bizer and Robert Lee. Media meets Semantic Web - How the BBC uses DBpedia and Linked Data to make Connections". eswc2009.org. Archived from the original on 8 June 2009. Retrieved 10 November 2009.
  20. ^ "BBC Learning - Open Lab - Reference". BBC. Archived from the original on 25 August 2009. Retrieved 10 November 2009. Dbpedia is a database version of Wikipedia. It is used in a lot of projects for a wide range of different reasons. At the BBC we are using it for tagging content.
  21. ^ "Semantic Tagging with Faviki". readwriteweb.com. Archived from the original on 29 January 2010.
  22. ^ David Ferrucci, Eric Brown, Jennifer Chu-Carroll, James Fan, David Gondek, Aditya A. Kalyanpur, Adam Lally, J. William Murdock, Eric Nyberg, John Prager, Nico Schlaefer, and Chris Welty "Building Watson: An Overview of the DeepQA Project." Archived 6 November 2020 at the Wayback Machine In AI Magazine Fall, 2010. Association for the Advancement of Artificial Intelligence (AAAI).
  23. ^ "Amazon Web Services Developer Community : DBpedia". developer.amazonwebservices.com. Archived from the original on 13 February 2010. Retrieved 10 November 2009.
  24. ^ Filipiak, Dominik; Filipowska, Agata (2 December 2015). "DBpedia in the Art Market". Business Information Systems Workshops. Lecture Notes in Business Information Processing. Vol. 228. pp. 321–331. doi:10.1007/978-3-319-26762-3_28. ISBN 978-3-319-26761-6.
  25. ^ "GATE.ac.uk - applications/yodie.html". gate.ac.uk. Retrieved 11 May 2020.
  26. ^ "ushahidi/platform-comrades". GitHub. 30 June 2019. Retrieved 9 March 2020.
  27. ^ Mendes, Pablo. "DBpedia Spotlight jQuery Plugin". jQuery Plugins. Archived from the original on 3 April 2011. Retrieved 15 September 2011.
  28. ^ DiCiuccio, Rob (25 September 2016). "PHP Client for DBpedia Spotlight". GitHub.
  29. ^ "Demo of DBpedia Spotlight". Retrieved 8 September 2013.
  30. ^ "Internationalization of DBpedia Spotlight". GitHub. Retrieved 8 September 2013.
  31. ^ "DBpedia Archivo". Retrieved 8 July 2021.
  32. ^ Frey, Johannes; Streitmatter, Denis; Götz, Fabian; Hellmann, Sebastian; Arndt, Natanael (27 October 2020). "DBpedia Archivo: a web-scale interface for ontology archiving under consumer-oriented aspects". In Sure-Vetter, York; Sack, Harald; Cudré-Mauroux, Philippe; Maleshkova, Maria; Pellegrini, Tassilo; Acosta, Maribel (eds.). Semantic systems: the power of AI and knowledge graphs. Cham, Switzerland: Springer. doi:10.1007/978-3-030-59833-4_2. ISBN 978-3-030-59832-7. S2CID 219939266. Download as PDF or ePUB. Open access icon
  33. ^ Frey, Johannes; Streitmatter, Denis; Götz, Fabian; Hellmann, Sebastian; Arndt, Natanael (10 September 2020). DBpedia Archivo: a web-scale interface for ontology archiving under consumer-oriented aspects. Leipzig, Germany: Institut für Angewandte Informatik (InfAI). Retrieved 8 July 2021. YouTube video 00:10:38.

External links

Baca informasi lainnya yang berhubungan dengan : article

Article 19 Article 20

Read other articles:

La región de Toyo (東予地方, Tōyo-chihō?) es una de las regiones en que se subdivide la prefectura de Ehime, y corresponde a su porción oriental. Actualmente está conformada por las ciudades de Shikokuchuo, Niihama, Saijo e Imabari; y el pueblo de Kamijima del distrito de Ochi. Ubicación de la región de Toyo en la prefectura de Ehime. Véase también Región de Chuyo Región de Nanyo Control de autoridades Proyectos Wikimedia Datos: Q9067702 Datos: Q9067702

English rugby league club For the club of the same name founded in 1883, see Hunslet F.C. (1883). This article includes a list of general references, but it lacks sufficient corresponding inline citations. Please help to improve this article by introducing more precise citations. (November 2016) (Learn how and when to remove this template message) Hunslet R.L.F.C.Club informationFull nameHunslet Rugby League Football ClubNickname(s)ParksidersColoursFounded1973; 50 years ago (19…

1932年ロサンゼルスオリンピックの飛込競技(1932ねんロサンゼルスオリンピックのとびこみきょうぎ)は、1932年8月8日から8月13日の競技日程で実施された。 概要 種目は男女3m飛板飛び込み、10m高飛び込みの4種目を実施した。 競技結果 男子 3m飛板飛び込み 日程:8月8日 順位 選手名 国・地域 合計 1 マイケル・ガリツェン アメリカ合衆国 161.38 2 ハロルド・スミス アメリカ

Лінія Дельвіга Лінія Дельвіга на мапі Східної Галичини Лінія Дельвіга — демаркаційна лінія між Українською Народною Республікою та Польщею, встановлена в червні 1919 року. На початку червня 1919 р. Українська Галицька Армія розпочала Чортківський наступ (офензиву), в хо…

American strategist and expert on China (born 1945) Michael PillsburyBornMichael Paul Pillsbury (1945-02-08) February 8, 1945 (age 78)California, USNationalityAmericanEducationStanford University (B.A.)Columbia University (Ph.D.)Occupation(s)Foreign policy strategist, government official, authorYears active1978–PresentKnown forGrand strategy, Chinese studiesPolitical partyRepublicanChinese nameChinese白邦瑞TranscriptionsStandard MandarinHanyu PinyinBái Bāngruì Michael Paul…

هذه المقالة يتيمة إذ تصل إليها مقالات أخرى قليلة جدًا. فضلًا، ساعد بإضافة وصلة إليها في مقالات متعلقة بها. (سبتمبر 2017) لينز كيسلر كيسلر في عام 2013 معلومات شخصية الاسم عند الميلاد (بالدنماركية: Lenina Christiansen)‏  الميلاد 31 مارس 1966(1966-03-31)كوبنهاغن، الدنمارك الجنسية مملكة الدنمارك…

  Siprus   Siprus Utara   Turki Siprus dan Turki telah terlibat dalam sengketa atas sejauh mana mereka zona ekonomi eksklusif, seolah-olah dipicu oleh eksplorasi minyak dan gas di area mediterania. Turki keberatan dengan pengeboran Siprus di perairan yang diklaim Siprus berdasarkan hukum maritim internasional. Sengketa zona maritim saat ini menyentuh sengketa Siprus dan Aegea yang abadi; Turki adalah satu-satunya negara anggota Perserikatan Bangsa-Bangsa yang tidak …

1949 Venezuelan filmThe Demon is an AngelFilm posterDirected byCarlos Hugo ChristensenStarringSusana Freyre Juana Sujo Juan Carlos ThorryProductioncompanyBolívar FilmsDistributed byBolívar FilmsRelease date1949CountryVenezuelaLanguageSpanish The Demon is an Angel (Spanish:Un ángel sin pudor) is a 1949 Venezuelan comedy film directed by Carlos Hugo Christensen and starring Susana Freyre, Juana Sujo and Juan Carlos Thorry.[1] Cast Susana Freyre Juana Sujo Juan Carlos Thorry References ^…

Париж - Люксембург 1969 Детали гонкиМаршрут7-й Париж — ЛюксембургСоревнованиеСупер Престиж Перно 1969Этапы2Даты5 – 6 август 1969Дистанция478 кмСтраны Франция ЛюксембургМесто стартаПарижМесто финишаЛюксембургКоманды14Стартовало гонщиков98Финишировало гонщиков73Средняя скорос…

Đừng nhầm lẫn với Đĩa tiền hành tinh. Tinh vân Westbrook, một tinh vân tiền hành tinh. Tinh vân tiền hành tinh là một thiên thể tồn tại chóng vánh ở giai đoạn trước khi một ngôi sao tiến hóa từ pha cuối nhánh tiệm cận khổng lồ[1] thành tinh vân hành tinh. Một tinh vân tiền hành tinh phát ra bức xạ hồng ngoại mạnh và nó là một loại tinh vân phản xạ. Đây là giai đoạn tiến hóa có độ …

Neighborhoods of Queens in New York CityHollisNeighborhoods of QueensSt. Gabriel's Hollis Episcopal ChurchLocation within New York CityCoordinates: 40°43′N 73°46′W / 40.71°N 73.76°W / 40.71; -73.76Country United StatesState New YorkCityNew York CityCounty/BoroughQueensCommunity DistrictQueens 12[1]Population (2010)[2] • Total20,269Ethnicity • Black64.0% • Asian10.7% • White2.3% •…

O Universo Estendido DC (em inglês: DC Extended Universe; DCEU) é uma franquia cinematográfica estadunidense que abrange um universo fictício compartilhado entre diversos filmes de super-heróis. Os títulos deste universo cinematográfico abordam tramas centradas nos personagens das revistas em quadrinhos da DC Comics e são distribuídos pela Warner Bros., uma das principais companhias do cinema norte-americano. A linha narrativa do Universo Estendido DC teve início com o filme Man of Ste…

Paul Radomski in the uniform of an Untersturmführer of the Allgemeine SS Paul Otto von Radomski (21 September 1902 – 14 March 1945) was an SS functionary of Nazi Germany. During World War II, he commanded the Syrets concentration camp in the occupied Ukraine, and the Haidari concentration camp, near Athens, Greece. SS career Radomski was an Old Fighter of the Nazi Party, and one of the early companions of the eventual RSHA chief Reinhard Heydrich.[1] Radomski served six months in pris…

List of events ← 2010 2009 2008 2011 in Switzerland → 2012 2013 2014 Decades: 1990s 2000s 2010s 2020s See also: History of Switzerland Timeline of Swiss history List of years in Switzerland Events from 2011 in Switzerland. Events Public holidays in one or several cantons of Switzerland are marked (¹). January January 1: New Year's Day¹ January 2: Berchtoldstag¹ February March March 19:Saint Joseph's Day March 29:Good Friday March 31:Easter Day April April 1:Easter Monday May May…

Railway station in West Sussex, England WarnhamGeneral informationLocationWarnham, District of HorshamEnglandGrid referenceTQ170339Managed bySouthernPlatforms2Other informationStation codeWNHClassificationDfT category F2HistoryOpened1 May 1867Passengers2017/18 13,0442018/19 14,1502019/20 12,8762020/21 2,7122021/22 6,942 NotesPassenger statistics from the Office of Rail and Road Warnham railway station serves the village of Warnham in West Sussex, England. It is 33 miles 46 chains (54.0…

Jodi Shelton, 2013 Jodi Shelton[1] born April 3, 1965, is the co-founder and chief executive officer (CEO) of the Global Semiconductor Alliance (GSA). In 2018, with the inaugural Rising Women of Influence Award,[2] Shelton launched the GSA Women's Leadership Initiative (WLI). The vision of the Initiative is to significantly increase the number of women in leadership roles in the industry, the capital dedicated to women-led start-ups, and the number of STEM-focused female candidat…

Vehicle system with 8 powered wheels Chassis of an 8×8 vehicle Eight-wheel drive, often notated as 8WD or 8×8, is a drivetrain configuration that allows all eight wheels of an eight-wheeled vehicle to be drive wheels (that is, to receive power from the engine) simultaneously. Unlike four-wheel drive drivetrains, the configuration is largely confined to heavy-duty off-road and military vehicles, such as armored vehicles, tractor units or all-terrain vehicles such as the Argo Avenger. Operation …

二二八和平公園228 Peace Memorial Park基本資料類型公園位置 中華民國臺北市中正區凱達格蘭大道3號面積76,180 平方公尺建造1899 年持有人青年公園管理所狀態全年開放停車場景通停車場公共交通 淡水信義線台大醫院站地圖 二二八和平公園繁体字 二二八和平公園 简化字 二二八和平公园 标音官话(現代標準漢語)- 汉语拼音 Èrèrbā Hépíng Jìniàn Gōngyuán - 國語羅馬字…

Huss Park Attractions GmbHTypeGmbHIndustryManufacturingFounded2007 (2007)HeadquartersBremen, GermanyArea servedWorldwideKey peopleMirko J. Schulze (Managing Director)ProductsAmusement ridesWebsitewww.hussrides.com Enterprise at TusenFryd in 2005 (Norway). Opened: 1988 and closed: 2006 Sledgehammer, a HUSS Giant Frisbee at Bobbejaanland, Belgium HUSS Park Attractions (legal name: Huss Park Attractions GmbH[1]) is a company that specializes in developing and manufacturing amusement ri…

2008 single by Gorilla ZoeLostSingle by Gorilla Zoefrom the album Don't Feed da Animals ReleasedOctober 14, 2008Recorded2008GenreHip hopLength4:48LabelBad Boy SouthBlockAtlanticSongwriter(s)Christopher Gholson, Alonzo Mathis, Ebony LoveProducer(s)Drumma BoyGorilla Zoe singles chronology Juice Box (2007) Lost (2008) What It Is (2009) Lost is a song by American hip hop recording artist Gorilla Zoe, released October 14, 2008 as the lead single from his second studio album Don't Feed Da Animals …

Kembali kehalaman sebelumnya