Superimposed code

Edge-notched card with data for a bibliographic item. Edges have not yet been notched.

A superimposed code such as Zatocoding is a kind of hash code that was popular in marginal punched-card systems.

Marginal punched-card systems

Many names, some of them trademarked, have been used for marginal punched-card systems: edge-notched cards, slotted cards, E-Z Sort, Zatocards, McBee, McBee Keysort, Flexisort, Velom, Rocket, etc. The center of each card held the relevant information—typically the name and author of a book, research paper, or journal article on a nearby shelf; and a list of subjects and keywords. Some sets of cards contained all the information required by the user on the card itself, handwritten, typewritten, or on microfilm (aperture card). Every card in a stack had the same set of pre-punched holes. The user would find the particular cards relevant to a search by aligning the holes in the set of cards (using a card holder or card tray), inserting one or more knitting-needle-like rods all the way through the stack, so the desired cards (which had been notched or cut open) fell out from the irrelevant cards in the collection (left un-notched), which remain on the needle(s). A user could repeat this selection many times to form a complex Boolean searching query. A card that was relevant to 2 or more subjects would have the slot(s) for each of those subjects cut out, so that card would drop out when either one or the other or both subjects was selected . The "superimposed code" coding systems, such as Zatocoding, saved space by entering several or all subjects in the same field; such a "superimposed code" stores much more information in less space, but at the cost of occasional "false" selections.[1]

Once you have a collection of index cards, one per book, research paper, or journal article in a library, with a list of keywords (subjects) discussed in a particular book written on that book's card, the "obvious way" to code those subjects is to count up the total number of subjects used in the entire collection R, make a row of R holes near the top of every card, and for each subject actually discussed in a particular book, cut a slot from the hole corresponding to that subject in the card corresponding to that book. [2] Naturally, this also requires a separate list of every subject used in the collection that indicates which hole is punched for each subject. Unfortunately, there may be thousands of distinct subjects in the collection, and it is impractical to punch thousands of holes in every card. While it may not seem possible to use less than 1 hole per subject, superimposed code systems can solve this problem.

Superimposed codes

The Zatocoding system of information retrieval was developed by Calvin Mooers in 1947.[3]

Calvin Mooers invented Zatocoding at M.I.T., a mechanical information retrieval system based on superimposed codes, and formed the Zator Company in 1947 to commercialize its applications.[4] The particular superimposed code used in that system is called Zatocoding, while the marginal-punched card information retrieval system as a whole is called "Zator".[5]

Setting up a superimposed code for a particular library goes something like this:

  • Going through every card in the index, a list of all R subjects used in this particular library is created, and the maximum number of subjects r actually written on a single card is noted. (For example, say we have 8000 subjects, and the librarian decides to index only the top r=4 subjects per book).
  • The librarian looks at the physical edge-notched card, and notes the number of holes N in each card. (If N >= R, then we could use the "obvious way" mentioned above—the whole point of Zatocoding is that it works even when N is much less than R).
  • The librarian chooses some number n of slots per subject—typically [2]
  • On the list of all R subjects, for each subject write down which holes will be slotted for that subject. Rather than slotting one hole per subject in "the obvious way", a superimposed code will slot n holes per subject. (There are several ways to pick these patterns—those distinguish between the various superimposed codes; we discuss them below).
  • When a new book comes in, make a new card for it:
    • Get a blank card with the standard N holes in it and write down the name of the book, etc. in the middle.
    • Write down the subjects covered by the book on the card.
    • For each of the top r subjects, look up that subject in the big list, and see which n slots to cut for that subject, and cut them.
    • When the card is finished, it may have up to r*n slots cut into it—but more likely at least some of the subject slot patterns overlapped, resulting in only v < r*n slots.

Later, when we need to find books on some particular subject, we look up that subject in our list of all R subjects, find the corresponding slot pattern of n slots, and put n needles are through the whole stack in that pattern. All of the cards that have been cut with that pattern will fall out. It is possible that a few other, undesired cards may also fall out—cards who have several subjects whose hole patterns overlap in such a way as to mimic the desired pattern. The probability F of some undesired card with v slots cut in it falling through when we select some pattern of n needles is approximately . Most systems have a N large enough and r small enough such that, v < N/2 (i.e., the card is less than half-punched), so that probability of an undesired card falling through is less than .[2]

There are several different ways to choose which holes will be slotted for each subject.

(Several variations of Zatocoding were developed. Bourne describes a variant "for newer retrieval systems that require high performance of the superimposed coding system",[6] using an approach Mooers published in 1959.[7])

Zatocoding

Setting up a Zatocode for a particular list of R subjects goes something like this:[2]

  • For the first subject, pick n of the N slots randomly.
  • For the second subject, pick n of the N slots randomly—but make sure this pattern is not identical to the first subject.
  • ...
  • For the R'th subject, pick n of the N slots randomly—but make sure it's not identical to any previous subject.

Other superimposed codes

A Zatocode requires a code book that lists every subject and a randomly generated notch code associated with each one. Other "direct" superimposed codes have a fixed hash function for transforming the letters in (one spelling of) a subject into a notch code. Such codes require a much shorter code book that describes the translation of letters in a word to the corresponding notch code, and can in principle easily add new subjects without changing the code book.[5]

A Bloom filter can be considered a kind of superimposed code.[8]

References

  1. ^ Robert V. Williams. "Punched Cards: A Brief Tutorial". computing now 2002.
  2. ^ a b c d W. Ross Ashby. W. Ross Ashby's Journal: Zato-coding 1960 Sep. 22. p. 6208-6222
  3. ^ "About the Cover". College and Research Libraries News, April 2008. [1][2]
  4. ^ Eugene Garfield. "Continuing relevance of superimposed coding. Journal of Information Science 8 (1984) 181.
  5. ^ a b Herbert Marvin Ohlman. "Subject-Word Letter Frequencies with Applications to Superimposed Coding". Proceedings of the International Conference on Scientific Information (1959).
  6. ^ Bourne, Charles P. (1963). Methods of Information Handling. John Wiley & Sons, Inc. p. 67.
  7. ^ Mooers, Calvin N. (April 1959). The Application of Simple Pattern Inclusion Selection to Large-Scale Information Retrieval Systems. Zator Company.
  8. ^ James Blustein; and Amal El-Maazawi. "Bloom Filters - A Tutorial, Analysis, and Survey". p. 11.

Read other articles:

この記事は最新の出来事(2022年ロシアのウクライナ侵攻)に影響を受ける可能性があります。情報は出来事の進行によって急速に変更される可能性があります。(2022年2月) この記事の項目名には以下のような表記揺れがあります。 ヴォロディミル・ゼレンスキー[1] ウラジーミル・ゼレンスキー ウォロディミル・ゼレンスキーВолодимир Зеленський 2022年

  关于与「生存遊戲 (電子遊戲)」標題相近或相同的条目,請見「生存游戏」。   提示:此条目的主题不是大逃殺遊戲或射擊類生存遊戲。 以Minecraft的生存模式为例子 生存遊戲是動作遊戲的一個子類,多設置在隨機或以腳本生成的開放、持久世界(英语:Persistent world)中。玩家在遊戲開始時通常只有極少的裝備,並需要收集資源、工具、武器和建造住所,以儘

Dalam artikel ini, nama keluarganya adalah Park. Park Hee-jinLahir4 Juni 1973 (umur 50)Yangpyeong-gun, Gyeonggi-do, Korea SelatanNama lainBak Hui-jinPendidikanSeoul Institute of the ArtsTahun aktif1998–sekarangAgenSY EntertainmentDikenal atasPartners for Justice Sweet Revenge 2Melting Me Softly Park Hee-jin (lahir 4 Juni 1973) adalah pemeran, model, pelawak, pembawa acara televisi, dan penyanyi Korea Selatan.[1] Ia paling dikenal untuk perannya dalam seri Partners for...

Miracle of AleaGenre Drama Fantasi SkenarioHenovia RosalindaCeritaDwi IlalangSutradara Rachmad Rangkuti[a] Tri Willy Ramadhan Wahid Setyanto Pemeran Makayla Rose Alwi Djamalulail Najla Putri Yeyen Lidya Chand Kelvin Metta Permadi Jerico Gowtara Farel Faradina Tika Guntur Nugraha Kevin Andrean Tamee Irely Angelia Livie Washifa Assegaf Ucup Pitung Fatu Milda Penggubah lagu temaEross CandraLagu pembukaSahabat Sejati - Sheila on 7Lagu penutupSahabat Sejati - Sheila on 7Penata musikA...

This article needs additional citations for verification. Please help improve this article by adding citations to reliable sources. Unsourced material may be challenged and removed.Find sources: No Exit song – news · newspapers · books · scholar · JSTOR (July 2013) (Learn how and when to remove this template message) 1999 single by BlondieNo ExitSingle by Blondiefrom the album No Exit ReleasedOctober 22, 1999Recorded1998GenrePop rock, rap rockLeng...

Russian artist Vladimir Tatlin as sailor, 1914-15 Vladimir Yevgrafovich Tatlin (28 December [O.S. 16 December] 1885 – 31 May 1953)[1] was a Russian,[2][3][4][5][6] Ukrainian,[7][8][9][10] and Soviet painter, architect and stage-designer. Tatlin achieved fame as the architect who designed The Monument to the Third International, more commonly known as Tatlin's Tower, which he began in 1919.[...

Major Methodist movement in Great Britain following its split from the Church of England Wesleyan Methodist ChurchClassificationProtestantTheologyWesleyanGovernanceConnexionalismRegionGreat BritainFounderJohn WesleyOrigin1790sBranched fromChurch of EnglandMerged intoMethodist Church of Great BritainDefunct1932 (Methodist Union)Other name(s)Wesleyan Methodist Connexion The Wesleyan Methodist Church (also named the Wesleyan Methodist Connexion) was the majority Methodist movement in England fol...

Not to be confused with 2022 United States Senate election in Oregon. 2022 Oregon State Senate election ← 2020 November 8, 2022 2024 → 16 of 30 seats in the Oregon State Senate16 seats needed for a majority   Majority party Minority party Third party   Leader Rob Wagner Tim Knopp Brian Boquist Party Democratic Republican Independent Leader since May 22, 2020 October 22, 2021 January 15, 2021 Leader's seat 19th district 27th district 12th district Last...

  关于与「腓特烈一世 (符腾堡国王)」標題相近或相同的条目,請見「腓特烈一世」。 此條目没有列出任何参考或来源。 (2021年2月22日)維基百科所有的內容都應該可供查證。请协助補充可靠来源以改善这篇条目。无法查证的內容可能會因為異議提出而被移除。 腓特烈一世符腾堡统治者符騰堡公爵1797年12月22日 — 1803年2月25日前任腓特烈二世·歐根符腾堡选侯1803年2月25...

У этого термина существуют и другие значения, см. Джокер (значения). Джокерангл. Joker Иллюстрация к серии комиксов «Бэтмен: Три Джокера» (2020), изображающая основные воплощения Джокера от Золотого века (снизу) до Серебряного века (слева) и современной эпохи (справа). Художни...

Indian actress (1945–2021) JayanthiBornKamala Kumari(1945-01-06)6 January 1945Bellary, Madras Presidency, British Raj (now in Karnataka, India)Died26 July 2021(2021-07-26) (aged 76)Banashankari, BangaloreOccupation(s)Actress, producerYears active1960–2021SpousePeketi SivaramChildren1 Kamala Kumari (6 January 1945 – 26 July 2021), known by her stage name Jayanthi, was an Indian actress known for her work in Kannada cinema and Tamil cinema.[1][2] She was noted fo...

Norrland Engineer BattalionNorrlands ingenjörbataljonActive1905–2004Country SwedenAllegianceSwedish Armed ForcesBranchSwedish ArmyTypeEngineerSizeBattalionPart ofVI Army Divisoon (1902–1927)Upper Norrland's Troops (1928–1942)VI Military District (1942–1966)Upper Norrland Military District (1966–1993)Northern Military District (1993–2000)Norrbotten Regiment (2000–2004)Garrison/HQBodenMotto(s)Nos nihil efficere non possumus (Here are no impossibilities)ColorsBlackMarchB...

1983 live album by Harlan EllisonOn the Road with Ellison Volume 1Live album by Harlan EllisonReleased1983, CD reissue 2001Recorded1981-1983GenreSpoken Word, ComedyLength41:49LabelDeep Shag RecordsProducerShelley LevinsonHarlan Ellison chronology On the Road with Ellison Volume 1(1983) On the Road with Ellison Volume 2(2004) First released in 1983 as an extremely limited edition vinyl album, On the Road with Ellison Volume 1 was reissued on CD in 2001 by Deep Shag Records. The CD feat...

Iranian cyclist Behnam MalekiPersonal informationBorn (1992-12-02) 2 December 1992 (age 31)IranTeam informationCurrent teamRetiredDisciplineRoadRoleRiderProfessional teams2011Azad University Iran2013Ayandeh Continental Team2014–2015Tabriz Petrochemical Team2016Tabriz Shahrdari Team2017Pishgaman Cycling Team2018Tabriz Shahrdary Team Behnam Maleki (born 2 December 1992) is an Iranian former professional cyclist.[1] Major results 2010 1st Road race, National Junior Road Champ...

American comic strip character For the con man Joseph Yellow Kid Weil, see Joseph Weil. Comics character The Yellow KidThe Yellow KidPublication informationPublisherNew York WorldFirst appearance17 February 1895Created byRichard F. OutcaultIn-story informationFull nameMickey DuganTeam affiliationsHogan's Alley Handwritten claim for copyright on The Yellow Dugan Kid to the Librarian of Congress on September 7, 1896 The Yellow Kid (Mickey Dugan[1]) is an American comic strip character t...

American entertainment news program This article needs additional citations for verification. Please help improve this article by adding citations to reliable sources. Unsourced material may be challenged and removed.Find sources: Showbiz Tonight – news · newspapers · books · scholar · JSTOR (April 2020) (Learn how and when to remove this template message) Showbiz TonightGenreEntertainment news programPresented byA. J. Hammer (2005–2014)Country of or...

2017 European Rowing ChampionshipsLocationRačice, Czech RepublicDates24–28 May 2017← 2016 Beetzsee2018 Glasgow → The 2017 European Rowing Championships were held in Račice, Czech Republic, between 24 and 28 May 2017.[1] Medal summary Men Event Gold Time Silver Time Bronze Time M1x[2]  Ondřej Synek (CZE) 6:48.13  Damir Martin (CRO) 6:50.02  Stanislau Shcharbachenia (BLR) 6:52.99 M2-[3]  Italy (ITA)Matteo...

Novel by Julian Barnes This article contains unreferenced categories (Category:Hardboiled crime novels). Please help improve this article by adding citations to reliable sources. Unsourced categories may be challenged and removed. (April 2022) (Learn how and when to remove this template message) Duffy First editionAuthorDan Kavanagh(pseudonym of Julian Barnes)CountryUnited KingdomLanguageEnglishPublisherJonathan Cape[1]Publication date3 Jul 1980Media typePrintPages181[1&...

Terremoto de Pakistán de 2013 7.7 en potencia de Magnitud de Momento (MW) 7.7 en escala de Richter (ML) ParámetrosFecha y hora 24 de septiembre de 2013Profundidad 20 kmCoordenadas del epicentro 27°00′00″N 65°30′50″E / 27, 65.514ConsecuenciasZonas afectadas  Pakistán India OmánMercalli IX (Violento)Víctimas 825 muertos[1]​700 heridos[2]​[3]​[editar datos en Wikidata] El terremoto de Pakistán de 2013 fue un terremoto de ...

National Soccer League 1982Negara AustraliaJuara bertahan Sydney CityJuara Sydney City(gelar ke-4)Tempat kedua St. George-BudapestPencetak gol terbanyak John Kosmina (Sydney City)(23 gol)← 1981 1983 → National Soccer League 1982 adalah edisi ke-6 dari penyelenggaraan National Soccer League, turnamen liga sepak bola tertinggi di Australia. Sydney City merupakan juara bertahan setelah menjadi juara pada edisi sebelumnya. Musim ini dimenangkan oleh Sydney City berdasarkan peringkat akhi...