Supercomputer operating system

A supercomputer operating system is an operating system intended for supercomputers. Since the end of the 20th century, supercomputer operating systems have undergone major transformations, as fundamental changes have occurred in supercomputer architecture.[1] While early operating systems were custom tailored to each supercomputer to gain speed, the trend has been away from in-house operating systems and toward some form of Linux,[2] which ran on every supercomputer on the TOP500 list in November 2017. In 2021, the top 10 supercomputers ran, for instance, Red Hat Enterprise Linux (RHEL), a variant of it, or another Linux distribution such as Ubuntu.

Given that modern massively parallel supercomputers typically separate computations from other services by using multiple types of nodes, they usually run different operating systems on different nodes, e.g., using a small and efficient lightweight kernel such as Compute Node Kernel (CNK) or Compute Node Linux (CNL) on compute nodes, but a larger system such as a Linux-derivative on server and input/output (I/O) nodes.[3][4]

While in a traditional multi-user computer system job scheduling is in effect a tasking problem for processing and peripheral resources, in a massively parallel system the job management system needs to manage the allocation of both computational and communication resources, and must cope gracefully with the inevitable hardware failures that occur when tens of thousands of processors are present.[5]

Although most modern supercomputers use the Linux operating system,[6] each manufacturer has made its own specific changes to the Linux derivative it uses, and no industry standard exists, partly because differences in hardware architectures require changes that optimize the operating system for each hardware design.[1][7]

Operating systems used on top 500 supercomputers

Context and overview

In the early days of supercomputing, the basic architectural concepts were evolving rapidly, and system software had to follow hardware innovations that usually took rapid turns.[1] In the early systems, operating systems were custom tailored to each supercomputer to gain speed, yet in the rush to develop them, serious software quality challenges surfaced and in many cases the cost and complexity of system software development became as much an issue as that of hardware.[1]

The supercomputer center at NASA Ames

In the 1980s, the cost of software development at Cray came to equal what the company spent on hardware, and that trend was partly responsible for a move away from in-house operating systems toward the adaptation of generic software.[2] The first wave of operating system changes came in the mid-1980s, as vendor-specific operating systems were abandoned in favor of Unix. Despite early skepticism, this transition proved successful.[1][2]

By the early 1990s, major changes were occurring in supercomputing system software.[1] By this time, the growing use of Unix had begun to change the way system software was viewed. The use of a high-level language (C) to implement the operating system and the reliance on standardized interfaces were in contrast to the assembly-language-oriented approaches of the past.[1] As hardware vendors adapted Unix to their systems, new and useful features were added to Unix, e.g., fast file systems and tunable process schedulers.[1] However, all the companies that adapted Unix made unique changes to it, rather than collaborating on an industry standard to create "Unix for supercomputers". This was partly because differences in their architectures required these changes to optimize Unix for each architecture.[1]

As general-purpose operating systems became stable, supercomputers began to borrow and adapt critical system code from them, and relied on the rich set of secondary functions that came with them.[1] At the same time, however, the code size of general-purpose operating systems was growing rapidly. By the time Unix-based code had reached 500,000 lines, maintaining and using it was a challenge.[1] This prompted a move to microkernels, which provide only a minimal set of operating system functions. Systems such as Mach at Carnegie Mellon University and ChorusOS at INRIA were examples of early microkernels.[1]

The separation of the operating system into separate components became necessary as supercomputers developed different types of nodes, e.g., compute nodes versus I/O nodes. Thus modern supercomputers usually run different operating systems on different nodes, e.g., using a small and efficient lightweight kernel such as CNK or CNL on compute nodes, but a larger system such as a Linux-derivative on server and I/O nodes.[3][4]

Early systems

The first Cray-1 (sample shown with internals) was delivered to the customer with no operating system.[8]

The CDC 6600, generally considered the first supercomputer in the world, ran the Chippewa Operating System, which was then deployed on various other CDC 6000 series computers.[9] The Chippewa was a rather simple job control oriented system derived from the earlier CDC 3000, but it influenced the later KRONOS and SCOPE systems.[9][10]

The first Cray-1 was delivered to the Los Alamos Lab with no operating system or any other software.[11] Los Alamos developed both the application software and the operating system for it.[11] The main timesharing system for the Cray-1, the Cray Time Sharing System (CTSS), was then developed at the Livermore Labs as a direct descendant of the Livermore Time Sharing System (LTSS), the CDC 6600 operating system from twenty years earlier.[11]

In supercomputer development, rising software costs soon became dominant, as evidenced by the cost of software development at Cray growing in the 1980s to equal the company's cost for hardware.[2] That trend was partly responsible for a move away from the in-house Cray Operating System to the Unix-based UNICOS system.[2] In 1985, the Cray-2 was the first system to ship with the UNICOS operating system.[12]

Around the same time, the EOS operating system was developed by ETA Systems for use in their ETA10 supercomputers.[13] Written in Cybil, a Pascal-like language from Control Data Corporation, EOS highlighted the difficulty of developing stable operating systems for supercomputers, and eventually a Unix-like system was offered on the same machine.[13][14] The lessons learned from developing ETA system software included the high level of risk associated with developing a new supercomputer operating system, and the advantages of using Unix with its large extant base of system software libraries.[13]

By the mid-1990s, despite the extant investment in older operating systems, the trend was toward the use of Unix-based systems, which also facilitated the use of interactive graphical user interfaces (GUIs) for scientific computing across multiple platforms.[15] The move toward a commodity OS had opponents, who cited the fast pace and focus of Linux development as a major obstacle to adoption.[16] As one author wrote, "Linux will likely catch up, but we have large-scale systems now." Nevertheless, that trend continued to gain momentum, and by 2005 virtually all supercomputers used some Unix-like OS.[17] These variants of Unix included IBM AIX, the open source Linux system, and other adaptations such as UNICOS from Cray.[17] By the end of the 20th century, Linux was estimated to command the highest share of the supercomputing pie.[1][18]

Modern approaches

The Blue Gene/P supercomputer at Argonne National Lab

The IBM Blue Gene supercomputer uses the CNK operating system on the compute nodes, but uses a modified Linux-based kernel called I/O Node Kernel (INK) on the I/O nodes.[3][19] CNK is a lightweight kernel that runs on each node and supports a single application running for a single user on that node. For the sake of efficient operation, the design of CNK was kept simple and minimal: physical memory is statically mapped, and CNK neither needs nor provides scheduling or context switching.[3] CNK does not even implement file I/O on the compute node, but delegates that to dedicated I/O nodes.[19] However, because multiple compute nodes on the Blue Gene share a single I/O node, the I/O node operating system does require multi-tasking, hence the selection of the Linux-based operating system.[3][19]
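The division of labor described above can be illustrated with a small sketch. It is not CNK or INK code: the class names `IONode` and `ComputeNode`, the `handle` method, and the in-memory dictionary standing in for a file system are all hypothetical, chosen only to show several lightweight compute nodes forwarding every file request to one shared, more capable I/O node.

```python
class IONode:
    """Hypothetical full-featured node that services file requests
    on behalf of many compute nodes."""

    def __init__(self):
        self.files = {}   # path -> bytes; stand-in for a real file system
        self.served = 0   # number of requests handled so far

    def handle(self, op, path, data=b""):
        self.served += 1
        if op == "write":
            # Append the payload to the named file.
            self.files[path] = self.files.get(path, b"") + data
            return len(data)
        if op == "read":
            return self.files.get(path, b"")
        raise ValueError(f"unsupported operation: {op}")


class ComputeNode:
    """Hypothetical lightweight node: it has no local file I/O and
    forwards every request to its assigned I/O node."""

    def __init__(self, rank, io_node):
        self.rank = rank
        self.io_node = io_node

    def write(self, path, data):
        return self.io_node.handle("write", path, data)

    def read(self, path):
        return self.io_node.handle("read", path)


if __name__ == "__main__":
    io = IONode()
    nodes = [ComputeNode(rank, io) for rank in range(4)]
    for node in nodes:
        node.write("/out.log", str(node.rank).encode())
    print(nodes[0].read("/out.log"), io.served)
```

Because one `IONode` object receives requests from every `ComputeNode`, it is the component that would need multi-tasking in a real system, which mirrors the reason given above for running a Linux-based kernel there.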

While in traditional multi-user computer systems and early supercomputers job scheduling was in effect a task scheduling problem for processing and peripheral resources, in a massively parallel system the job management system needs to manage the allocation of both computational and communication resources.[5] Task scheduling and the operating system must be tuned for each configuration of a supercomputer. A typical parallel job scheduler has a master scheduler which instructs some number of slave schedulers to launch, monitor, and control parallel jobs, and periodically receives reports from them about the status of job progress.[5]
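The master-and-subordinate arrangement described above can be sketched in a few lines. This is an illustration under simplifying assumptions, not the design of any particular job manager: the subordinate schedulers are called `WorkerScheduler` here, jobs are reduced to a count of remaining steps, placement picks the least-loaded worker, and all names are hypothetical.

```python
from dataclasses import dataclass, field


@dataclass
class WorkerScheduler:
    """Hypothetical subordinate scheduler: launches jobs locally and
    reports their status when asked."""
    name: str
    running: dict = field(default_factory=dict)  # job_id -> remaining steps

    def launch(self, job_id, steps):
        self.running[job_id] = steps

    def tick(self):
        """Advance every local job by one step and return a status report."""
        report = {}
        for job_id in list(self.running):
            self.running[job_id] -= 1
            if self.running[job_id] <= 0:
                del self.running[job_id]
                report[job_id] = "done"
            else:
                report[job_id] = "running"
        return report


class MasterScheduler:
    """Hypothetical master: hands jobs to workers and collects reports."""

    def __init__(self, workers):
        self.workers = workers
        self.status = {}  # job_id -> last reported state

    def submit(self, job_id, steps):
        # Assumed placement policy: the worker with the fewest running jobs.
        worker = min(self.workers, key=lambda w: len(w.running))
        worker.launch(job_id, steps)
        self.status[job_id] = "running"
        return worker.name

    def poll(self):
        """Gather one round of status reports from every worker."""
        for worker in self.workers:
            self.status.update(worker.tick())
        return dict(self.status)


if __name__ == "__main__":
    master = MasterScheduler([WorkerScheduler("w0"), WorkerScheduler("w1")])
    master.submit("job-a", steps=1)
    master.submit("job-b", steps=2)
    print(master.poll())
    print(master.poll())
```

A production job manager would also have to track communication resources and handle node failures, which this sketch leaves out.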

Some, but not all, supercomputer schedulers attempt to maintain locality of job execution. The PBS Pro scheduler used on the Cray XT3 and Cray XT4 systems does not attempt to optimize locality on its three-dimensional torus interconnect, but simply uses the first available processor.[20] On the other hand, IBM's scheduler on the Blue Gene supercomputers aims to exploit locality and minimize network contention by assigning tasks from the same application to one or more midplanes of an 8×8×8 node group.[20] The Slurm Workload Manager scheduler uses a best-fit algorithm and performs Hilbert curve scheduling to optimize locality of task assignments.[20] Several modern supercomputers, such as the Tianhe-2, use Slurm, which arbitrates contention for resources across the system. Slurm is open source, Linux-based, and very scalable, and can manage thousands of nodes in a computer cluster with a sustained throughput of over 100,000 jobs per hour.[21][22]
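To make the idea of Hilbert curve scheduling with best-fit allocation more concrete, the following sketch works on a two-dimensional grid rather than a three-dimensional torus, which is a deliberate simplification. It is not Slurm's implementation; the function names `hilbert_d2xy` and `best_fit` are hypothetical. The first function uses the standard iterative conversion from a position along the curve to grid coordinates, so that nodes with consecutive curve indices are physical neighbors. The second picks the smallest run of free consecutive indices that can hold a job.

```python
def hilbert_d2xy(n, d):
    """Map distance d along a Hilbert curve to (x, y) on an n-by-n grid,
    where n is a power of two."""
    x = y = 0
    t = d
    s = 1
    while s < n:
        rx = 1 & (t // 2)
        ry = 1 & (t ^ rx)
        if ry == 0:
            # Rotate the quadrant so sub-curves join end to end.
            if rx == 1:
                x = s - 1 - x
                y = s - 1 - y
            x, y = y, x
        x += s * rx
        y += s * ry
        t //= 4
        s *= 2
    return x, y


def best_fit(free, size):
    """Return `size` consecutive curve indices taken from the smallest
    free run that can hold them, or None if no run is large enough."""
    runs, run = [], []
    for i in sorted(free):
        if run and i != run[-1] + 1:
            runs.append(run)
            run = []
        run.append(i)
    if run:
        runs.append(run)
    fitting = [r for r in runs if len(r) >= size]
    if not fitting:
        return None
    return min(fitting, key=len)[:size]


if __name__ == "__main__":
    free = {0, 1, 2, 3, 4, 8, 9, 10, 14}   # free nodes, by curve index
    chosen = best_fit(free, 3)
    print(chosen, [hilbert_d2xy(4, d) for d in chosen])
```

Because consecutive indices on the curve are adjacent on the grid, a job placed on a contiguous run of indices occupies a compact region, which is the locality property the scheduler is after. A first-available policy, by contrast, would take whatever free indices come first regardless of how scattered they are.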

References

  1. ^ a b c d e f g h i j k l m Encyclopedia of Parallel Computing by David Padua 2011 ISBN 0-387-09765-1 pages 426–429.
  2. ^ a b c d e Knowing machines: essays on technical change by Donald MacKenzie 1998 ISBN 0-262-63188-1 page 149–151.
  3. ^ a b c d e Euro-Par 2004 Parallel Processing: 10th International Euro-Par Conference 2004, by Marco Danelutto, Marco Vanneschi and Domenico Laforenza ISBN 3-540-22924-8 page 835.
  4. ^ a b An Evaluation of the Oak Ridge National Laboratory Cray XT3 by Sadaf R. Alam, et al., International Journal of High Performance Computing Applications, February 2008 vol. 22 no. 1 52–80.
  5. ^ a b c Open Job Management Architecture for the Blue Gene/L Supercomputer by Yariv Aridor et al in Job scheduling strategies for parallel processing by Dror G. Feitelson 2005 ISBN 978-3-540-31024-2 pages 95–101.
  6. ^ Vaughan-Nichols, Steven J. (June 18, 2013). "Linux continues to rule supercomputers". ZDNet. Retrieved June 20, 2013.
  7. ^ "Top500 OS chart". Top500.org. Archived from the original on 2012-03-05. Retrieved 2010-10-31.
  8. ^ Targeting the computer: government support and international competition by Kenneth Flamm 1987 ISBN 0-8157-2851-4 page 82.
  9. ^ a b The computer revolution in Canada by John N. Vardalas 2001 ISBN 0-262-22064-4 page 258.
  10. ^ Design of a computer: the Control Data 6600 by James E. Thornton, Scott, Foresman Press 1970 page 163.
  11. ^ a b c Targeting the computer: government support and international competition by Kenneth Flamm 1987 ISBN 0-8157-2851-4 pages 81–83.
  12. ^ Lester T. Davis, The balance of power, a brief history of Cray Research hardware architectures in "High performance computing: technology, methods, and applications" by J. J. Dongarra 1995 ISBN 0-444-82163-5 page 126.
  13. ^ a b c Lloyd M. Thorndyke, The Demise of the ETA Systems in "Frontiers of Supercomputing II" by Karyn R. Ames, Alan Brenner 1994 ISBN 0-520-08401-2 pages 489–497.
  14. ^ Past, present, parallel: a survey of available parallel computer systems by Arthur Trew 1991 ISBN 3-540-19664-1 page 326.
  15. ^ Frontiers of Supercomputing II by Karyn R. Ames, Alan Brenner 1994 ISBN 0-520-08401-2 page 356.
  16. ^ Brightwell, Ron; Riesen, Rolf; Maccabe, Arthur. "On the Appropriateness of Commodity Operating Systems for Large-Scale, Balanced Computing Systems" (PDF). Retrieved January 29, 2013.
  17. ^ a b Getting up to speed: the future of supercomputing by Susan L. Graham, Marc Snir, Cynthia A. Patterson, National Research Council 2005 ISBN 0-309-09502-6 page 136.
  18. ^ Forbes magazine, 03.15.05: Linux Rules Supercomputers
  19. ^ a b c Euro-Par 2006 Parallel Processing: 12th International Euro-Par Conference, 2006, by Wolfgang E. Nagel, Wolfgang V. Walter and Wolfgang Lehner ISBN 3-540-37783-2.
  20. ^ a b c Job Scheduling Strategies for Parallel Processing: by Eitan Frachtenberg and Uwe Schwiegelshohn 2010 ISBN 3-642-04632-0 pages 138–144.
  21. ^ SLURM at SchedMD
  22. ^ Jette, M. and M. Grondona, SLURM: Simple Linux Utility for Resource Management in the Proceedings of ClusterWorld Conference, San Jose, California, June 2003.
