~8.5 million Microsoft Windows operating systems crash worldwide, causing global disruption of critical services
On 19 July 2024, American cybersecurity company CrowdStrike distributed a faulty update to its Falcon Sensor security software that caused widespread problems with Microsoft Windows computers running the software. As a result, roughly 8.5 million systems crashed and were unable to properly restart[1] in what has been called the largest outage in the history of information technology[2] and "historic in scale".[3]
The outage disrupted daily life, businesses, and governments around the world. Many industries were affected—airlines, airports, banks, hotels, hospitals, manufacturing, stock markets, broadcasting, gas stations, retail stores, and more—as were governmental services, such as emergency services and websites.[4][5] The worldwide financial damage has been estimated to be at least US$10 billion.[6]
Within hours, the error was discovered and a fix was released,[7] but because many affected computers had to be fixed manually,[8] outages continued to linger on many services.[9][10]
Background
CrowdStrike produces a suite of security software products for businesses, designed to protect computers from cyberattacks. Falcon, CrowdStrike's endpoint detection and response agent, works at the operating system kernel level on individual computers to detect and prevent threats.[11]Patches are routinely distributed by CrowdStrike to its clients to enable their computers to address new threats.[12]
CrowdStrike's own post-incident investigation identified several errors that led to the release of a fault update to the "Crowdstrike Sensor Detection Engine":[13][non-primary source needed]
In the programming language C, the length of arrays must be treated and checked separately. However, the length was not checked before access. An array with 21 fields was expected, but the channel file was in an older data format with only 20 fields.
The channel files did not contain a version number field that was checked.
There were no staggered rollouts, but the update was distributed to all customers simultaneously. Even critical infrastructure was not specially treated.
On 19 July at 04:09 UTC, CrowdStrike distributed a faulty configuration update for its Falcon sensor software running on Windows PCs and servers. A modification to a configuration file which was responsible for screening named pipes, Channel File 291, caused an out-of-bounds memory read[14] in the Windows sensor client that resulted in an invalid page fault. The update caused machines to either enter into a bootloop or boot into recovery mode.[15][16]
Almost immediately, Windows virtual machines on the Microsoft Azure cloud platform began rebooting and crashing,[17] and at 06:48 UTC, Google Compute Engine also reported the problem. The problem affected systems running Windows 10 and Windows 11 running the CrowdStrike Falcon software.[18][15] Most personal Windows PCs were unaffected, as CrowdStrike's software is primarily used by organizations.[18] The CrowdStrike software did not provide a way for subscribers to delay the installation of its content files.[19] Computers running macOS and Linux were unaffected, as the problematic content file was only for Windows,[20] but similar problems had affected Linux distributions of CrowdStrike software in April 2024.[21][22]
CrowdStrike reverted the content update at 05:27 UTC,[23] and devices that booted after the revert were not affected.[24]
At 07:15 UTC, Google said that the CrowdStrike update was at fault.[25] Within hours, CrowdStrike CEOGeorge Kurtz confirmed that CrowdStrike's faulty kernel configuration file update had caused the problem.[8][7] At 09:45 UTC, Kurtz confirmed that the fix was deployed[26][27] and that the problem was not the result of a cyberattack.[8][28]
The impact to companies in the Central United States was exacerbated by an unrelated outage with Microsoft Azure the previous day.[29] On 18 July, the Azure platform had an outage that blocked some companies' access to their storage and to Microsoft 365 applications in Azure's Central United States region.[30]
Remedy
Affected machines could be restored by rebooting while connected to the network; ideally while connected to Ethernet, thus providing the opportunity to download the reverted channel file, with multiple reboots reportedly required.[31][23]
If crashes persisted, remediation required booting into safe mode or the Windows Recovery Environment and deleting any .sys file beginning with C-00000291- and with timestamp 04:09 UTC in the %windir%\System32\drivers\CrowdStrike\ directory.[28] As this process needed to be done locally on each individual machine, it was "expected to take days" for affected businesses to restore all systems.[32] Technical staff needed to reboot the affected computers individually with manual intervention on each system.[33][34][35]
On devices with Windows' BitLocker disk encryption enabled, which corporations often use to increase security, fixing the problem was exacerbated because the 48-digit numeric Bitlocker recovery keys (unique to each system) required manual input, with additional challenges supplying the recovery keys to end users working remotely. Additionally, several organisations utilising local servers for Bitlocker recovery key storage could not access keys that were stored on servers that themselves had crashed.[36][37]
Microsoft has also recommended restoring a backup from before 18 July to fix the issue.[38]
Impact
Outages were experienced worldwide,[2][39][40] reflecting the wide use of Microsoft Windows and CrowdStrike software by global corporations in numerous business sectors.[41] At the time of the incident, CrowdStrike said it had more than 24,000 customers,[42] including nearly 60% of Fortune 500 companies and more than half of the Fortune 1000.[43][44] On 20 July, Microsoft estimated that 8.5 million devices were affected by the update, which it said was less than one percent of all Windows devices.[1]
Widespread outages were immediately reported across multiple countries, with major global disturbances experienced by the general public sweeping from east to west from time zone to time zone.[citation needed][clarification needed] At 04:09 UTC on 19 July, the time when the faulty update was issued, it was the middle of the business day of Oceania and Asia, the early morning hours in Europe, and midnight in much of the Americas.
Some countries were less affected. China, which has striven toward self-sufficiency in IT, saw little impact to key services such as airlines and banks, although foreign businesses and luxury hotels in the country were affected.[45] Russia and Iran—both restricted by international sanctions from using the services of American high-tech companies—reported no disruptions.[46][47]
Cyber risk quantification company, Kovrr, calculated that the total cost to the UK economy will likely fall between £1.7 and £2.3 billion ($2.18 and $2.96 billion).[48]
A specialist cloud outage insurance business estimated that the top 500 US companies by revenue, excluding Microsoft, had faced near $5.4bn (£4.1bn) in financial losses because of the outage, but only between $540m (£418m) to $1.08bn (£840m) of those losses would be insured.[49]
CrowdStrike liability
CrowdStrike's own terms and conditions for their Falcon software limit liability to "fees paid", effectively a refund. Larger customers may have negotiated different terms.[50]
In the EU, it is possible that CrowdStrike will be held liable under a GDPR regulation related to the impact of security incidents on user data. The regulation is best known in relation to data leaks but also applies to data destruction. It is unclear whether temporary loss of access to data is enough to trigger liability, or whether GDPR applies to all incidents related to security or only unauthorised access.[51]
Further, the incident could be classed as a "personal data breach" which would be a data breach of the GDPR under Article 4 named "Definitions", paragraph 12.[51][52] On 19 July 2024, a data-protection expert reported a breach of Article 32 named "Security of processing".[51][53]
Air transport
Globally, 5,078 air flights, 4.6% of those scheduled that day, were cancelled.[54][27] An unrelated Microsoft Azure outage, affecting services such as Microsoft 365, compounded airlines' problems.[55]
In India, the outage affected Indigo Airlines, Akasa Air, SpiceJet, and Vistara. Handwritten boarding passes were being issued during the outage. The Ministry of Civil Aviation asked and ordered the airlines as well as the airports to be compassionate and provide food as well as seats to waiting customers as needed.[78][79][80][81] As of 18:14 IST (12:44 UTC), over 200 Indian flights had been cancelled; IndiGo alone cancelled 192.[82] Airlines that relied on Microsoft Azure for their services were affected.[83]Air India and SpiceJet said that none of its flights were cancelled due to the outage, attributing it to their robust cyber system however, minute delays were reported.[84]
Europe
Prague Airport[5][85] in Czechia, Budapest Airport[27] in Hungary, Bratislava Airport[86] in Slovakia, and Schiphol Airport[87] in the Netherlands experienced problems. Planes were not allowed to land at Zurich Airport.[54] Near Brussels, Charleroi Airport employees manually checked passengers in, but other software alleviated problems by 10:00 (UTC+2) and there were minimal delays.[88]ENAIRE's Aena, the Spanish national airport traffic control manager, mentioned an IT outage in their website and social media.[89] All Spanish airports reported disruptions.[90]Charles de Gaulle Airport and Orly Airport experienced check-in problems and suspended flights.[91]Poznań–Ławica Airport and Warsaw Chopin Airport experienced check-in disruptions. An emergency system was activated, and check-in processes were slower.[92]Berlin Brandenburg Airport announced that since around 07:00 (UTC+2), operational processes were affected by "IT problems at an external provider", and that they planned to stop flights until 08:00 UTC.[81][93] While passenger handling continued with some restrictions, there were delays and airlines cancelled some flights.[94] Several airlines (Eurowings, Ryanair, Vueling, and Turkish Airlines) in Hamburg Airport had to issue tickets by hand.[95] Croatian and Swedish air traffic control were also disrupted.[96][97]
Swiss International Air Lines had 30% of flights grounded.[98]Lufthansa in Germany experienced problems with the "profile and booking retrieval" features of their website.[99] Ryanair's booking and check-in services were unavailable and the airline was "forced to cancel a small number of flights", advising passengers to arrive at airports at least three hours before departure.[100][101]Wizz Air said the outage put its online services offline.[102][103] Dutch airline KLM suspended most operations, announcing that flight handling was impossible with the issue,[citation needed] and Transavia Airlines experienced problems.[87][104]Finnair reported that they were having trouble sending emails and SMS messages to customers.[105] In Greece, citizens and tourists saw delays at major airports, notably at Athens International Airport and at Heraklion International Airport. This disruption, occurring at the peak of the tourist season, resulted in chaotic scenes as passengers were forced to wait for hours for their flights. Contributing factors included severe staff shortages and new schedules.[106][107] In Heraklion, eight flights were problematic. The airport's chief, George Pliakas, indicated that flights were being manually arranged to manage the disruption, but the influx of arriving flights strained the system.[108][109]
Several UK airports had problems, including Edinburgh Airport, whose departure boards froze,[110] and Gatwick Airport, where automatic barcode scanning stopped working and had to be checked manually.[54] Amadeus, which manages UK baggage at Heathrow, said they were affected by the IT outage.[111] Disruption to flights was anticipated in the Isle of Man, particularly to and from the UK, but ultimately minimal.[112]
Middle East and North Africa
Tunisia experienced temporary airport disruptions.[113] Turkish Airlines cancelled some of its flights due to the outage.[114]
In the mid-morning of Friday 19 July, a ground stop was issued by United, Delta, and American Airlines, halting takeoffs but allowing aircraft aloft to reach their destinations.[16][115]Allegiant Air was also grounded by the outage.[5][116] Around 10:30 a.m. Eastern time, AP reported that about 1,500 flights had already been cancelled in the United States due to the outage.[66] American Airlines, United, and Allegiant recovered relatively quickly after Friday.[117][118]
But Delta, by far the hardest hit of the US major airlines, experienced an operational meltdown that continued through the weekend. The airline cancelled more than 1,200 flights on Friday.[118][119] Thousands of stranded travellers were forced to spend the night at Hartsfield–Jackson Atlanta International Airport, Delta's largest hub and the busiest airport in the world by passenger traffic.[119]Metro Atlanta hotels and rental car companies were overwhelmed by the crisis, leaving travellers no option but to stay in the airport.[120][121] One traveller attempting to return home to Tampa (after giving up on reaching California) reported that Amtrak was charging $1,000 for a one-way train ticket from Atlanta to Tampa.[122] Visibly distraught passengers with nowhere to go were seen trying to sleep in the airport on hard linoleum floors without blankets or food.[119] The airport's custodial staff were also overwhelmed, with restrooms and trash reportedly "out of control".[121] Without warning, Delta banned unaccompanied minors on its flights through the end of 23 July.[123] This imposed hardship on parents who had been counting on that service to enable their children to fly without the expense of an accompanying adult.[123]
Delta cancelled more than 1,400 flights on 20 July,[124] and more than 1,300 flights on 21 July.[125] With so many passengers still stuck in Hartsfield–Jackson after two consecutive nights, the airport implemented a "concessions crisis plan" and a plan to reunite passengers with their checked baggage.[124] However, passengers in Atlanta continued to report "jam-packed" conditions and "heartbreaking" scenes in the terminals.[126]
On 21 July, Delta CEO Ed Bastian apologised to customers in a statement and revealed that the outage had left one of Delta's crew-tracking software programs "unable to effectively process the unprecedented number of changes triggered by the system shutdown".[125][127] Delta CIO Rahul Samant said the program had been brought back online around 11 a.m. on 19 July, but was overwhelmed by the backlog of updates awaiting processing and had been trying to catch up ever since.[126] After the ground stop left too many crew members in the wrong places, Delta struggled to assemble enough pilots and flight attendants at airport gates to operate scheduled flights.[127] Many flights were repeatedly delayed and finally cancelled because the one or two crew members who made it to the gate for a particular flight kept hitting their legal flight time limit before the airline could finish fully staffing the flight, and this caused the crisis to snowball as those crew and their aircraft were now in the wrong place for the following day's flights.[127] (A similar phenomenon occurred during the 2022 Southwest Airlines scheduling crisis.[125]) That same day, US Secretary of TransportationPete Buttigieg said on social media that the US Department of Transportation had received hundreds of complaints about Delta, and reminded the airline of its legal obligations to affected passengers.[125]
On 22 July, Delta cancelled more than 1,200 flights.[126] On 23 July, the Department of Transportation announced the launch of a formal investigation into Delta's treatment of passengers.[126] Delta officials promised to cooperate but said the airline was focused on its recovery.[126] Senator Maria Cantwell, in her capacity as chair of the Senate Committee on Commerce, Science, and Transportation, wrote to Bastian to express her concern for Delta passengers.[126] On 23 July, Secretary Buttigieg estimated that over 500,000 passengers had been affected by Delta flight cancellations.[128] He told a press conference, "There's a lot of things I'm very concerned about, including people being on hold for hours and hours, trying to get a new flight, people having to sleep on airport floors, even accounts of unaccompanied minors being stranded in airports, unable to get on a flight".[129] He told CBS News: "Stories about people in lines of more than a hundred people with just one customer service agent serving them at an airport, that's completely unacceptable."[130] By then, numerous passengers had ended up in different airports than their baggage because of Delta's flight cancellations, resulting in large piles of unclaimed suitcases and other checked baggage at Delta's airport terminals around the world.[131]
On 25 July, Delta returned to normal flight operations, having cancelled more than 7,000 flights; passengers had filed more than 5,000 complaints about Delta with the Department of Transportation.[132] On 26 July, The Washington Post reported that the department was investigating allegedly misleading communications from Delta that offered only credit towards future Delta flights as compensation for cancelled flights and failed to clearly notify passengers of their legal right to a cash refund.[129]
On 31 July, Delta CEO Ed Bastian said the disruption had cost the airline $500 million, and he said that Delta would sue CrowdStrike to recoup some of its losses.[133] On 8 August, Delta confirmed in a filing with the US Securities and Exchange Commission that over 7,000 flights had been cancelled over five days, and estimated its losses at $380 million in lost revenue and $170 million in expenses (adding up to about $550 million).[134] Delta also estimated that around 1.3 million passengers had been affected by the flight cancellations.[135]
Southwest Airlines (the third largest US major airline by domestic passengers) was entirely unaffected. A Southwest spokesperson confirmed that the airline had seen no impact from the CrowdStrike outage but refused to confirm speculation among aviation industry analysts that it had been shielded by its notoriously outdated software.[137]
The flight delays meant that many people who had traveled to the 2024 Republican National Convention—which concluded the day the outages started—were stuck in the convention's host city of Milwaukee, Wisconsin.[138][139]
Microsoft and CrowdStrike stocks fell as a result of the outage. CrowdStrike's stock fell more than 11% on 19 July, although Microsoft stock was down less than 1%.[142][5]
In India, the Reserve Bank of India said that only 10 banks and NBFCs were affected by the outage;[152][153] few banks use CrowdStrike tools and many banks' critical systems do not run on the cloud. NSE, BSE,[153] and India's largest bank, State Bank of India, said they were unaffected.[154]
In Brazil, Bradesco Bank confirmed it was affected. During the morning customers were able to login, but at 12:00 UTC the bank disabled the login button.[155]
The London Stock Exchange, while operating normally, was unable to push news updates to its website.[54] English gambling company Ladbrokes Coral and English supermarket chain Morrisons also reported problems.[54] Polish banks, including Santander Bank Polska, ING Bank Śląski and mBank, encountered issues related to the outage. Santander BP's helpline, video, and chat services were affected. PKO Bank Polski clarified that its iPKO and IKO services were stable, but other banks faced difficulties.[156] In Finland, OP Financial Group reported minor disruptions on investment partner and stock savings accounts.[157]Sense Bank in Ukraine experienced outages due to the update.[158]
Paraguayan banks Ueno and Banco Continental were affected; their customers were unable to log in.[159]
In the United States, there were outages in 911 service or disruptions in 911 call centres' operation in some parts of Alaska,[168] Arizona,[169] Florida,[170] Iowa,[171] Indiana,[172] Kansas,[173] Michigan,[174] Minnesota,[175] New York,[176] Ohio,[177] Oregon,[178] Pennsylvania,[179] and Virginia.[171] 911 was down for all of New Hampshire.[180][181] In addition, Alaska was experiencing issues with non-emergency call centres.[181] Many call centres switched to working backup systems.[166]
The CM/ECF and PACER computer systems used by the US federal courts were unaffected.[182] However, several state courts reported problems with their computer systems, including courts in Alaska, California, Delaware, Idaho, Kansas, Maryland, Massachusetts, Michigan, Nevada, New York, and Pennsylvania.[182] In New York City, courts and correctional facilities were disrupted, delaying a hearing in the trial of Harvey Weinstein for sex offenses.[40]
UK rail companies were also affected.[110][199] Cab riders in London could not pay with credit or with debit cards and thus required cash.[54] In Sweden[200] and Belgium,[201] tickets for public transport could not be sold, and Keolis Nederland experienced issues.[87][104]
Healthcare
Many hospitals across North America paused non-urgent surgeries and visits.[202] Some affected hospitals, while remaining open, had limited, if any, access to patient records.[203] In the United States, Memorial Sloan Kettering Cancer Center postponed all procedures that required anaesthesia, the Mass General Brigham hospital system cancelled all non-emergency procedures and medical visits,[204][205] and the Cincinnati Children's Hospital Medical Center was also affected.[16]University Health Network experienced technical issues in Canada, saying hospitals' clinical activity would continue but warning that appointments may be delayed.[189] A number of other Canadian hospitals faced difficulties, with Newfoundland and Labrador Health Services activating contingency plans as patient record systems were affected.[190]LabCorp[206] and Quest Diagnostics[207] were affected by the outage. Kaiser San Jose Medical Center lost access to patient records, as well as systems that monitored newborn baby's vital signs, and the security systems to keep babies from being taken.[208] Other hospitals lost surveillance cameras and the ability for employee badges to unlock secure areas.[208]
England's National Health Service (NHS) said that the issues are "causing disruption in the majority of [English] GP practices",[27] with some of its services, such as GP surgeries, which rely on a software product called EMIS Web, unable to view and manage medical records, issue and manage prescriptions, or make appointments.[54]Manx Radio reported that GP surgeries were affected in the Isle of Man.[209] The London Ambulance Service experienced an unprecedented surge[54] in 999 and 111 calls following the outage, responding to 4,500 emergency calls by 17:00 (BST).[210]
Two-thirds of Northern Ireland's general practices (GPs) were affected. At hospitals radiation therapy, bookings for operating theatres, and staff rosters are also affected.[54]
In the Netherlands, the outages affected two hospitals—the Scheperziekenhuis in Emmen and the Slingeland Ziekenhuis in de Achterhoek—and numerous emergency aid stations were also affected, including those in Emmen, Hoogeveen, and Stadskanaal.[214]
The Central Health information system in Croatia was affected, although it was clarified that it was due to a concurrent issue tied with moving their servers to a new location.[215]
The pharmaceutical company Krka in Slovenia suffered a full production outage and sent its workforce home.[216]
Media and communications
Numerous American TV stations were unable to broadcast because of the global outage. KSHB-TV, one of the affected stations, had to resort to airing national news via Scripps News.[66]ESPN was unable to air the morning editions of SportsCenter on the morning of the outage in America, instead airing ESPN Radio's Unsportsmanlike, simulcasting with ESPN2.[217][self-published source?] ESPN and ESPN2 later simulcasted Get Up! and First Take in place of SportsCenter, albeit without on-air graphics or b-roll.[218][219] Various Paramount channels were also affected including Nicktoons (with its West Coast feed switching to an old emergency feed), TeenNick, BET Her, NickMusic and most channels on the Pluto TV service. MeTV Toons was sent off the air for five and a half hours. Mercedes AMG PETRONAS F1 Team (which is sponsored by CrowdStrike) also suffered issues on the Friday of the Hungarian Grand Prix, with a Mercedes spokesperson confirming that the team had to manually address the problem on every computer it used. The issue also affected their engine customers, McLaren, Aston Martin and Williams.[220] Many video screens in New York City's Times Square turned off.[221]
When some companies let their employees go home early as a result of the incident,[222] the topic "Thank you Microsoft for an early vacation" momentarily became Weibo's most popular term.[223]Universal Studios Japan announced that they would not be selling tickets via ticket booths over the weekend due to the outage; however, tickets would still be sold online or via designated ticket sales sites.[66]
Vodafone experienced outages.[158] The issue affected the office laptops of DPG Media Belgium – which impacts JOE and QMusic Radio, banks, post services, and government agencies.[211] Telephone communication with the urban services in Antwerp were also affected.[224] The Centre for Cybersecurity Belgium stated that the impact in Belgium was limited.[211]Sky News was unable to broadcast live in the UK,[225] as was the BBC's CBBC, a free-to-air children's television channel.[54] Irish national broadcaster RTÉ said its newsroom was hit by "intermittent internet outages" with minimal impact to output.[226][227] The Canadian Broadcasting Corporation was also affected.[140]
Several French TV channels affected by the issues include TF1, TFX, LCI and Canal+ Group networks.[228] Phone and internet service provider Bouygues Telecom has also announced the unavailability of its customer service as a result of the outage.[229] The operations of the 2024 Summer Olympics, scheduled to start the following week in Paris, France, were also affected. The outage occurred a day after the Olympic Village opened and organisers were processing the arrivals of athletes and delegates. The organising committee said that a contingency plan was activated and that only the delivery of uniforms and accreditations were affected.[230] The incident slowed down the operations, with the accreditation desk at the press centre closed and security checks done manually using a list of names.[54][231]
IT workers and the BPO industry were affected in the Philippines.[232] Numerous Singaporean companies, including SPH Media, Singtel, and M1, reported various levels of service difficulties throughout the day on 19 July.[68][69]
Supermarkets in the Philippines were affected due to crashed POS systems.[236][dubious – discuss] German supermarket chain Tegut closed some of its stores.[237] Customers experienced payment problems at Foodstuffs and Woolworths supermarkets in New Zealand.[61]Coles was also affected.[5] British grocery chain Waitrose could only accept cash from customers.[238] Self-checkout and online order systems at some Australian retailers and fast food chains were out of service.[39]
The mobile application of the Starbucks coffee chain was limited to basic viewing of accounts made before the update; ordering was limited to in-store purchases only; some outlets saw cash register software crashes.[239]
In the United States, sporting goods retailer Dick's Sporting Goods closed some of its stores and saw temporary outages to its website.[240][241]
In Norway, the pharmacy chain Apotek1 and the insurance company Tryg suspended services; the Vitusapotek and Boots pharmacy chains were also affected.[243][244] Norway experienced little further impact because of CrowdStrike's limited market share in the country.[245]
Amazon saw disruption to its warehouse operations and internal software. An app used in Amazon warehouses to manage schedules and time-off requests called 'A to Z' was taken down by the outage and an internal service called 'Anytime Pay' became unavailable to employees. Operations were briefly halted at some sites, while Amazon's trucking operations were disrupted, with drivers saying a platform they use called 'Relay' suffered issues meaning they were briefly unable to pick up loads at warehouses.[246]
Other sectors
The outage affected terminal operations at DCT Gdańsk, a major container hub in the Baltic port of Gdańsk in Poland.[247] Shipping ports in the US were unaffected for the most part, although the Port of Houston (which handles the most foreign tonnage) closed briefly.[248]
In Sweden, the Malmberget mine was evacuated as a precaution.[249] Tickets for soccer games could not be sold.[250]
In the United States, security provider ADT was affected.[41]
In a live interview on NBC's Today, CrowdStrike CEO George Kurtz apologised to the public. He said company leaders were "deeply sorry for the impact that we've caused to customers, to travellers, to anyone affected by this, including our companies".[252] CrowdStrike warned that malicious actors might try to pose as its staff or independent researchers claiming to help fix the problem.[253]
CrowdStrike offered $10 UberEats vouchers to some employees at companies that sell and support its software as thanks for helping Crowdstrike customers recover, prompting ridicule given the costs associated with the outage.[254] Uber flagged the code as suspicious as it was used so frequently, so it did not work for some users.[255]
CrowdStrike won the 2024 Pwnie Awards for the Most Epic Fail,[256] which CrowdStrike president Michael Sentonas accepted in person at DEF CON's annual Pwnie Awards show.[257] A parody website named ClownStrike was created in the aftermath of the incident; CrowdStrike later sent a Digital Millennium Copyright Act (DMCA) takedown notice to the owner of the site.[258]
Political
The Australian government held a national emergency meeting to address the outage. The National Coordination Mechanism was activated; Prime Minister Anthony Albanese said, "I understand Australians are concerned about the outage that is unfolding globally and affecting a wide range of services. My Government is working closely with the National Cyber Security Coordinator".[61][259] He later said, "There is no impact to critical infrastructure, government services, or Triple-0 services at this stage".[260]Victorians were advised to call Triple-0 if a fire alarm sounds or smoke is detected, as some automatic alarms in buildings may not automatically call fire services due to the outage.[5]
United States president Joe Biden's administration was in touch with CrowdStrike to offer assistance, and on 23 July, Kurtz was invited to appear before Congress to explain how the outage occurred and what CrowdStrike was doing to prevent future incidents.[66]
The UK government's COBR committee met to discuss the incident.[261]
India's Minister of Information and Technology Ashwini Vaishnav said that the government was in touch with Microsoft. The government's cybersecurity agency CERT-IN classified the incident as "critical".[262]
In Russia, the government noted that the sanctions and boycotts placed on Russia as a result of its invasion of Ukraine in 2022 had unintentionally shielded it from the outage.[263] Russia's Digital Communications Ministry said, "At the moment, the ministry has not received reports of system failures at Russian airports," and "The situation with Microsoft once again shows the importance of import substitution of foreign software, primarily at critical information infrastructure facilities." The Russian Federal Air Transport Agency confirmed that no domestic airlines were affected.[263] The Kremlin stated that its systems were working as normal.[263]
An editorial in the Chinese state-run Global Times suggested a need for decreased dependency on Western firms, stating that reliance on "top companies to lead network security efforts" can introduce security risks and noting the perceived irony of the United States leading global security efforts while major companies monopolise the industry.[264]
Industry
Cybersecurity consultant Troy Hunt called the incident the "largest IT outage in history", adding: "This is basically what we were all worried about with Y2K, except it's actually happened this time".[265][266]Slate described it as "Y2K Lite".[267]
AirAsia CEO Tony Fernandes demanded answers and compensation for millions of dollars in revenue he said the company had lost in the incident.[269]
Chinese cybersecurity companies such as 360 Security, QAX and Tencent took advantage of the CrowdStrike incident to promote their own software.[270]
News reporters have used the term "digital pandemic" to describe the outage.[271][272][273][274]
Criminal
Governments worldwide and cybersecurity agencies warned of digital phishing scams after the incident. Cyber criminals started sending phishing emails purporting to be CrowdStrike support and impersonating CrowdStrike staff in phone calls shortly afterward.[275][276][277]
Analysis
Cause
The 19 July update was an instance of a template that was tested and released in March 2024 as part of an update to the Falcon Sensor software. This new instance, Channel File 291, passed validation due to a bug in CrowdStrike's content verification software.[278][14] The Falcon Sensor itself parses the file differently in a way that led to a software crash in kernel mode.
Centralisation and homogeneity
The outage raised questions about oligopoly and centralisation in the information technology sector.[279][280] The majority of the world's computers use Microsoft Windows, creating a monoculture that reduces resiliency.[140][280]Ciaran Martin, a cybersecurity expert, said, "This is a very, very uncomfortable illustration of the fragility of the world's core internet infrastructure".[140] Critical infrastructure expert Gregory Falco said, "Cybersecurity providers are part of this homogenous backbone of modern systems and are so core to how we operate that a glitch in their operations will have similar impacts to failures in systems that are household names".[281] Security experts suggested more redundancy to avoid single points of failure,[279] wider use of decentralised and heterogeneous federated systems,[280] and public anger at the failure of political leaders to regulate for diversity and competition.[140] Conversely, cybersecurity expert Andrew Plato argued that monocultures are a net positive, “from a security perspective, there’s actually a lot of benefits to running a smaller, standardized set of software, because it allows you to spot a problem quicker and easier."[282]
IT practices
Experts speculate that the update was not put through routine patch management procedures (testing the update in a sandbox) to verify there were no problems.[283]
Mandating disclosure of breaches and vulnerabilities has also been suggested.[284] In an interview with Wired, cybersecurity consultant Jake Williams said that this outage has "shown why pushing updates without IT intervention is unsustainable," and that "people may now demand changes in this operating model."[285]
Operating system design and antitrust enforcement
Microsoft blamed a 2009 antitrust agreement with the European Union that they said forced them to sustain low-level kernel access to third-party developers.[286][287][288] The document does not explicitly state that Microsoft has to provide kernel-level access, but says Microsoft must provide access to the same APIs used by its own security products.[287] The EU rejected the allegations. The European Commission spokesperson told Euronews that "Microsoft is free to decide on its business model. It is for Microsoft to adapt its security infrastructure to respond to threats in line with EU competition law. Additionally, consumers are free to benefit from competition and choose between different cybersecurity providers."
The spokesperson also said that "the incident was not limited to the European Union and that Microsoft has never raised any concerns about security with the Commission either before or after the incident."[289]
In Linux, it is possible to use eBPF instead of kernel modules to program this type of software.[290]
Since macOS Catalina (2019), this type of software can use the Endpoint Security Framework instead of a kernel extension, and this approach has been gradually enforced.[291]
^Sorkin, Andrew Ross; Mattu, Ravi; Warner, Bernhard; Kessler, Sarah; Merced, Michael J. de la; Hirsch, Lauren; Livni, Ephrat; Gaffney, Austyn (19 July 2024). "Counting the Costs of a Global IT Outage". The New York Times. ISSN0362-4331. Archived from the original on 19 July 2024. Retrieved 20 July 2024.
^Tidy, Joe (19 July 2024). "Crowdstrike Says IT Problems Will Take Time to Fix". BBC. Archived from the original on 25 July 2024. Retrieved 20 July 2024. Technical staff will need to go and reboot each and every computer affected, which could be a monumental task.
^McMahon, Liv; Fraser, G. (24 July 2024). "CrowdStrike to improve testing after 'bug' caused outage". BBC. Archived from the original on 24 July 2024. Retrieved 24 July 2024. the top 500 US companies by revenue, excluding Microsoft, had faced some $5.4bn (£4.1bn) in financial losses.
^"Article 4 GDPR Security of Processing". Intersoft Consulting. 2018. Archived from the original on 25 July 2024. Retrieved 25 July 2024. a breach of security leading to the accidental or unlawful destruction, loss, alteration, unauthorised disclosure of, or access to, personal data transmitted, stored or otherwise processed
^"Article 32 GDPR Security of Processing". Intersoft Consulting. 2018. Archived from the original on 14 July 2024. Retrieved 25 July 2024. said he reported it as a breach of article 32 of the GDPR
^Bennett, Tess; Smith, Paul; Tillett, Andrew; Tom, Richardson (19 July 2024). "Businesses crippled by widespread IT outage". Australian Financial Review. Archived from the original on 19 July 2024. Retrieved 19 July 2024.
^""感谢微软, 提前放假"!Windows全球大范围蓝屏" ["Thanks Microsoft, early holiday"! Windows Worldwide Wide Blue Screen]. news.ifeng.com (in Chinese). Archived from the original on 25 July 2024. Retrieved 19 July 2024.