Severe acute respiratory syndrome coronavirus 2

Severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2) is the strain of coronavirus that causes coronavirus disease 2019 (COVID-19), the respiratory illness responsible for the COVID-19 pandemic. Colloquially known as simply the coronavirus, it was previously referred to by its provisional name, 2019 novel coronavirus (2019-nCoV), and has also been called human coronavirus 2019 (HCoV-19 or hCoV-19). The World Health Organization declared the outbreak a Public Health Emergency of International Concern on 30 January 2020, and a pandemic on 11 March 2020.
SARS-CoV-2 is a Baltimore class IV positive-sense single-stranded RNA virus that is contagious in humans. As described by the U.S. National Institutes of Health, it is the successor to SARS-CoV-1, the strain that caused the 2002–2004 SARS outbreak.
Taxonomically, SARS-CoV-2 is a strain of severe acute respiratory syndrome-related coronavirus (SARSr-CoV). It is believed to have zoonotic origins and has close genetic similarity to bat coronaviruses, suggesting it emerged from a bat-borne virus. There is no evidence yet to link an intermediate host, such as a pangolin, to its introduction to humans. The virus shows little genetic diversity, indicating that the spillover event introducing SARS-CoV-2 to humans is likely to have occurred in late 2019. In September 2020, based on data analysis, researchers reported the discovery of the genome of the virus’s index case.
Epidemiological studies estimate each infection results in 5.7 new ones when no members of the community are immune and no preventive measures taken. The virus primarily spreads between people through close contact and via respiratory droplets produced from coughs or sneezes. It mainly enters human cells by binding to the receptor angiotensin converting enzyme 2 (ACE2).

Contents
1Terminology
2Virology
2.1Infection and transmission
2.2Reservoir and zoonotic origin
2.3Phylogenetics and taxonomy
2.4Structural biology
3Epidemiology
4See also
5References
6Further reading
7External links

Terminology
The name “2019-nCoV” in use in a trilingual sign at a Lisbon health facility in February 2020.

See also: Trump administration communication during the COVID-19 pandemic § Terminology
During the initial outbreak in Wuhan, China, various names were used for the virus; some names used by different sources included the “coronavirus” or “Wuhan coronavirus”. In January 2020, the World Health Organisation recommended “2019 novel coronavirus” (2019-nCov) as the provisional name for the virus. This was in accordance with WHO’s 2015 guidance against using geographical locations, animal species, or groups of people in disease and virus names.
On 11 February 2020, the International Committee on Taxonomy of Viruses adopted the official name “severe acute respiratory syndrome coronavirus 2” (SARS-CoV-2). To avoid confusion with the disease SARS, the WHO sometimes refers to SARS-CoV-2 as “the COVID-19 virus” in public health communications and the name HCoV-19 was included in some research articles.
The general public often calls both the virus and the disease it causes, “coronavirus”. U.S. President Donald Trump referred to the virus as the “Chinese virus” in tweets, interviews, and White House press briefings, which drew some criticism that he was stigmatizing the disease with racial or nationalistic overtones.

Virology
Infection and transmission
Main article: Transmission of COVID-19
Human-to-human transmission of SARS-CoV-2 was confirmed on 20 January 2020, during the COVID-19 pandemic. Transmission was initially assumed to occur primarily via respiratory droplets from coughs and sneezes within a range of about 1.8 metres (6 ft). Laser light scattering experiments suggest speaking— as an additional mode of transmission. Other studies have suggested that the virus may be airborne as well, with aerosols potentially being able to transmit the virus. During human-to-human transmission, an average 1000 infectious SARS-CoV-2 virions are thought to initiate a new infection.
Indirect contact via contaminated surfaces is another possible cause of infection. Preliminary research indicates that the virus may remain viable on plastic (polypropylene) and stainless steel (AISI 304) for up to three days, but does not survive on cardboard for more than one day or on copper for more than four hours; the virus is inactivated by soap, which destabilises its lipid bilayer. Viral RNA has also been found in stool samples and semen from infected individuals.
The degree to which the virus is infectious during the incubation period is uncertain, but research has indicated that the pharynx reaches peak viral load approximately four days after infection or the first week of symptoms, and declines after. On 1 February 2020, the World Health Organization (WHO) indicated that “transmission from asymptomatic cases is likely not a major driver of transmission”. However, an epidemiological model of the beginning of the outbreak in China suggested that “pre-symptomatic shedding may be typical among documented infections” and that subclinical infections may have been the source of a majority of infections. That may explain how out of 217 onboard a cruise liner that docked at Montevideo, only 24 of 128 who tested positive for viral RNA showed symptoms. Similarly, a study of ninety-four patients hospitalized in January and February 2020 estimated patients shed the greatest amount of virus two to three days before symptoms appear and that “a substantial proportion of transmission probably occurred before first symptoms in the index case”.
A study by a team of researchers from the University of North Carolina found that the nasal cavity is seemingly the dominant initial site for infection with subsequent aspiration-mediated virus seeding into the lungs in SARS-CoV-2 pathogenesis. They found that there was an infection gradient from high in proximal towards low in distal pulmonary epithelial cultures, with a focal infection in ciliated cells and type 2 pneumocytes in the airway and alveolar regions respectively.
There is some evidence of human-to-animal transmission of SARS-CoV-2, including examples in felids. Some institutions have advised those infected with SARS-CoV-2 to restrict contact with animals.
There are still a lot of questions about reinfection and long-term immunity. It is not known how common reinfection is, but reports have indicated that it is occurring with variable severity. The first reported case of reinfection was a 33-year-old man from Hong Kong who first tested positive on 26 March 2020, was discharged on 15 April 2020 after two negative tests, and tested positive again on 15 August 2020 (142 days later), which was confirmed by whole-genome sequencing showing that the viral genomes between the episodes belong to different clades. The findings had the implications that herd immunity may not eliminate the virus if reinfection is not an uncommon occurrence and that vaccines may not be able to provide lifelong protection against the virus. Another case study described a 25-year-old man from Nevada who tested positive for SARS-CoV-2 on 18 April 2020 and on 5 June 2020 (separated by two negative tests). Since genomic analyses showed significant genetic differences between the SARS-CoV-2 variant sampled on those two dates, the case study authors determined this was a reinfection. The man’s second infection was symptomatically more severe than the first infection, but the mechanisms that could account for this is not known.
Reservoir and zoonotic origin
Transmission of SARS-CoV-1 and SARS-CoV-2 from mammals as biological carriers to humans

The first known infections from the SARS-CoV-2 strain were discovered in Wuhan, China. The original source of viral transmission to humans remains unclear, as does whether the strain became pathogenic before or after the spillover event. Because many of the first individuals found to be infected by the virus were workers at the Huanan Seafood Market, it has been suggested that the strain might have originated from the market. However, other research indicates that visitors may have introduced the virus to the market, which then facilitated rapid expansion of the infections. A phylogenetic network analysis of 160 early coronavirus genomes sampled from December 2019 to February 2020 revealed that the virus type most closely related to the bat coronavirus was most abundant in Guangdong, China, and designated type “A”. The predominant type among samples from Wuhan, “B”, is more distantly related to the bat coronavirus than the ancestral type “A”.
Research into the natural reservoir of the virus strain that caused the 2002–2004 SARS outbreak has resulted in the discovery of many SARS-like bat coronaviruses, most originating in the Rhinolophus genus of horseshoe bats. Phylogenetic analysis indicates that samples taken from Rhinolophus sinicus show a resemblance of 80% to SARS-CoV-2. Phylogenetic analysis also indicates that a virus from Rhinolophus affinis, collected in Yunnan province and designated RaTG13, has a 96% resemblance to SARS-CoV-2.
Samples taken from Rhinolophus sinicus, a species of horseshoe bats, show an 80% resemblance to SARS-CoV-2.

Bats are considered the most likely natural reservoir of SARS-CoV-2, but differences between the bat coronavirus and SARS-CoV-2 suggest that humans were infected via an intermediate host. Although studies have suggested some likely candidates, the number and identities of intermediate hosts remain uncertain. Nearly half of the strain’s genome has a phylogenetic lineage distinct from known relatives.
Chinese pangolin
The pangolin coronavirus has up to 92% resemblance to SARS-CoV-2.

A phylogenetics study published in 2020 indicates that pangolins are a reservoir host of SARS-CoV-2-like coronaviruses. However, there is no direct evidence to link pangolins as an intermediate host of SARS-CoV-2 at this moment. While there is scientific consensus that bats are the ultimate source of coronaviruses, the pangolin CoV is sister to both RaTG13 as well as SARS-CoV-2. Based on whole-genome sequence similarity, a pangolin coronavirus candidate strain was found to be less similar than RaTG13, but more similar than other bat coronaviruses to SARS-CoV-2. Therefore, based on maximum parsimony and current sample data, a specific population of bats is more likely to have directly transmitted SARS-CoV-2 to humans than a pangolin, while an evolutionary ancestor to bats was the source of general coronaviruses.
A metagenomics study published in 2019 had previously revealed that SARS-CoV, the strain of the virus that causes SARS, was the most widely distributed coronavirus among a sample of Sunda pangolins. On 7 February 2020, South China Agricultural University in Guangzhou announced that researchers discovered a pangolin sample with a particular coronavirus – a single nucleic acid sequence of the virus was “99% similar” to that of a protein-coding RNA of SARS-CoV-2. The authors state that “the receptor-binding domain of the S protein  of the newly discovered Pangolin-CoV is virtually identical to that of 2019-nCoV, with one amino acid difference.” Microbiologists and geneticists in Texas have independently found evidence of reassortment in coronaviruses suggesting the involvement of pangolins in the origin of SARS-CoV-2. The majority of the viral RNA is related to a variation of bat coronaviruses. The spike protein appears to be a notable exception, however, possibly acquired through a more recent recombination event with a pangolin coronavirus. SARS-CoV-2’s entire receptor binding motif appears to have been introduced through recombination from coronaviruses of pangolins. Such a recombination event may have been a critical step in the evolution of SARS-CoV-2’s capability to infect humans. Recombination events have been key steps in the viral evolutionary process that lead to the emergence of new human diseases. Structural analysis of the receptor-binding domain (RBD) and human angiotensin-converting enzyme 2 (ACE2) complex revealed key mutations on the RBD, such as F486 and N501, which form contacts with ACE2. These residues are found in the pangolin coronavirus.
Pangolins are protected under Chinese law, but their poaching and trading for use in traditional Chinese medicine remains common in the black market. Deforestation, wildlife farming and trade in unsanitary conditions increases the risk of new zoonotic diseases.
All available evidence suggests that SARS-CoV-2 has a natural animal origin and is not genetically engineered. Nevertheless, laboratory origin of SARS-CoV-2 cannot be ruled out. According to computational simulations on protein folding, the RBD of the spike protein of SARS-CoV-2 should have an unremarkable binding affinity. In actuality, however, it has very efficient binding to the human ACE2 receptor. To expose the RBD for fusion, furin proteases must first cleave the S protein. Furin proteases are abundant in the respiratory tract and lung epithelial cells. Additionally, the backbone of the virus does not resemble any previously described in scientific literature used for genetic modification. The possibility that the virus could have gained the necessary adaptations through cell culture in a laboratory setting is challenged by scientists who assert that “the generation of the predicted O-linked glycans… suggest the involvement of an immune system.”
Phylogenetics and taxonomy
Genomic information
SARS-CoV-2 genome.svg
Genomic organisation of isolate Wuhan-Hu-1, the earliest sequenced sample of SARS-CoV-2
NCBI genome ID 86693
Genome size 29,903 bases
Year of completion 2020
Genome browser (UCSC)
SARS-CoV-2 belongs to the broad family of viruses known as coronaviruses. It is a positive-sense single-stranded RNA (+ssRNA) virus, with a single linear RNA segment. Other coronaviruses are capable of causing illnesses ranging from the common cold to more severe diseases such as Middle East respiratory syndrome (MERS, fatality rate ~34%). It is the seventh known coronavirus to infect people, after 229E, NL63, OC43, HKU1, MERS-CoV, and the original SARS-CoV.
Like the SARS-related coronavirus strain implicated in the 2003 SARS outbreak, SARS-CoV-2 is a member of the subgenus Sarbecovirus (beta-CoV lineage B). Its RNA sequence is approximately 30,000 bases in length. SARS-CoV-2 is unique among known betacoronaviruses in its incorporation of a polybasic cleavage site, a characteristic known to increase pathogenicity and transmissibility in other viruses.
With a sufficient number of sequenced genomes, it is possible to reconstruct a phylogenetic tree of the mutation history of a family of viruses. By 12 January 2020, five genomes of SARS-CoV-2 had been isolated from Wuhan and reported by the Chinese Center for Disease Control and Prevention (CCDC) and other institutions; the number of genomes increased to 42 by 30 January 2020. A phylogenetic analysis of those samples showed they were “highly related with at most seven mutations relative to a common ancestor”, implying that the first human infection occurred in November or December 2019. As of 7 May 2020, 4,690 SARS-CoV-2 genomes sampled on six continents were publicly available.
On 11 February 2020, the International Committee on Taxonomy of Viruses announced that according to existing rules that compute hierarchical relationships among coronaviruses based on five conserved sequences of nucleic acids, the differences between what was then called 2019-nCoV and the virus strain from the 2003 SARS outbreak were insufficient to make them separate viral species. Therefore, they identified 2019-nCoV as a strain of Severe acute respiratory syndrome-related coronavirus.
In July 2020, scientists reported that a more infectious SARS-CoV-2 variant with spike protein variant G614 has replaced D614 as the dominant form in the pandemic. In October 2020 scientists reported in a preprint that a variant, 20A.EU1, was first observed in Spain in early summer and has become the most frequent variant in multiple European countries. They also illustrate the emergence and spread of other frequent clusters of sequences using Nextstrain.
In October 2020, researchers discovered a possible overlapping gene named ORF3d, in the Covid-19 virus genome. It is unknown if the protein produced by ORF3d has any function, but it provokes a strong immune response. ORF3d has been identified before, in a variant of coronavirus that infects pangolins.
Structural biology
Figure of a spherical SARSr-CoV virion showing locations of structural proteins forming the viral envelope and the inner nucleocapsid
Structure of a SARSr-CoV virion

Each SARS-CoV-2 virion is 50–200 nanometres in diameter. Like other coronaviruses, SARS-CoV-2 has four structural proteins, known as the S (spike), E (envelope), M (membrane), and N (nucleocapsid) proteins; the N protein holds the RNA genome, and the S, E, and M proteins together create the viral envelope. The spike protein, which has been imaged at the atomic level using cryogenic electron microscopy, is the protein responsible for allowing the virus to attach to and fuse with the membrane of a host cell; specifically, its S1 subunit catalyzes attachment, the S2 subunit fusion.
SARS-CoV-2 spike homotrimer focusing upon one protein subunit with an ACE2 binding domain highlighted
SARS-CoV-2 spike homotrimer with one protein subunit highlighted. The ACE2 binding domain is magenta.

Protein modeling experiments on the spike protein of the virus soon suggested that SARS-CoV-2 has sufficient affinity to the receptor angiotensin converting enzyme 2 (ACE2) on human cells to use them as a mechanism of cell entry. By 22 January 2020, a group in China working with the full virus genome and a group in the United States using reverse genetics methods independently and experimentally demonstrated that ACE2 could act as the receptor for SARS-CoV-2. Studies have shown that SARS-CoV-2 has a higher affinity to human ACE2 than the original SARS virus strain. SARS-CoV-2 may also use basigin to assist in cell entry.
Initial spike protein priming by transmembrane protease, serine 2 (TMPRSS2) is essential for entry of SARS-CoV-2. After a SARS-CoV-2 virion attaches to a target cell, the cell’s protease TMPRSS2 cuts open the spike protein of the virus, exposing a fusion peptide in the S2 subunit, and the host receptor ACE2. After fusion, an endosome forms around the virion, separating it from the rest of the host cell. The virion escapes when the pH of the endosome drops or when cathepsin, a host cysteine protease, cleaves it. The virion then releases RNA into the cell and forces the cell to produce and disseminate copies of the virus, which infect more cells.
SARS-CoV-2 produces at least three virulence factors that promote shedding of new virions from host cells and inhibit immune response. Whether they include downregulation of ACE2, as seen in similar coronaviruses, remains under investigation (as of May 2020).
SARS-CoV-2 emerging from a human cell
SARS-CoV-2 virions emerging from a human cell
Digitally colourised scanning electron micrographs of SARS-CoV-2 virions (yellow) emerging from human cells cultured in a laboratory

Epidemiology
Main article: COVID-19 pandemic
Micrograph of SARS-CoV-2 virus particles isolated from a patient
Transmission electron micrograph of SARS-CoV-2 virions (red) isolated from a patient during the COVID-19 pandemic

Based on the low variability exhibited among known SARS-CoV-2 genomic sequences, the strain is thought to have been detected by health authorities within weeks of its emergence among the human population in late 2019. The earliest case of infection currently known is dated back to 17 November 2019 or possibly 1 December 2019. The virus subsequently spread to all provinces of China and to more than 150 other countries in Asia, Europe, North America, South America, Africa, and Oceania. Human-to-human transmission of the virus has been confirmed in all these regions. On 30 January 2020, SARS-CoV-2 was designated a Public Health Emergency of International Concern by the WHO, and on 11 March 2020 the WHO declared it a pandemic.
The basic reproduction number ({\displaystyle R_{0}}R_{0}) of the virus has been estimated to be around 5.7. This means each infection from the virus is expected to result in 5.7 new infections when no members of the community are immune and no preventive measures are taken. The reproduction number may be higher in densely populated conditions such as those found on cruise ships. Many forms of preventive efforts may be employed in specific circumstances to reduce the propagation of the virus.
There have been about 82,000 confirmed cases of infection in mainland China. While the proportion of infections that result in confirmed cases or progress to diagnosable disease remains unclear, one mathematical model estimated that 75,815 people were infected on 25 January 2020 in Wuhan alone, at a time when the number of confirmed cases worldwide was only 2,015. Before 24 February 2020, over 95% of all deaths from COVID-19 worldwide had occurred in Hubei province, where Wuhan is located. As of 13 December 2020, the percentage had decreased to 0.20%.
As of 13 December 2020, there have been 71,866,583 total confirmed cases of SARS-CoV-2 infection in the ongoing pandemic. The total number of deaths attributed to the virus is 1,607,798. Many recoveries from confirmed infections go unreported, but at least 46,983,190 people have recovered from confirmed infections.