Introduction

Parvoviruses are small non-enveloped viruses with single-stranded DNA genomes ~ 5 to 6 kb in length and possess two main open reading frames (ORFs) encoding non-structural proteins involved in transcription and DNA replication as well as the capsid protein(s) [1]. They belong to the Parvoviridae family of viruses which includes the subfamilies Parvovirinae infecting vertebrates and the Desovirinae that infect arthropods [1]. The Parvovirinae contain eight different genera: Amdoparvovirus, Aveparvovirus, Bocaparvovirus, Copiparvovirus, Dependoparvovirus, Erythroparvovirus, Protoparvovirus and Tetraparvovirus. With the increase in metagenomic studies, the number of parvovirus species, from a diverse range of hosts, has greatly increased in recent years [2,3,4] and transcriptome studies suggest that parvoviruses might span the entire animal kingdom [5].

The genus Copiparvovirus contains two species recognized by the International Committee on the Taxonomy of Viruses (ICTV), the first of which Ungulate copiparvovirus 1 (bovine parvovirus 2 or BPV2) was identified in 2001 in commercial bovine serum [6]. Ungulate copiparvovirus 2 (porcine parvovirus 4) was found in the lung lavage of a diseased pig, co-infected with porcine circovirus type 2 in the USA in 2005 [7]. Further, related, but unclassified viruses have since been reported including two viruses identified in pigs—porcine parvovirus 5 [8] and porcine parvovirus 6 [9] and a virus termed Sesavirus found in the feces of a Californian sea lion pup [10]. During an investigation of the bovine virome of US calf serum, bosavirus was identified [3] and has been found in cattle persistently infected with pestiviruses [11]. A virus identified in sera from European roe deer has been designated Ungulate copiparvovirus 3 and this virus was first detected in Ioxodes ricinus ticks found on the deer [12]. Horse parvovirus CSF was found in the cerebrospinal fluid of an animal with neurological deficits [2] and a second equine virus, termed EqPV-H, was detected in the liver of a horse that died of equine serum hepatitis—Theiler’s disease [13]. Further studies with EqPV-H have shown a strong association with cases of fatal Theiler’s disease and subclinical hepatitis in animals in contact with the animals where a fatal outcome was observed [14]. The study we present here describes the identification of a novel bovine copiparvovirus, designated bovine copiparvovirus species 3 isolate JB9 (bocopivirus 3-JB9) and a related virus, MK1 that shares 99% nucleotide identity with JB9, found in pooled fetal bovine serum (FBS).

Results

Cell culture supernatant from A549 cells infected with hepatitis E virus (HEV) obtained from swine feces was investigated by metagenomic analysis as previously described [15]. Sequences were compared to the GenBank non-redundant protein database using BLASTx with an E value cutoff of < 10–5. As expected, HEV sequences were obtained; however, a small number of reads were related to BPV2, a member of the Copiparvovirus genus, generating three discrete contigs: one mapping to the non-structural protein and the others to the viral capsid protein. Using PCR, it was demonstrated that the novel sequences were present in the FBS used during the HEV cell culture. Sequence gaps between the respective contigs were determined by PCR and sequencing using primers located near the ends of the contigs. The approach to extend the sequences at the 5′ and 3′ ends is described in the supplementary methods using a mutant Taq polymerase (SD polymerase) with high strand-displacement activity [16]. From the analysis of the 5′ end of the genome, it can be inferred, based on primer binding and extension, that positive sense viral genomes are packaged into virus particles.

Droplet digital PCR (ddPCR) was performed using specific primers and hydrolysis probes for the three contigs identified in the initial metagenomic analysis (Supplementary materials and methods). The ddPCR analysis revealed that the concentration of viral DNA was ~ 50,000 copies/ml of FBS irrespective of which assay was used suggesting products were amplified from the same template. Pre-treatment of the FBS with DNAse prior to nucleic acid extraction did not affect the copy number demonstrating that the novel parvovirus DNA was encapsidated and protected from nuclease digestion. The copy number in the cell culture supernatant was tenfold lower than in the bovine serum; this lower concentration reflected the dilution of the serum in the cell culture medium. There was no evidence of replication of the novel parvovirus in the cell culture (either in the cells or cell supernatant) when monitored by ddPCR (data not shown).

Further lots of pooled bovine sera (n = 7 from 4 commercial suppliers sourced in South America and Australia) as well as individual bovine sera or whole blood (n = 43, sourced in Europe) were tested for the novel parvovirus by real-time PCR (Supplementary Materials and Methods) using the three sets of primers/probes used in the ddPCRs. Table 1 provides a summary of the observed cycle threshold (CT) values for the samples that yielded positive results in at least one of the PCRs. Of the individual serum and whole blood samples, one serum sample had CT values of > 37 for all 3 PCRs, whilst 3 of the whole blood samples had CT values of > 37 just for the C2-qPCR. It was not possible to perform sequence analysis on these samples, likely due to low viral loads. In the case of the serum pools, one pool was positive in only one PCR with a CT value of > 39, another pool, however, was positive in all three assays with CT values ranging from ~ 29 to 32 and sequencing (Supplementary Table 1) showed it to be closely related to the virus identified by metagenomic sequencing. The virus strain identified by metagenomic sequencing was termed JB9 and the related virus was termed MK1, both were South American in origin and share ~ 99% nucleotide identity.

Table 1 Testing of individual bovine serum and blood samples for the presence of the novel bovine parvovirus by real-time—CT values of samples testing positive in at least one of more of the three screening PCRs

The sequence determined for strain JB9 is 5599 nucleotides in length with a GC content of ~ 44.5%. In the case of strain MK1, a sequence of 4743 nucleotides was determined. The two novel virus strains are ~ 99% identical and contain two large ORFs. In the case of strain JB9, ORF1 is 1644 nucleotides in length, encoding a protein of 547 amino acids and ORF2 is ~ 3273 nucleotides, encoding 1090 amino acids. It was not possible to amplify the sequences corresponding the very C-terminus of ORF2 of either strain. Comparison of the JB9 ORF2 amino acid sequence with that of BPV 2 reveals that the JB9-encoded sequence is actually slightly longer than that of BPV2, suggesting that the JB9 sequence is unlikely to be significantly longer and is close to being full length. The two main bovine parvovirus JB9 ORFs are separated by 322 nucleotides (as is the case with MK1), this is similar to BPV2 where the distance separating the two main ORFs is 352 nucleotides [6]. Both ORF1 and ORF2 are in the same reading frame and 36 nucleotides’ downstream from the ORF1 stop codon is a further small putative peptide of 45 amino acids, which is found in the same reading frame as both ORF1 and ORF2. The ORF1-encoded protein is homologous to the non-structural protein of parvoviruses NS1, showing greatest amino acid identity with BPV2 (40%); the organizational structure is shown in Fig. 1a. Similar to other parvoviruses, ORF1 contains a HXH domain (amino acids 94–96)—a metal-binding catalytic unit of the endonuclease domain, involved in parvovirus DNA replication as well as helicase motifs (i.e., GPXNTGKX—amino acids 331–338) and ATPase motifs involved in viral DNA replication [4]. The amino acid sequence of the ORF2-encoded protein is homologous to the main viral capsid protein (VP1) of parvoviruses, sharing ~ 47% amino acid identity with VP1 of BPV2 over the near-full length sequence of the strain JB9. The amino acid sequence of the JB9 capsid contains phospholipase A2 motifs—YxGxG (amino acids 285–289) and HdxxY (amino acids 308–312), required for parvovirus infectivity [17] and a glycine-rich region (amino acids 487–491) involved in parvovirus cell entry [18]. Seventy-four nucleotides upstream of the ORF1 start codon is a TATA box; the sequences in the 5′ region of the genome are predicted to form several small hairpins. Phylogenetic analysis (Fig. 1b) was performed using sequences of ORF1 and representative members of the Copiparvovirus genus and prototype strains of other Parvovirinae genera using MEGA 7 [19]. The analysis demonstrates the genetic relatedness of JB9 and MK1 with other members of the Copiparvovirus genus, in particular, BPV2 and the recently identified roe deer copiparvovirus which was originally detected in ticks raises some interesting possibilities in copiparvovirus transmission.

Fig. 1
figure 1

a Genome structure of bovine copiparvovirus 3 isolate JB9 (a) compared to bovine parvovirus 2 (Accession number AF406966). The single-stranded DNA genomes encode two open reading frames (ORFs), ORF 1 encoding the non-structural (NS) protein and ORF2 encoding the structural protein (capsid); the nucleotides numbers above each ORF indicate the position of the respective coding regions for the two viral sequences. The bovine copiparvovirus 3 isolate JB9 ORF2 is near-full length but incomplete. b Phylogenetic tree showing the relationship between bovine copiparvovirus 3 isolates JB9 and MK1and other parvoviruses. The analysis was performed using sequences of ORF1 and representative members of the Copiparvovirus genus and prototype strains of other Parvovirinae genera. Sequence alignment was performed using Clustal Omega (https://www.ebi.ac.uk/Tools/msa/clustalo/) with the default settings. A phylogenetic tree was generated using the neighbor-joining method based on the Kimura 2-paramter method in MEGA 7 [19]. The percentage of replicate trees in which the associated taxa clustered together in the bootstrap test (1000 replicates) are shown next to the branches

Discussion

The identification of JB9 and MK1 as novel species in the Copiparvovirus genus adds to the growing number of parvoviruses identified in bovine serum [3, 6, 20]. To date, attempts to isolate the bovine parvovirus JB9 from the contaminated batch of serum in cell culture, including the inoculation of Madin–Darby bovine kidney cells, have been unsuccessful, possibly due the presence of neutralizing antibodies in the serum pool. Although the role of JB9 in bovine pathology, if any, remains to be elucidated, knowledge of the sequence and the availability of specific assays will be important for further investigations. Viruses detected in bovine serum pose a potential concern for manufacture of biological medicinal products when serum is used as a raw material or a medium supplement. This is similar to the detection of human parvovirus 4 (PARV4) in plasma fractionation pools and in factor VIII concentrates [21, 22]. Theiler’s disease, a form of equine hepatitis, has been reported to occur for over a century, albeit infrequently, in horses after the transfusion of different types of equine-derived blood products. The recent identification of EqPV-H, another copiparvovirus, and growing evidence of its role in Theiler’s disease [14] suggests that there may be further examples of copiparvovirus-associated pathologies which remain unexplored.