Introduction

Patent registration generates huge data repositories that constitute an important source of knowledge about innovation. A growing body of literature in economics, business management, network science, and the legal field, suggests that analyses of patent data can provide ample information about innovation processes, about the traits of specific inventions, and about technological domains. This growing awareness, together with developments in big data analyses have led to an upsurge in the exploration of information stored in patent databases in the recent decades.Footnote 1

A significant part of the research in this area is devoted to the study of patent citations, and specifically to the study of highly cited patents. The perception underlying this line of scholarship is that patent citations reflect technological relations between the citing and the cited patents (e.g. Trajtenberg 1990; Harhoff et al. 1999; Jaffe and Trajtenberg 2003; Hall et al. 2005). Therefore, a large number of follow-on citations received by a patent is perceived as an indication of the impact, technological quality, value, and even breakthrough nature, of the patented technology (e.g., Fleming, 2001; Dahlin and Behrens, 2005; Érdi et al. 2013; Arts and Veugelers 2015).Footnote 2

Yet, there is another side to patents citations that can provide important information about innovation and about the patent system, but has been essentially ignored in the literature to date: patents that do not receive any citations. In this Article we term them “uncited patents”. While some scholarship incidentally observed that patents without citations could be a proxy for low quality inventions,Footnote 3 no study has systematically focused on uncited patents. How many uncited patents are there? Does the percentage of uncited patents change over time, or across technological fields? Can we identify traits that characterize uncited patents? These and additional questions have been left unanswered in the literature to date. The lack of focus on uncited patents may be understandable, in light of the general human tendency to concentrate on positive information (in our case: successful patents that receive many citations), and ignore negative information.Footnote 4 Yet, if patent citations provide information about the quality and value of patented technologies, then studying patents without citations is necessary, in order to get a fuller picture about the patent system, and the way it fulfills its ultimate task: promoting valuable innovation.Footnote 5

In this Article we aim to fill-in this gap, by providing the first systematic exploration of uncited patents. We analyze all U.S. patents granted between 1976 and 2008 (3,079,587 observations overall). First, we focus on the ratio of uncited patents out of all patents granted each year, and track the changes in this percentage during a period of more than three decades. Second, we inquire whether there are differences in uncited patents across technological fields, comparing, primarily, the fields of pharmaceuticals and software-related patents. Third, we inquire whether there is a link between an “uncited status” and specific patent traits. As part of the latter inquiry we examine a series of factorsincluding the number of backward citations a patent has, the number of inventors listed on the patent, the number of subclasses to which the patent belongs, the number of claims in the patent, and the degree of similarity between the uncited patents and the prior art which they citein order to explore whether there are differences between uncited and cited patents. We also explore the relations between an “uncited status” and the payment of patent renewal fees (that prevents the expiration of the patent before the end of the term).

Our findings reveal a robust pattern in which the ratio of uncited patents out of all granted patents decreased between 1976 and the mid-1990s, but has been significantly increasing since then. Graphically, then, the ratio of uncited patents across the period of our study forms a rough “U” shape. These findings are robust after controlling for patents’ age and additional factors, and are consistent across technological fields. Since patent citations are perceived as an indication of technological quality, impact, and value, these findings are concerning.

Interestingly, we observe differences among technological fields in the ratio of uncited patents. Prominently, the percentage of uncited patents is significantly higher for pharmaceutical drug patents, in comparison to patents in the software and tech-related fields.Footnote 6

We also find that uncited patents are negatively associated with the number of backward citations, the number of subclasses, and the number of claims in the patent.Footnote 7 In other words, large numbers of backward citations, subclasses, and claims increase a patent’s chances of being cited. Because numbers of claims, subclasses and backward citations were recognized in the literature as positively related to high-quality patents, these latter findings indicate that on average, the quality of uncited patents may indeed be lower, in comparison to cited patents.Footnote 8 In addition, we find that uncited patents are negatively associated with the payment of patent renewal fees by the patent owners, relative to cited patents, a finding which provides another indication for lower quality.

The general picture emerging from our study is concerning. Out of all patents granted by the USPTO, the percentage of uncited patents across domains has been increasing significantly from 1996 through 2008, which is the last year in our study. We discuss possible explanations for this trend, and, while we cannot offer a definitive interpretation, we locate our findings in the context of the literature concerning “patent explosion” and patent quality.

More broadly, our systematic focus on uncited patents presents a new, additional measure to evaluate the shape of the patent system, and opens up a new research avenue in the analysis of patent data. We conclude by sketching, in broad strokes, additional directions for exploration that would shed further light on the negative information embedded in patent repositories, and complement the research on the positive aspects thereof. Such studies could provide us with a deeper and more nuanced understanding of the patent system, as well as the innovation processes it seeks to promote.

This Article proceeds as follows: We begin with the theoretical background on the scientific uses of patent citations as a proxy for processes, traits, and trends of innovation. We then proceeds to quickly review the scholarly discussion of patent quality that is relevant for the following analysis. The next section describes our dataset, methodology, and findings, which are also graphically presented in a series of figures and detailed in technical appendices. We then discuss the potential significance and potential policy implications of these findings, describe several limitations of our study, and sketche potential directions for further research.

Theoretical background: patent data and patent citations

Our study of uncited patents draws on several strands of literature. The first is the large body of economic, network science, and legal scholarship that explores patent data, in order to gain valuable insights about the patent system, and the innovation ecosystem. These studies are based on the understanding that patent repositories are not merely registries of legal rights, but also an excellent source for data, which can provide us with new insights about innovation processes, about the traits of specific inventions, and about the shape of the patent system more generally (e.g., Trajtenberg, 1990; Jaffe and Trajtenberg 2003; Érdi et al., 2013). The registration requirement, which is a pre-condition for obtaining patent rights, implies that patent repositories include significant and highly structured information with respect to each and every registered patent. In addition to information about the invention itself, the registration includes a host of additional data, such as data about the inventors, the patent owners, the technological classes to which the invention is assigned by the Patent Office, and additional factors.Footnote 9

A prominent thread within this body of research is devoted to patent citations, and particularly to highly cited patents. Patent citations are citations of prior art pertaining to the invention. These citations are commonly comprised of previous patents, and infrequently also of scientific literature. Because a patentable invention must be “new “ and “non-obvious” relative to the prior art,Footnote 10 prior art plays a crucial legal role in the decision whether the invention deserves patent protection (e.g., Érdi et al., 2013). Therefore, the citation of relevant prior art is required as part of submitting a patent application.Footnote 11 The applicant’s citations are reviewed by the Patent Office examiners, who often contribute additional citations (e.g., Alcacer et al., 2009).

Citations, therefore, reflect relations between inventions: broadly speaking, backward citations—citations made by a patentreflect the previous technologies related, or providing building blocks to the patented invention, while forward citations—citations received by a patentreflect the invention’s impact on subsequent technologies (e.g., Trajtenberg et al. 2003b; Lanjouw and Schankerman, 2001). The underlying assumption is that if the technology embedded in the patent is valuable for technological progress, future patents relying on that technology would cite the original patent (e.g., Trajtenberg 1991; Fleming 2001; Érdi et al. 2013; Arts and Veugelers 2015).

Indeed, numerous studies found positive correlations between large numbers of forward citations and various external indications of value. Trajtenberg’s pioneering study of patent citations in the field of CT scanners (1990) showed a close association between high number of forward citations and the amounts expended on R&D. Alberta et al. (1991) found that highly cited patents were valued by experts as technically important; Harhoff et al. (1999) and Moore (2005) found a positive association between patent citations and the payment of patent renewal fees, so that patents renewed to their full term were “significantly more highly cited than patents allowed to expire before their full term” (Harhoff et al. 1999: 515). Allison et al. (2004) found that litigated patents had higher numbers of forward citations, relative to non-litigated patents. Since the involvement of a patent in litigation is a strong indication of its value, this finding provides another indication for the link between forward citations and patent quality (broadly defined). Likewise, Hall et al. (2005) found that firms whose technology was based on highly cited patents had higher market values.Footnote 12 Forward citations have therefore become an acceptable, if noisy, indication for “patent quality”, in a broad sense which refers to the technical quality, impact or value of inventions (e.g., Fleming 2001: 142; Strandburg et al. 2006; Csardi et al. 2009: 108).

A related line of research investigates various traits of highly cited patents, in an effort to understand what “building blocks” make a successful invention. Studies in this vein found positive association between high numbers of forward citations and the number of technological sub-classes to which the patent belongs, as determined by the patent office (Schoenmakers and Duysters 2010; Yoshikane et al. 2012). Similarly, research found that highly cited patents have, on average, more backward citations (namely, they cite more patents), and also have more claims, relative to other patents (Kelley et al. 2013; Lanjouw and Schankerman, 2004). Likewise, Wuchty et al. (2007) found that patents invented by more than one inventor, are more likely to be highly cited than patents with a single inventor. However, the relations between these factors and uncited patents were not systematically examined.

Additional studies, of particular relevance to our research, concentrate on patent citation patterns. These studies examine temporal changes that occur in patent citations over the years in order to draw broader insights about the patent system and about innovation processes. Several studies indicate that patents reach the peak of their citations during the early years after patent grant, with certain variations among technological domains (Strandburg et al. 2006; Mehta et al. 2010).

Kuhn et al. (2018) detect a recent change in patent citation patterns whereby a small minority of patent applications are generating a large majority of patent citations, and argue that the technological similarity between citing and cited patents has significantly weakened in recent years. Strandburg et al. observed an increasing gap between the least and most cited patents since the late 1980s, and suggested that this increase may result from issuing patents on more trivial advances, of lesser technical value. A subsequent work showed that this trend has leveled around 2000 (Strandburg et al. 2006; Csardi et al. 2009). Other research found that although newer patents have more backward citations (i.e., cite more patents) in comparison to older patents, the average number of citations received by a patent has been declining over the years, and proposed that this trend may stem from the grant of lower quality patents (Mehta et al. 2010).

Concomitantly, numerous legal scholars expressed concerns about a decline in patent quality, and argued that the grant of low-quality patents contributes to a sharp increase in the numbers of patent application and patents granted over the past decades, a trend often referred to as a “patent explosion” (e.g., Allison et al. 2010; Masur 2011: 472; Chien 2018). This scholarship further submits that this phenomenon buttresses non-practicing entities (commonly known as patent trolls), and produces other negative externalities that overall hinder, rather than promote, innovation (e.g., Sichelman 2010).

However, the rich literature on patent citations, whether its focus is on overall citation patterns or on the attributes of inventions, largely concentrates on positive information and on highly cited patents. Conversely, patents that receive no citations have not been the subject of substantial scholarly attention. While several recent studies focused on uncited scientific papers (Larviere et al. 2009; Noorden 2017), uncited patents have not received such focus. Yet, negative informationin our case, information about patents with no citationsis essential for getting a full picture of the patent system, and more broadly of the innovation ecosystem.Footnote 13 We therefore seek to fill in this gap in the literature, by providing a first systematic inquiry of uncited patents. The next Section details the methods of our investigation and describes our findings.

Data, methods and findings

What percentage of patents receive no citations at all? Are there differences in these percentage across technological domains? Does the pattern change over time? And are there specific traits which can be associated with uncited patents? In order to explore these questions, we analyzed data extracted from the USPTO database, on all US patents granted between 1976 and 2008. Our data includes a total number of 3,079,587 observations.Footnote 14

For each year in our study period, we identified the absolute numbers and percentage of patents granted that have not been cited. As we explain in the following section, we control for patents’ age and include citations for 10 years following patent grant.

In order to check the robustness of our findings, we further performed several robustness analyses. Primarily, (1) we repeated our analysis while excluding self-citations from the citation counts; (2) we repeated our analysis while considering citations to published patent applications in addition to citations to granted patents; and (3) we repeated our analysis with a universe of patents comprised of uncited patents and patents with only a single citation (“once-cited patents”). The results we describe in the following paragraphs are robust with respect to all these robustness analyses, which are reported in detail below and in the Appendix.

Temporal pattern of non-citation

As indicated above, according to previous studies patents usually reach their citation peak within approximately four years after issuance (Strandburg et al. 2006; Mehta et al. 2010; cf. Hall et al. 2003). Nevertheless, an “uncited” status is never final. The exploratory and combinatory nature of innovation, together with the legal requirement to cite any relevant prior art, imply that a patent can receive citations at any point during its lifetime, and also after its expiry (Strandburg et al. 2006; cf. Fleming and Sorenson 2001). Thus, older patents have more opportunities to gain citations. As a result, in order to accurately compare the percentages of uncited patents across the period of our study we needed to control for patents’ age. We therefore considered citations received in the first 10 years following the grant of the patent. Consequently, our data includes patents issued no later than 2008, so as to allow all patents in our database an equal ten-year period to gain citations.

Our data indicate that, overall, eleven percent of the patents issued during the period of our study did not receive any citations during ten years from issuance. Yet, our data reveals an interesting temporal pattern. Figure 1 shows this temporal pattern, namely the ratio of uncited patents to total patents issued each year during our study period, after controlling for patents age.

Fig. 1
figure 1

Uncited Patents 1976–2008 (10 years)

Figure 1 demonstrates a U-shaped pattern: From 1976 to (roughly) 1996, the percentage of patents that were uncited decreased over time from essentially 22 percent in 1976 to less than 8 percent in 1996. Since 1996, the percentage of uncited patents increased steadily over time, reaching 17 percent in 2008.

Interestingly, this trend is contrary to the trend observed in studies of scientific papers (Larviere et al. 2009). While the percentage of uncited science has been decreasing over the years, the percentage of uncited patents has been continuously increasing since the mid-1990s. In the “Discussion” section we discuss the significance of this pattern and suggest possible explanations.

Non-citation patterns across technological fields

Does this temporal pattern subsist across technological fields? Are there differences in the percentages of uncited patents among the different fields? We performed an initial inquiry of this question, by distinguishing between three categories of patents: (1) pharmaceutical drug patents; (2) software-related patents, which we defined in a broad manner, as including patents in information and communications technology (ICT), fin-tech and med-tech; and (3) all other patents, namely patents not included in categories (1) and (2). In order to identify the patents which belong to each of these groups, we used the patent classification system, which assigns each patent into subclasses, in accordance with the invention’s technological features.Footnote 15

Figure 2 shows the percentage of uncited patents for each of the three categories, together with the general, all-patents data.

Fig. 2
figure 2

Uncited patents by categories (10 years)

Similar to our overall data, the pattern of uncited patents in all the three categoriessoftware-related (yellow curve) pharmaceuticals (blue curve), and others (grey curve)exhibits a U-shape. The percentage of uncited patents in each of these categories decreased between 1976 and (roughly) 1996, and increased since 1996 and until the end of our period.

However, despite the general similarity in the pattern, there are striking differences among the three groups in the percentages of uncited patents. The percentage of uncited patents in the pharmaceutical drugs field is significantly higher across the entire period, in comparison to software related patents. The ratio of uncited patents in our third, residual, category lies somewhere between the two other categories, and closely tracks the overall non-citation ratio. The fact that a higher percentage of pharmaceutical patents remain uncited, in comparison to software related patents, is somewhat counter intuitive, given that patent protection in the pharmaceutical field is often perceived in the literature as more necessary and justified, in comparison to the field of software. We return to this finding in the “Discussion” section.

Lack of citations and patent traits

Is there a possible association between lack of citations and certain patent traits? We examined this question with respect to the following five factors, which we include as explanatory variables in our study:

  1. (1)

    Number of backward citations–the number of citations made by a patent to preexisting patents;

  2. (2)

    Number of subclasses—the number of subclasses to which the patent was assigned by the USPTO examiners, in accordance with the invention’s technological traits;

  3. (3)

    Number of claims listed on the patent. The claims determine the scope of the exclusive rights afforded to the patentee. Their initial number is determined by the applicant, and then reviewed by the USPTO examiners;

  4. (4)

    Number of inventors appearing on the patent; and

  5. (5)

    Backwards similarity—a variable which reflects the degree of similarity between the patent and its backward citations.

The information about the first four factors appears on the patent. In order to calculate backwards similarity we used the similarity index developed by Lanjouw and Schankerman (2001). Simply put, the index determines similarity on a scale between 0 and 1, in accordance with the fraction of backward citations that belong to the same class as the citing patent, out of the total number of backward citations. To illustrate, a similarity index that equals 1 implies that all cited patents belong to the same subclasses as the citing patent, while a value of 0 implies that all backward citations belong to subclasses that are different from the subclasses of the citing patent.

Intuitively, one would expect that higher values of the first three factors would increase the likelihood of a patent to be cited, since these factors are perceived as indications of the invention’s technological breadth, quality, and impact. Indeed, previous studies found positive associations between higher numbers of backward citations (e.g., Yoshikane et al. 2012; Kelley et al. 2013), sub-classes (e.g., Schoenmakers and Duysters 2010; Yoshikane et al. 2012; Kelley et al. 2013), and claims (e.g., Tong and Frame 1994; cf. Lanjouw and Schankerman 2004) on the one hand, and a high number of forward citations on the other. In addition, Allison et al. (2004) found that litigated patents (which are perceived as valuable patents), have higher numbers of claims and backwards citations relative to non-litigated patents. Using data on patents issued in 1991, Moore (2005) further found that patents which expired early due to non-payment of renewal fees by their owners generally had fewer backward citations and fewer claims, relative to patents that were maintained to their full term. Previous research also indicates that patents produced by more than one inventor are more likely to be highly cited than those produced by a single inventor, when self-citations are included (Wuchty et al. 2007). As for the fifth factor, namely the similarity between the patent and its backward citations, existing studies indicate that the relations between this factor and the patent’s quality and impact are more nuanced (see, e.g. Phene et al. 2006; Nemet et al. 2012).

The two tables above provide summary data for these variables. Table 1 described the cited patents, while Table 2 describes the uncited patents:

Table 1 Descriptive statistics: 1976–2008: cited patents
Table 2 Descriptive Statistics: 1976–2008: uncited patents

The data in Tables 1 and 2 indicate that uncited patents have, on average, fewer backward citations and less claims than cited patents, and also belong to fewer technological subclasses. Backward similarity values and the number of inventors are roughly the same for cited and uncited patents.

We further conducted a regression analysis of these data. The details of our regression and formal econometric analysis, including a summary of our regression results, appear in Appendix 1.

Our analysis demonstrates that patents with more backward citations, more subclasses, and more claims are much more likely to receive at least one citation, and this effect is statistically significant.Footnote 16 Since numbers of claims, subclasses, and backward citations have been previously recognized as positively related to highly cited patents, which are considered high-quality patents, these findings are not entirely surprising. Yet, they provide an initial indication that the quality of uncited patents is indeed lower.

Importantly, the regression results in Tables 3 and 4 in the Appendix also demonstrate that our findings concerning the “U” shape temporal pattern of uncited patents continue to hold, even after we control for all these factors. In other words: after controlling for a series of prevalent factors that might affect citations, the likelihood of not being cited decreases from 1976 to (roughly) 1996, and increases consistently from 1996 through 2008.

Lack of citations and patent renewals

In addition to the aforesaid traits, we also examined whether uncited patents are less likely to be renewed by their owners, relative to cited patents.

Patent owners are required to pay periodical maintenance fees (also known as “renewal fees”) during the term of the patent.Footnote 17 The non-payment of those fees results in the expiration of the patent before the end of the statutory term (although, or course, the patent still appears in the repository and can be cited as prior art). The payment of renewal fees therefore constitutes an external indication for the value of the patent. Previous studies indeed found positive associations between the payment of renewal fees and high number of forward citations (Harhoff 1999; Moore 2005). Against this background, we examined whether an “uncited” status is associated with the non-payment of patent renewal fees, using USTPO data about patents which expired due to non-renewal.Footnote 18

We find a negative and significant correlation between patents that were cited at least once and the nonpayment of patent maintenance fees, which indicates that uncited patents, on average, are less likely to be renewed, relative to cited patents. Further, the partial correlation between uncited patents and nonpayment of patent renewal fees is higher (in absolute value) than the partial correlations between uncited patents and all but one of the other traits we employed in the analysis.Footnote 19

We further conducted a regression analysis which demonstrates that the “nonpayment of patent maintenance fees” is negatively associated with whether a patent has been cited and that this effect is statistically significant. The analysis further shows that these findings continue to hold after we control for the five patent traits we examined above. These results provide another external indication, in addition to the patent traits described above, that the quality of uncited patents may indeed be lower in comparison to cited patents.

Our findings concerning patent renewals are further detailed in the Appendix, Tables 5 and 6.

Robustness analyses

In order to check the robustness of our findings, we performed the following three analyses.

1) Excluding self-citations

As explained above, in our main analysis we considered self-citations as citations. In order to check the robustness of our findings we performed an additional analysis with self-citations excluded. In other words, under this second analysis citations to a patent by a subsequent patent with the same patent holder or the same inventor are not considered “citations”, and a patent whose sole citations are self-citations would be considered “uncited”. Figure 3 below shows the percentages of uncited patents by categories, with self-citations excluded.

Fig. 3
figure 3

Uncited patents by categories (10 years), no self-citations

As is apparent from Fig. 3, while excluding self-citations somewhat raises the percentages of uncited patents, the temporal U-shape pattern for all categories remains unchanged. Additional details about this analysis appear in the Appendix.

2) Citations to patent applications

Following a statutory amendment in U.S. patent law, applications filed on or after November 29, 2000 are generally published after the lapse of eighteen months from submission, whether or not a patent has been granted.Footnote 20 Therefore, when patent applications were published before grant, citations may refer to the patent application, and not only to the granted patent (Kuhn et al. 2018; and Cotropia and Schwartz, 2020). In order to verify that the increase in the rate of uncited patents did not result from citations to patent applications (which replaced citations to granted patents) we repeated our analysis while taking into account citations made to patent applications: For patents published since 2002, we linked and united the citations to the published applications with the citations to the patents themselves. In order to maintain our age limit, we counted all citations (either to the patent application or the patent itself) during the ten year period following the publication of the patent application. The details of our analysis appear in the Appendix (“Robustness analysis: citations to published patent applications”), Tables 7 and 8. As these tables demonstrate, the U-shape we identified continues to hold even when considering citations to patent applications, and importantly, the percentage of uncited patents still increases consistently from (roughly) 1996 through 2008. Thus, our results cannot be attributed to the aforesaid change in patent policy.

3) Uncited versus once-cited patents

Finally, we repeated our analysis while considering uncited patents versus patents that received a single citation during a ten-year period (“once-cited patents”). The details of this analysis appear in the Appendix (”Robustness analysis: uncited versus once-cited patents”, Tables 9 and 10)

As these tables show, the results of our analysis are qualitatively similar. Most importantly, the U-shape pattern of non-citations continues to hold even when our “universe” of patents is comprised only of patents that were uncited and patents that received just a single citation: between 1976 and (roughly) 1996, the percentage of uncited patents decreased, whereas between 1996 and 2008, this percentage has been increasing (essentially monotonically).Footnote 21

In addition, examining the patent traits described above, when the”universe of patents” is comprised only of uncited patents and once-cited patents, shows that backwards citations, claims, and the number of technological subclasses are very significantly associated with whether the patent was cited.Footnote 22 As expected, the magnitudes of the coefficients are smaller in comparison to our “base case” where we compare uncited patents with the entire universe of all cited patents. These results support the robustness of our findings and provide yet another indication that the quality of uncited patents may indeed be lower.

A long tail of patent citations?

Finally, we take a look at our raw data, without controlling for patents’ age. While our 10-year limit is necessary in order to compare and evaluate citation patterns, looking at the raw data without controlling for age can provide us with a different insight regarding the actual existence of citations beyond the 10-year period. Figure 4 shows our results for patents that did not receive any citations, by technological categories, without controlling for patent’s age.

Fig. 4
figure 4

Uncited patents by categories (all years, self-citations included)

The oldest patents in our database had more than forty years to gain citations, in comparison to the youngest patents, that had only ten years. Despite this distorting factor, this figure still demonstrates a decrease in the percentage of uncited patents between 1976 and 1996, and a change of this trend from 1996 onwards. Interestingly, however, the percentage of ‘old’ uncited patents drops significantly in all categories when we lift the 10-year limitation. For example, 22 percent of all patents issued in 1976 were uncited after 10 years (Fig. 1), but only 5 percent remained uncited in 2018, 42 years from issuance. Similarly, for pharmaceutical patents issued in 1976 the percentage of uncited patents drops from 44 percent after 10 years to roughly 20 percent after 42 years.Footnote 23

These results imply that even if patents remain uncited for ten years after grant, many of them can still receive at least one citation after that period. In other words, the patent citation tail is a ‘long tail’. These findings are consistent with a previous study by Standburg et al., which observed that “patent citability dies out surprisingly slowly with age” (Strandburg et al. 2006: 1340–41). They also strengthen and reinforce a growing stream of research maintaining that innovation is not a strictly cumulative and linear process, but rather a combinatory, exploratory and networked process, whereby old technologies may gain new significance at a later stage (e.g., Fleming and Sorenson 2001; Fleming 2001; Shur-Ofry 2017). This line of inquiry deserves further research.

Discussion

What are the possible interpretations of our findings, and what is their significance for innovation policy?

Several factors may explain the U-shape temporal pattern. A plausible explanation for the left side of the U, namely the decline in the ratio of uncited patents between 1976 and 1996, may be the substantial improvement in patent search tools. During the first years of our period patents were published on paper, and had to be searched manually (Grigg 2003). This state of affairs gradually changed with the introduction of a system that allowed a computerized CD-ROM-based search in 1988, and an internet-based search in 1996. The website launched by the USPTO in 1996 was first limited to bibliographic patent information, but since 2000 enabled to search the full-text of all patents issued from 1976 onwards (and some more limited search of earlier patents) (Grigg 2003). This timeline is largely consistent with our findings that the percentage of uncited patents reached its lowest value around 1996.

The increase in the ratio of uncited patents since 1996 is possibly connected to the sharp increase in the overall number of patents issued by the USPTO during that period. According to USPTO data, in 1976, the first year of our study, the cumulative number of patents in the USPTO registry, which started in 1790, was approximately four million patents. In 2008 the number of patents exceeded seven million. Figure 5 shows the percentage of uncited patents against the cumulative number of US patents during the period of our study.

Fig. 5
figure 5

Cumulative number of US patents vs. Percentage of Uncited Patents 1976–2008

A closer look at the number of patents issued per year during our study period reveals a sharp rise in issued patents during the second half of our period, from the mid-1990s. Figure 6 shows the percentage of uncited patents against the yearly numbers of US patents issued during the entire period of our study.

Fig. 6
figure 6

Yearly Numbers of Patents Issued vs. Percentage of Uncited Patents, 1976–2008

This significant increase in patents issued since the mid-1990s is largely consistent with the increase we find in the percentage of uncited patents during the same period. Hence, one possible interpretation of our results is that despite the improvement in search capabilities, the search for prior art becomes more difficult the more patents there are in the registry. Therefore, in an era of “patent explosion” more patents are left uncited.

A second interpretation, which is not contradictory to the first one, concerns patent quality. In recent years, numerous legal scholars expressed concerns about a decline in patent quality. This literature generally maintains that the issuance of low quality patents is a major cause for the recent patent explosion. It argues that low quality patents often find their way to the hands of non-practicing entities (commonly known as patent trolls), create “patent thickets” that hinder technology commercialization, and produce other negative externalities that overall impede, rather than promote, innovation (e.g., Merges 1999; Lemley and Sampat 2008; Hemphill and Sampat 2012; Lemley 2013; Chien 2018; Frakes and Wasserman 2019). The growing consensus that the quality of patents granted deserves the attention of policy makers has led the U.S. Patent Office to declare patent quality as a priority, and to consider various ways to improve it.Footnote 24

While we cannot offer a definitive conclusion, to a certain extent the picture emerging from our study reinforces patent-quality concerns. Because citations are perceived as an indication of quality (in the broad sense used in this paper, that includes, beyond technological quality, properties such as impact, or value), the increase in the percentage of patents with zero citations could imply an increase in low-quality patents that do not serve as building blocks for subsequent innovation.

Our analysis further reveals negative relationships between uncited patents and several patent traits that were previously associated with successful, highly cited, patents, namely numbers of backward citations, subclasses, and claims. We found that uncited patents largely possess “less” of those traits. We also found that uncited patent are less likely to be maintained by their owners through the payment of periodical renewal fees. These findings may not be surprising, yet they reinforces the concerns that uncited patents are indeed of lower quality, and that the increase in the rate of uncited-ness signals a decrease in patent quality. The fact that, unlike patents, the percentage of uncited scientific papers is continuously decreasing (Larviere et al. 2009; Noorden 2017) may provide further support for this explanation.

Cautionary notes are warranted. First, the factors we examined are not exhaustive, and further studies could expand this line of exploration and examine the links between uncited patents and a series of additional traits and external factors, which were previously linked with highly cited patents.Footnote 25 These factors include, for example, expert evaluations (Albert et al. 1991), R&D investments in the underlying technologies (Trajtenberg 1990) and the owners’ market value (Hall 2005). We outline below additional directions for such explorations that would deepen our understanding of the association between uncited-ness on the one hand, and lack of social or private value, on the other. Secondly, patent citations are a rough proxy, and cannot provide completely precise information of the value of specific patents: put differently, “zero citations” does not necessarily imply “zero quality”. And while our examination paints a general picture, specific cases may of course deviate from that picture. Nevertheless, the “big picture” emerging from our data raises concerns about patent quality.

The differences we find in the percentage of uncited patents between categories, particularly between drug patents and software related patents are counter-intuitive. The pharmaceutical field is considered the “poster child” of the patent system, and the area in which strong patent protection is most justified (e.g., Ouellette 2010: 300). Conversely, there is a longstanding policy debate that casts doubt on the justifications for and necessity of software patents (e.g., Landes and Posner 2003: 326; Bessen and Meurer 2008; Bessen and Maskin 2009). Yet, our analysis demonstrates that the percentages of uncited patents are consistently higher for drug patents in comparison to software-related patents.

One possible explanation may relate to the tendency of pharmaceutical companies to protect drugs by a series of secondary patents that do not involve new active ingredients, in order to prolong their life-cycle. Studies indicate that this phenomenon, also known as “ever-greening”, has been growing over time. For example, Ouellette (2010) observed that the average number of patents per drug increased from 2.5 in the late 1980s to nearly 3.5 in 2005. Kapczynski et al. (2012) found that around 50% of drugs are protected by secondary patents, while Feldman (2018) noted that 78% of the drugs associated with new patents in the FDA’s records between the years 2005 and 2015 were not new drugs coming on the market, but existing drugs that are “recycled” by their owners. These secondary patents are often considered low-quality patents (e.g. Hemphill and Sampat, 2012: 345), which may explain the relatively high percentage of uncited drug patents. Another explanation may be that the differences we find are related to the much larger number of software-related patents relative to drug patents in the groups we examined.Footnote 26

Yet another explanation may concern references to non-patent literature. Studies suggest that patents in the life-sciences tend to cite more scientific, non-patent literature, relative to patents in other fields (e.g., Ahmadpoor and Jones 2017; Verbandt and Vadot 2018). Possibly, such references to non-patent literature come, de facto, “at the expense” of citation of drug-patents.Footnote 27 Relatedly, Glänzel and Meyer (2003) found that scientific articles in the field of “Drugs and Medical” were more likely to cite patents, relative to articles in the field of “Computer and Communications”, which may imply that some of the value of some pharmaceutical patents is actually manifested in contribution to science.Footnote 28 Finally, our findings possibly reflect different dynamics of innovation, for example that stakeholders in the pharmaceutical area “rush to patent” at an early stage, and register patents in numerous directions, many of which do not mature into valuable innovation, or that innovation in this field has less cumulative traits than in the software-related domains.Footnote 29 All these hypotheses, however, require further exploration.

The finding that the percentage of uncited patents in software-related fields is lower than the average for all technological fields is also surprising, in light of prominent criticisms that software patents are often trivial, and have limited use as sources for subsequent developments (e.g., Bessen and Meurer 2008). One should note that our definition of software-related patents is broad and includes patents in the fields of information and communications technology, fin-tech and med-tech. Subject to this broad definition, our analysis seems to reflect the cumulative nature of innovation in the tech-related fields, and suggests that software-related patents provide building blocks to subsequent technologies, no less (and even more) than other categories of patents. It should be noted that because the youngest patents in our database were issued in 2008, our results do not reveal the potential impact of the recent U.S. Supreme Court cases that raised the threshold of patentability in the software field.Footnote 30 The question whether and how these decisions, which, presumably, could raise the quality of software related patents, will impact the rate of uncited patents in this area certainly warrants future examination.Footnote 31

Overall, although our study indicates that a substantial majority of patents receive at least one citation within 10 years from grant, the trend we identify is disturbing. If innovation is a networked process, the fact that more and more patents remain outside the network is a cause for concern. We therefore believe that a systematic exploration of the rate of uncited patents and the tracing of temporal changes in this rate can provide a complementary, evidence-based measure, for evaluating the shape of the patent system and the overall quality of the patents granted by the USPTO.

Nevertheless, the exploration we present in this Article is merely a first step in this direction and one should be cautious in the interpretation of our findings. One limitation relates to the period of our study, which ends in 2008. As we explain earlier, this limitation stems from the need to allow patents a sufficient period of time to gain forward citations. This implies that our findings do not reflect changes in non-citation rates that may have occurred after the passage of the America Invents Act of 2011.Footnote 32 The legislation introduced a series of amendments to the U.S. Patent Act designed to improve patent quality, including a legal mechanism to challenge patents post grant (see, e.g., Carrier 2012). The future tracing of the rates of uncited patents before and after the reform may provide insight as to its effect, and certainly warrants future research.

Similarly, our study examined citations of patents by subsequent patents. Yet patents can also be cited in scientific literature (Glänzel and Meyer 2003), which implies that the cited patent served as a source of knowledge, and therefore has some social value (Ouellette 2012: 555; Fromer 2016: 1717). Future research can examine whether and to what extent any of the uncited patents received citations in scientific literature. Nevertheless, since previous research indicates that the vast majority (98.5%) of US patents have not been cited in scientific literature (Glänzel and Meyer 2003: 422), this inquiry is unlikely to significantly alter our main findings.

Likewise, our inquiry is limited to patents issued in the U.S. However, U.S. patents can also be cited by foreign patents, or by applications that did not mature into an issued patent.Footnote 33 Thus, another line of research would be to inquire whether, and to which extent, patents that receive no citations from subsequent U.S. patents still receive citations from other sources, including scientific literature, foreign patents, or abandoned applications.

Finally, inventions could potentially have commercial value despite lack of citations to their underlying patents. Therefore, another direction for future exploration would be to cross the data on uncited patents with data concerning patent transactions,Footnote 34 with data about patents involved in litigation (in itself a proxy for commercial valuecf. Allison et al. 2004), and with data about patents held by non-practicing entities.Footnote 35

These directions for future research are of course non-exhaustive. Rather, they illustrate the broad potential of the approach we choose in this study. Overall, the focus on uncited patents opens up myriad research questions, the study of which can illuminate a shadowed side of the patent system, and provide important evidence that would allow us to better evaluate it.

Conclusion

Our systematic study of patents that receive no citations yields three principal insights. First, we find a robust U-shape pattern, whereby the percentage of uncited patents decreased between 1976 and 1996, but has been constantly increasing since then. Second, we find counter-intuitive differences in the rates of uncited patents between different technological fields, primarily drug patents and software-related patents. Third, our analysis reveals that uncited patents are negatively associated with several indications for patent quality. From the perspective of innovation policy, these findings are troubling. They raise, and reinforce, concerns regarding patent quality and ‘patent explosion’. On a more general note, this study’s systematic focus on patents that are “out of sight” opens up new avenues for future research, and demonstrates how exploration of negative information embedded in patent data can provide us with important knowledge and a deeper, more nuanced, understanding of our ecosystem of innovation.