Introduction

The field of scholarly publishing encountered the open access phenomenon [3] at the beginning of the twenty-first century, and the movement has been fueled by both technological revolution and scholarly pressure [19]. Due to widespread internet penetration, the growing prevalence of web-based knowledge transfer services and the communication revolution [2, 10, 23], new information technology has provided an opportunity for developing innovative and enabling open access publishing models [42]. Over the space of a few years, this communication revolution has brought a real paradigm shift in the sharing of academic knowledge, and the internet has provided new platforms for both the production and the dissemination of knowledge [15]. While open access publishing models first emerged as experimental models, it took only a few years for them to become mainstream approaches to delivering scientific knowledge [37]. Over the last two decades, legions of open access models have been developed, from which the most popular models are the gold, hybrid, green and diamond models [21]. While it is characteristic of all open access publication forms that the materials published are free to access and read online [22], there are fundamental differences between open access models regarding other aspects of publication. In the case of the golden model, all papers are published with open access and authors (or their institutions) must pay an article processing charge that varies by disciplines, publishers, geopolitical locations and even the prestige of the journals [13]. Green open access models [5, 29] allow authors to upload their papers to freely accessible research depositories (typically, after an embargo period), while hybrid models generally publish in a classical, prescription-based model, but, if they pay an author processing charge, authors can choose the open access option as well [4, 30]. Finally, diamond open access models provide free open access to both authors and readers [12, 16, 35].

The history behind open access has been widely discussed in the literature [17, 25, 26, 38]. After World War II, the Western world (especially the U.S.), experienced a significant growth of government investment in academic research that led to an unprecedented increase in the number of scholarly publications [27]. The skyrocketing number of research papers, despite the communication revolution enabling research papers to be made available online at considerably lower cost, was accompanied by an exponential rise in publication purchasing prices [1, 7, 20]. As a consequence, several institutions, associations and new initiatives started to argue that the traditional academic publishing industry should be dramatically reformed, and new technological innovations should be used to promote open science built on free access to scholarly knowledge.

The early 2000s were the beginning of the so-called open access era [11]. In 2001, the Open Society Institution convened the Budapest Conference with a focus on the possibilities of open access publishing [15], which was followed by the Budapest Open Access Initiative statement in 2002 [26]. Basically, the Budapest Statement considered the gold and the green open access models, and the strategies of this initiative were followed by many subsequent statements such as the Bethesda Statement on Open Access Publishing in 2003, the Berlin Declaration on Open Access to Knowledge [32], or the initiatives of the Open Access Scholarly Publishers Association in 2008 [26, 43, 44]. Hybrid open access journal publishing deals were concluded with major publishers such as Springer Nature and Wiley though the Project DEAL, and more recently the Plan S initiative [38] requires all publicly funded authors to publish their results on an open access platform [28, 38]. Specifically, in the European context, open access publishing has been part of the academic agenda since 2006, and the Lisbon Treaty (2007) clearly emphasized the European Union’s commitment to open access publishing. The Horizon 2020 program made it mandatory to publish publicly funded research open access, project proposals submitted to the European Research Council’s tenders should indicate the planned open access costs, and Science Europe [39] drafted minimum standards for open access publishing services [26].

Underlying the abovementioned strategies and policies, there is a seemingly self-evident conviction of the academic community to the effect that open access publication maximizes the benefits of scholarly research for both the academic field and the public [22, 45]. Economic, social and academic benefits are extensively discussed in the literature [22], with a specific emphasis on the growth in readership and impact due to open access publishing [40, 41]. Addressing the most important public debates on open access, Kingsley [24] analyzed several critical narratives around open access, from which most criticism relates to the quality of open access journals. Kingsley argued that—barring clear-cut cases of malpractice by predatory journals—open access journals can be of excellent quality and authors need not necessarily choose between esteem and sharing, or between quality and accessibility; indeed, many authors publish their best works in open access journals. As contrasted with several authors who found open access journals to be less prestigious or even that they engage in unethical practices [6, 8, 33], Kingsley argued that the quality of a journal is defined by its editorial policy and authorship, not its publishing model [24].

Others argue that, beyond the question of quality, disciplinary and geopolitical differences also have an impact on open access policies. On the one hand, health-related disciplines such as biomedical sciences have a considerably higher percentage of open access publication than other academic fields [34, 36], due to funders’ mandates and high public interest [9, 22]. On the other hand, there are geopolitical regions such as Ibero-America where open access publishing has a long tradition and where it is part of the national academic strategy. According to Scimago, the number of Ibero-American journals has increased from zero to 45 over the past 20 years, and the share of Ibero-American journals in Scopus is now slightly over 10% in several scientific fields. Moreover, the pace of the growth of Ibero-American journals is considerably faster than the overall growth of the field [14].

Despite the legion of research papers that focus on the history and growth of the open access publishing system, and the many analyses of the publishing patterns in specific countries and disciplines, we still lack a coherent picture of the global development of open access journals over the last two decades. Moreover, we have only limited empirical evidence on how the prestige factors of journals relate to open access publishing trends. In addition, we still need a comprehensive analysis that aims to explore existing differences between various disciplines and geographical locations in terms of their open access patterns. In accordance with these specific knowledge gaps, our present study aims to analyze the development of open access publication over the last two decades across nine disciplines and seven world regions. In order to explore these existing knowledge gaps in the literature of open access publishing, we developed three research questions.

RQ1 What is the general trend in open access publishing over disciplines between 2000 and 2019?

RQ2 What is the general trend in open access publishing over disciplines among the most prestigious (Scopus Q1 ranked) journals between 2000 and 2019?

RQ3 What is the general trend in open access publishing over disciplines in different geographical regions between 2000 and 2019?

Methods

To answer our research questions, we collected data from Scimago Journal and Country Ranking that works with information from Elsevier’s Scopus, the most inclusive international database of scientific publications. To analyze the global, regional and disciplinary distribution of open access journals over time, we have selected nine eminent subject areas, three time periods and seven world regions for further analysis. Time periods were selected in order to follow the development of open access publishing over the widest possible range, as Scimago currently offers data on journals between 2000 and 2019. Disciplines were selected in order to cover the highest possible variety of scholarly fields. Finally, world regions were selected as they are offered by Scimago. This categorization relates to geographical position instead of geopolitical or economic areas, thus Asia includes both developing regions as Malaysia or India as well as countries with developed economies such as Japan or Taiwan. Similarly, despite their strong academic relations to the U.S. and Europe, both Turkey and Israel fall into the group of Middle Eastern countries. Russia is considered as an Eastern European country, as well as some Central Asian countries such as Azerbaijan and Georgia, while Kazakhstan and Uzbekistan are considered as Asiatic countries (Table 1).

Table 1 Geographic and disciplinary construction of our sample

First, in order to catch global trends and to answer our first research question, we calculated the number of Scopus-indexed journals in each benchmark year in each discipline, and we calculated the amount of open access journals as well. In Scimago, the only journals considered as open access are those that provide either diamond open access publication (meaning that both publishing and reading papers is free of charge) or gold open access publication (with an article processing charge that unlocks access to published papers), whereas journals with hybrid, or green open access models are not considered as open access journals. Unfortunately, Scimago does not disclose which open access model a journal should use in order to be categorized as an open access journal. As a consequence, we had to test manually whether journals with diamond, gold, green or hybrid open access policies were categorized as open access journals. After running several checks on different fields, we found out that Scimago categorizes journals as open access journals only if open access publication is mandatory in them—whether for free, as in the case of the diamond models, or in exchange for author processing charges, as in the case of the gold model.

In line with earlier studies that found an association between journal prestige and the likelihood that the journal will use open access publication, we also calculated the number and proportion of open access journals in the first quartile (Q1) of the Scimago list that contains the top 25% of journals in each discipline. Every year, Scimago conducts journal assessments and categorizes them into four quartiles on the basis of Scopus citations and the average citations per paper quotient [31].

To answer our second research question, we calculated the share of open access journals in each discipline and geographical location for each benchmark year. A minimum number of published journals was necessary for meaningful calculations; thus, when the number of journals in a given world region was less than ten, we indicated that the open access share for this specific discipline, world region and time period should be interpreted critically. Typically, this occurred in the case of Africa, the Middle East and Latin America, where sometimes only three or five Scopus-indexed journals were published in a given discipline.

Results

Table 2 shows the results of our calculations in detail. In 2000, the share of open access journals was under 10% in each discipline, with an average of 4.3%. This grew to 15.3% in 2019, meaning that the share of open access journals has tripled over the last two decades. However, the exact proportion of open access journals within the set of all the journals varies across different disciplines, with neuroscience having the highest open access ratio, and computer science with the lowest. Besides significant differences, the share of open access journals increased significantly from 2000 to 2019 in all the disciplines analyzed.

Table 2 Proportion of open access journals across disciplines in our three benchmark years

However, the growing prevalence of open access journals followed different trends in each discipline (Fig. 1). The most striking growth was seen in neuroscience: in 2019, the share of open access journals was five times greater than two decades earlier, and now the proportion of open access journals is over 25%. A similar pace of growth was characteristic of economics, where the share of open access journals was 17.3% in 2019, as contrasted with the 2.8% in 2000. Generally, we can see three trends. First, there are disciplines in which the most striking growth happened between 2000 and 2010, as in the case of neuroscience and psychology. Second, there are scholarly fields where the most important emergence in the proportion of open access journals occurred in the last ten years, like in computer science, economics, engineering and social sciences. Finally, in arts and humanities, business, management and accounting, physics and astronomy, the pace of the evolution of open access publication models has been consistent over the last two decades.

Fig. 1
figure 1

Development of the proportion of open access journals, 2000–2019

To address our second research question, we further analyzed our sample to look for publication patterns among the Q1 journals. Within the set of Q1 journals, it is only in the field of neuroscience that the emergence in the number of Q1 journals is equal to the emergence in the number of all open access journals (Fig. 2). Indeed, the share of open access Q1 journals is 26.5% in 2019, which is even slightly higher than the share of all open access journals in neuroscience. However, in other disciplines, the growth of highly ranked open access journals underperforms the general growth, and in four of the nine disciplines, the share of open access Q1 journals from all Q1 journals has not exceeded 5%. This difference between the general trend and the trend of the top ranked journals shows that open access publication is more characteristic of lower ranked journals.

Fig. 2
figure 2

Development of the proportion of Q1-ranked open access journals, 2000–2019

Moreover, there is a strong, statistically significant correlation between the proportion of open access journals among all the published journals and the proportion of Q1 open access journals (r = 0.889 p < 0.001), and we found a significant regression equation predicting the proportion of Q1 open access journals based on the proportion of open access journals in the set of all published journals (F(1, 28) = 105.995, p < 0.001), with an R2 of 0.791. However, the proportion of open access journals in the Q1 quartile was lower than the general proportion of open access journals in each discipline and time period, and the growth in the proportion of open access journals was also lower than in the general trend (Table 3).

Table 3 General proportion of open access journals and proportion of open access journals in the Q1 quartile

To address our third research question, we analyzed the geographical distribution of open access papers over three benchmark years and nine disciplines. We found that the overall proportion of open access journals was the lowest in Northern America and Western Europe, while in Latin America, the open access model of publishing is the standard. The general trend also shows that the share of open access journals increased significantly in every world region (Fig. 3).

Fig. 3
figure 3

General trends in open access publications broken down by world regions

Significant disciplinary differences were found when we calculated the proportion of open access journals across regions and disciplines (Table 4). The share of open access journals in North America is still minimal in seven out of the nine disciplines, and there are two scholarly fields in Western Europe with less than 10% of journals being open access. It is followed by Asia with a considerably higher frequency of open access journals in all disciplines, while in Latin America the proportion of open access journals varies from 70 to 90% across disciplines. Apart from those cases where the number of journals was not sufficient to calculate open access ratios, the share of open access journals increased significantly over time across all locations and disciplines.

Table 4 Proportion of open access journals over disciplines, geographical regions and benchmark years

Since there were significant differences between the number of published journals across disciplines and world regions, we also calculated if the total number of published journals correlated with the proportion of open access journals. Indeed, we found that the total number of published journals has a significant negative correlation (r = −.420, p < 0.01) with the proportion of open access journals. To explore the exact nature of this connection, we ran a linear regression predicting open access proportion from the number of published journals. A significant regression equation was found (F(1, 187) = 40.073, p < 0.001), with an R2 of 0.176. Thus, open access journals are more prevalent in those cases where the number of all the published journals is lower. However, the significant negative association between the number of journals and the share of open access journals can be explained in the following ways. First, it is possible that the number of journals published is directly related to the proportion of open access journals, and, since there are more journals in Western Europe and North America than in other parts of the world, the lower proportion of open access journals in Western Europe and North America is a consequence of their abundance of journals, and not a result of their publishing policies. Second, it is also possible that world regions differ in both the number of their published journals and in their open access policies. In order to decide which suggestion fits our data better, we analyzed whether the negative association between the number of all published journals and the number of open access journals occurs in both the world regions with many journals (Western Europe and Northern American) and in world regions with less journals (Africa, Asia, Eastern Europe, Latin America and the Middle East). As Table 5 shows, North America and Western Europe together account for approximately 80% of the journals published in all disciplines.

Table 5 Share of world regions in the total of published journals, 2019

In Western Europe and North America, there is no statistically significant relation between the number of all journals and the proportion of open access journals (p = 346), while in other world regions, there is a statistically significant negative association between the number of published journals and the proportion of them that are open access (r =  − 0.262, p < 0.01). Moreover, a statistically significant regression equation was found (F(1, 133) = 9.073, p < 0.01), with an R2 of 0.062 that predicts the proportion of open access journals from the number of all published journals. Thus, we found that the negative association between the number of journals and the proportion of open access publication occurs only in the cases of non-Western world regions, and is not characteristic of Western publishers. It follows that the ratio of open access journals cannot be explained by the number of published journals alone, and both disciplinary differences and open access policies that are characteristic of different world regions should be considered as well.

Discussion and Conclusions

In line with the relevant literature, our research found that the proportion of open access journals has shown a significant increase since the early 2000s [25, 38], and we found that this phenomenon appeared across all disciplines and world regions. Our first research question was related to the general trend of the change in the proportion of open access journals over the last two decades. We found that the need for open access publication [26] resulted in a significant growth in the proportion of open access journals up until 2010, when the share of open access journals increased from 4 to 6%, but the real skyrocketing of open access model occurred only over the last ten years. Currently, more than 15% of Scopus-indexed journals are published on an open access basis. Despite significant disciplinary differences, this general trend holds for all the research fields analyzed, and while in 2000, there were disciplines with a share of open access publication close to zero, now there are no research fields with less than 10% of journals being open access.

Our second research question was related to the prestige of journals, thus we analyzed whether there were significant differences between the open access proportion of all the journals and the open access proportion of Q1-ranked journals. We found that, with the exception of neuroscience, the growth in the proportion of Q1 journals was lower than the general trend, and the proportion of open access journals was lower than it was among all the journals in each benchmark year. We can offer at least three explanations for this phenomenon. According to the first, we assume that, since most Q1 journals are older than the newcomers in lower quartiles [18], their business models were developed before the historical period where open access became more important. Of course, these older journals usually offer hybrid, gold or green open access possibilities [13], but these are nothing more that the extensions of existing publication models [5, 12]. By contrast, both diamond open access and gold open access are fundamentally new, and thus require totally different business models [3]. In the case of the diamond models, there is typically a non-profit maintenance entity such as a university or academic association [14]. Second newer journals that typically still lack a scholarly reputation and prestige have to offer appealing possibilities that make themselves competitive alongside the older established journals in the field. As several studies demonstrate that open access publication can significantly increase both readership and impact [40, 41], and given that author processing charges in non-diamond open access models can be extremely high [13], diamond open access can be a highly appealing feature in a new journal, and one that can attract the attention of both authors and readers. Thirdly, non-Western world regions, most typically Latin America, usually have pro-open access publishing systems, but since they are relative newcomers to international academia, their journals are still ranked lower than their Western counterparts. Thus, the lower ratio of open access journals in the Q1 quartile can be also explained by geopolitical differences.

Our third research question aimed to explore the geographical differences of open access publications over the disciplines analyzed. We found that the proportion of open access journals was the lowest in the so-called developed world (with the lowest rates in North America, followed by Western Europe), with higher proportions in Asia, and Latin America having the greatest open access ratio. This general trend is characteristic of all research fields, and disciplinary differences typically occur only in Western Europe and North America, while there are no significant differences in the proportion of open access journals across disciplines in non-Western world regions. In the case of North American journals, open access ratio can be extremely low even today: only 2% of engineering journals, 3% of business, management and accounting journals and 4% of psychology journals were published on an open access basis in 2019. The proportion of open access journals is significantly higher in other areas such as social sciences (8%), computer science (12%) and neuroscience (19%). There are considerable differences between disciplines in Western Europe as well, as there are research fields with relatively low proportions of open access publication, such as psychology (7%), or business, management and accounting (9%), and disciplines with high open access ratios like computer science (21%) and neuroscience (29%). In sum, we found that disciplinary differences in open access ratio occurs only in the Western world, which means that we should consider both geography and discipline in the explanation of open access trends. Thus, our model that aims to interpret open access trajectories over different disciplines, world regions, prestige factors and time periods should consider the following variables (Table 6).

Table 6 Our model for the interpretation of open access publication trends

Considering them as vectors that affect the proportion of open access journals, our model offers these five variables to interpret existing open access trends, from which three (but not the same three) variables apply to: (a) all regions, (b) Western and (c) non-Western geographies (Fig. 4).

Fig. 4
figure 4

Vectors explaining open access ratio on the general field, in Western and non-Western geographies

Limitations

The most important limitation of our research is a consequence of the fact that Scimago does not use different categories for diamond and gold open access journals. Accordingly, we could not make conclusive statements on the business models themselves, for although open access publication is mandatory in both cases, the diamond and gold open access models are based on fundamentally different business strategies [4, 13]. Further research with a specific focus on the differences between the gold and diamond models should manually code a representative set of journals and apply different categories for each model. While we can reasonably assume that Western European and North American countries with giant commercial publishing houses are more likely follow the gold model, and that non-Western university-based publishers will tend to follow the diamond model, an in-depth empirical analysis should identify the exact proportions across disciplines and world regions.