1 Introduction

Leydesdorff et al. (2016) acknowledged four main stakeholders in research evaluation and management using bibliometric indicators: producers, bibliometricians, managers and scientists. The last one is significant to this work that focuses on mapping the scientific contribution of an internationally renowned scientist: Manuel Sandoval Vallarta (hereafter referred to as Vallarta).Footnote 1 Our research included one more element: sociology, history and philosophy of science. This research line was proposed by Peritz and Bar-Ilan (2002) as an extension of the interdisciplinary nature of bibliometrics-scientometrics. We used three computational tools to map Vallarta’s production: Network analysis to determine how his fields of interest changed over time from 1924 to 1962. Bibliometric analysis to clarify the areas of the most cited articles including forward and backward citations. Finally, we incorporated a tag cloud and bigram analysis to ensure correlation with previous analysis based on the abstracts of his papers.Footnote 2

Why did we choose Vallarta? He was a world prominent scientist; Nobel Prize nominee that studied and worked with the world’s most renowned physicists. He was at the forefront of the cosmic rays field for many years and his breakthroughs helped open barriers in physics. He was the most remarkable physicist in Mexico and a major supporter of science in this country as well as Latin America; he campaigned for the peaceful use of nuclear energy at national and international levels and was the first chairman of the United Nations Atomic Energy Commission (1946). When he passed away in 1977, people considered his life's work did too but the findings of this paper tell different, since the articles he published 90 years ago continue to be cited.

Historical books and articles have been published in recent years, revealing unexplored aspects of Vallarta’s life and analyzing his contribution according to the scientific scenarios of his time. However, no analysis of his academic outcomes has been produced in terms of bibliometrics, networks and data mining tools up until this article. And while our original idea was to approach his legacy without rigorous evaluation, the interdisciplinary nature of these studies granted us with information in contexts we did not expect. The elements we identified are detailed below along with The social history of science from which we began designing and developing this study:

  1. 1.

    History of science (social perspective). Vallarta's educational background and expertise. His sudden shift from quantum mechanics and relativity to cosmic rays (due a geographical coincidence). His meeting in Mexico City during a vacation with a Nobel Prize physicist who was measuring cosmic radiation there. His employment at the Massachusetts Institute of Technology (MIT), his resignation (due to World War II) and his return to Mexico.Footnote 3

  2. 2.

    Position and contribution as a scientist. Vallarta's scientific production and its international value and relevance. About his papers cited by Scopus, the top one on quantum mechanics and relativity was never quoted by himself, opposite to the three most currently cited articles on cosmic rays that he did mentioned several times. Citation to his work attests for the significance of his contributions. Still, some questions are open. His personal skills and abilities in physics.

  3. 3.

    Social, political and economic context. The impossibility of studying in England because of World War I. Favorable factors to stand out in his field: be acquainted with renowned physicists, received a two-year Guggenheim fellowship to study physics in Germany. Having leading-edge technical equipment, a worldwide unique computer, etc. Those that finished their thriving career despite the beginning of World War II. His return to Mexico (a developing country) and facing economic, institutional and scientific limits. His role as a promoter of science in Mexico and Latin America and his involvement as a diplomat in this country and abroad.

  4. 4.

    Geographical (spatial) and cultural aspects. Transportation and communication conditions that dificulted collaborations between colleagues working in the same spatial geographic environment—most foreign coauthors at MIT, most Mexican coauthors in Mexico. Political changes and cultural barriers. Competing interests.

  5. 5.

    The philosophical features. How Vallarta built his own field of study, in accordance with a knowledge flow network, these behave in a scale-free network distribution. Forward and backward citations exhibit a power-law behavior. Saam et al. (1999) coincidence in: Vallarta published a great deal of his articles in one of the best physics journals of the time, from which his most cited articles were taken.

  6. 6.

    Interdisciplinary research. When mapping Vallarta’s scientific contributions, questions inevitably arose that only their peers can answer.Two examples, what contributions did he make to the foundations of science that continue to be cited? Is Vallarta and Rosen (1932) paper related to EPR paradox? An example from the social sciences: Why was Vallarta the first chairman of the United Nations Atomic Energy Commission (1946)? These mapping tools invite to an interdisciplinary collaboration.Footnote 4

Next, we present a Vallarta´s life overview with the information required to interpret the graphs, tables and results obtained by running computer programs.

2 Background and Early Achievements in his Life

He was born in Mexico City in 1899 to a distinguished family. He completed undergraduate and PhD programs at MIT, where he was employed as teacher and scientific researcher. Between 1927 and 1929, he studied in Berlin and Leipzig, Germany, among the greatest physicists of this period. In Berlin, relativity next to Albert Einstein, electromagnetic theory with Max Planck, quantum mechanics with Erwin Schrödinger, and the exegesis of science with von Harnack. In Leipzig, he met Werner Heisenberg and Peter Debye. From 1929 to 1932, Vallarta published the results of his research, especially concerning the study of the unified theory of electromagnetism and gravitation proposed by Einstein upon which highly regarded physicists were focusing their attention (Del Río, 2018 and Mendoza, 1995). It is well known that his first contribution to quantum mechanics came up six months after this scientific discipline emerged.Footnote 5

In 1932, his academic life changed when he met Compton in Mexico,Footnote 6 who was completing a series of observations he had also made in Hawaii, New Zealand, Peru and Panama.Footnote 7 Vallarta moved from the field of quantum mechanics and relativity to the field of cosmic rays. He felt greatly drawn to this subject as he realized that it could be an observational confirmation of the ideas on the primeval atom, through which; his friend Georges Lemaître (Riaza, 2010), sought to explain the origin and expansion of the Universe.Footnote 8 Upon his return to MIT, Vallarta discussed his experience with Lemaître, and they immediately began developing a quantitative theory concerning geomagnetic effects of cosmic radiation (Mondragón, 1987).Footnote 9 Their theory was easily successful, so Vallarta quickly gained an internationally privileged position.Footnote 10 As a result, he promoted a number of international projects in this area—for example the establishment of observatories and labs—which helped bring institutions and scientists closer together across America. Especially in Latin America, he influenced the development of cosmic rays physics at both the individual and organizational levels (Minor, 2019).

In 1939, the theory developed was confirmed throughout numerous observations and was universally known as the Lemaître-Vallarta theory of cosmic radiation (Mondragón, 1999b, 116). One of the advantages of this theory is that it was extrapolated not only to the Earth’s magnetic field, but also to planets such as Mercury and Jupiter, other magnetically-fielded objects such as the Sun and other stars.Footnote 11

The same year Vallarta held one of the best posts at MIT becoming a full professor of physics. He made a name for himself as one of the greatest teachers of theoretical physics.Footnote 12 His excellent teaching and research were instrumental in MIT’s transformation from an engineering school to one of the world's leading scientific research institutions.

By 1942, he decided to return to Mexico because the United States had joined the Second World War and MIT’s technological infrastructure redirected towards it leaving out individual research.Footnote 13 There were also restrictions for those not naturalized as US-citizens (as in his case) so he tried to focus on strengthening United States’ foreign policy in the Latin American region during that time (Minor, 2019).

Vallarta came back to Mexico at a time when the physical and mathematical sciences started to grow into scientific research’s professionalization and institutionalization.Footnote 14 He did not only support the development of science and technology in Mexico, but also in other developing countries. His production dropped considerably during the time he fulfilled many national and international duties. Some of them were: President of the agency that the federal government created to promote scientific research in MexicoFootnote 15; member of the Mexican National Nuclear Energy Commission (Comisión Nacional de Energía Nuclear) (1956),Footnote 16 Mexican representative at the United Nations Atomic Energy Commission (which he led in 1946) and the International Atomic Energy Agency, etc.Footnote 17 President of several international commissions on cosmic radiationFootnote 18; chairman of the United Nations Atomic Energy Commission (1946); chairman of the Bose Institute Reviewing Commission in Bombay, India (1948); member of the founder Committee for the Advanced School of Theoretical Physics of the University of Trieste—with Oppenheimer, Weisskopf, Bohr, Soloviev y Abdus Salam—to help first-rank theoretical physicists in developing countries (Chronicle 1965), among other positions.

Throughout the last three decades of his life, one of Vallarta’s main priorities was to channel nuclear energy use for best and peaceful purposes, so he took active participation in national and international forums. During those years, he also published various essays regarding science’s impact on society and how it had become a political, industrial, and military resource. In general terms, he would refer to the moral responsibility scientists had to face regarding the application of their discoveries (Gall, 1987b).Footnote 19

Vallarta died in Mexico City on April 18, 1977. This unfortunate event meant a great loss to the Mexican scientific community. In honor of his memory and legacy as an international scientist and promoter of Mexican science, physicists Alfonso Mondragón (then a senior professor-researcher at UNAM’s Institute of Physics) and Dorotea Barnés; edited Vallarta's full production in the book Manuel Sandoval Vallarta. Obra científica (858 pages). It was issued in 1978 to commemorate the anniversary of his passing. They also obtained the license of all journals where Vallarta had published (Mondragón & Barnés, 1978).

From this book we collected the data of the backward direct citations of his work to carry out the bibliometric analysis, and the abstracts to be analyzed with data mining. The data was completed with forward citation tracking using Scopus, one of the largest and abstract citation databases in the world in the research field, which is highly reliable due to peer-review.Footnote 20 Tag cloud analysis was performed using software designed and developed by Gustavo Carreón, one of the authors of this paper. This work contributes to the recent efforts in México to analyze scientific production in the field of physics (Del Río et al., 2020, Lancho-Barrantes and Cantú Ortiz, 2019).

3 Bibliometric Analysis of Vallarta’s Scientific Work (1924–1962)

Our information source to create the backward citations database was taken from the Mondragón and Barnés book Manuel Sandoval Vallarta. Obra científica (1978). It was divided into 3 sectionsFootnote 21:

  1. 1.

    The papers in which Vallarta published the results of his original research (national and international journals).

  2. 2.

    Essays and lectures on scientific subjects.

  3. 3.

    Vallarta´s curriculum vitae.

The first section in chronological order from 1924 to 1962.

We used the information contained in the first section to conduct an analysis of his scientific research. We built the historical network database incorporating each record one by one, because there isn´t exist a digital version. This was a slow and meticulous task that required the citation revision of each Vallarta´s article. Figure 1 show the journals in which he published the largest number of papers: Physical Review, Journal of Mathematics and Physics, Nature, and Proceedings of the National Academy of Sciences USA.

Fig. 1
figure 1

Source: Developed by the authors using data from Mondragón and Barnés (1978)

Journals Publishing Most of Vallarta’s Papers.

Figure 1 demonstrates that the Physical Review journal published most of Vallarta’s papers consistently throughout his scientific career. This journal widely disseminated the results of his research pertaining to relativistic and quantum physics and to cosmic rays. During the final years of his career in which he again gained interest studying gravitational theory; Physical Review, continued to publish his work.Footnote 22 This journal also describes the contribution that Vallarta’s colleagues in Mexico made to the study of cosmic rays.

The distribution of papers published by Vallarta from 1924 to 1962 is shown in Fig. 2. Periods of great productivity can be appreciated. The first one alludes to a period in which he had an interest in quantum mechanics and relativistic physics (until 1932), whereas the second one refers to his contributions to the study of cosmic rays—each period presents a maximum of productivity. A reduction in the amount of papers can be noted during the 1940s. This coincides with the World War II and his return to Mexico to carry out tasks as a government official to develop science, technology, and education. Becoming a science diplomat attending at international forums, particularly those related to developing nuclear energy.

Fig. 2
figure 2

Source: Developed by the authors using data from Mondragón and Barnés (1978)

Papers Published by Vallarta from 1924 to 1962.

This section presents the results of the bibliometric analysis using Scopus database,Footnote 23 referring to 36 papers that Vallarta published between 1926 and 1958. The results are shown in Table 1.

Table 1 Papers Published by Vallarta et al. with the Largest Number of Citations to Date (more than 4)

Source: Search in Scopus database in April, 2022.Footnote 24

Table 1 shows that the four papers with the largest number of citations were published in the Physical Review journal; one in 1932, one in 1933, and two more in 1936.Footnote 25 This, during his early stage as a researcher. He had already received his PhD from MIT in 1924. After that, he traveled to Europe to continue his studies, and in 1929 he returned to MIT as a faculty member. Vallarta’s h-index (provided by Scopus), corresponding to 5 out of 36 documents with 218 citations, should also be noted.Footnote 26 We have no opinion on this value as we agree with Leydesdorff et al. (2016) about the difficulty to interpret it.

It is known that Vallarta was a great admirer of Einstein’s theory of general relativity and a severe critic of his ideas about the unified field, since he considered that a theory of that kind should be first developed for gravitation and electromagnetism. By the end of the 1920s, he published some papers about this issue (one with Wiener). In 1932, one with Nathan Rosen, “The relativistic Thomas–Fermi atom,” which benefitted Rosen as it bridged his connection with Einstein at Princeton. Years later, Einstein, Podolsky, and Rosen published the famous paper concerning quantum mechanics (Moshinsky, 1987, 47). Table 1 show that until 2022 this was the paper with the largest number of citations.Footnote 27

Figure 3 indicates the number of citations the three most cited art icles received per year until May 6, 2022. Figure 3a corresponds to Vallarta and Rosen (1932). Figure 3b relates to second paper with the largest number of Vallarta citations that was co-authored with Georges Lemaître in 1933—Lemaître and Vallarta, “On Compton’s Latitude Effect of Cosmic Radiation,” Physical Review, 1933, 43, 87. Contrary to the earlier paper with Rosen, this one has been the most extensively cited paper in recent times.Footnote 28 The third and last paper hereby analyzed (Fig. 3c) is Lemaître and Vallarta, “On the Allowed Cone of Cosmic Radiation,” Physical Review, 1936, 50, 493.

Fig. 3
figure 3

Source: Developed by the authors with data from Scopus in May 6, 2022

Number of Citations Received per Year: a Vallarta and Rosen’s (1932), b Lemaître-Vallarta’s (1933), c Lemaître-Vallarta (1936a).

Vallarta’s papers have started to gain relevance in recent years. Figure 4 shows the annual distribution of the number of citations to the 36 papers presented in Scopus. It can be seen that those papers started to be cited in 1971 and most citations have been made since his death in 1977. It should be noted that some of them, like the three presented above, have maintained relevance.

Fig. 4
figure 4

Source: Developed by the authors using Scopus data on 13 May 2022

Distribution of the Number of Citations to Vallarta’s Scientific Work per Year. (By May 2022, Vallarta had already been cited)

The peak observed in 2016 is because L’Annunziata (2016) cites 20 Vallarta´s articles in his book on Radioactivity.Footnote 29 Sometimes these events raise the papers citations. If this data is clustered by decades, growth becomes (program auto-adjustment shows a quadratic curve, until 2019) as in Fig. 5. Fig. 6 shows the selfcites:

Fig. 5
figure 5

Source: Developed by the authors using Scopus data on January 15, 2020

Number of Citations that Vallarta’s Scientific Work Has Received per Decade.

Fig. 6
figure 6

Source: Developed by the authors using data from Mondragón and Barnés (1978)

The number of references that Vallarta cited in his papers (blue), the number of Vallarta’s papers cited by himself (red).

We can see that Vallarta’s articles cited by himself in cosmic rays are also the most cited internationally (see Table 2). However, his article with the most citations reported by Scopus (on relativity and quantum mechanics) was never cited by Vallarta. There are three other articles that he never quoted, which are in the top 10 of Scopus: Vallarta (1933), Forbush-Gill-Vallarta (1949) and Barajas-Birkhoff-Graef-Vallarta (1944).

Table 2 Most frequent Vallarta’s articles cited by himself vs most cited Vallarta´s articles in Scopus data base

Our results agree with Saam et al. (1999) who claim “the higher the reputation of the scientist, the quality of the article, or the quality of the journal the article was published in, the higher the probability of receiving citations more frequently”. Vallarta was a much referenced cosmic rays science leader in the 1930s.

4 Analysis of Vallarta’s Scientific Production (1924–1962)

Analytical network methods are significant to model and analyze indirect effects (direct effects can be analyzed using classical statistics). By calculating the parameters, we obtained information about the system as a whole (Zweig, 2016). Connections are important, but also the segregated network, because it reveals levels and concepts of organization that aren’t easily found otherwise (Newman, 2010). Temporal fluctuations in the network, called dynamic networks, also provide information about the behavior of the network structure (Barzel & Barabási, 2013).Footnote 30

In this section, we studied the structure of the networks to analyze how the backward citations evolved each time Vallarta published an article. We found very significant changes between 1932 and 1933, and between 1939 and 1940. 1932 was the year he turned into cosmic rays, a radical departure from quantum mechanics and relativity. We observed the formation of four new networks; two of them are the most cited articles today. On the other hand, cosmic rays productivity peaked in 1939, and the following year, a sharp decline. Next, we will explain the database construction (backward and forward citations).

As already mentioned, our backward citation database was built on the basis of Mondragón and Barnés (1978). In total, there are 51 Vallarta's articles and 512 articles he quoted from 1924 to 1962.Footnote 31 The database information for forward citations is from Scopus (references looked up in May 2022) and is 36 documents containing 218 citations. Both databases were organized by authors' name and year of publication. Vallarta’s articles added the letters a, b, c, d, and e when there were several published in the same year. In this way, each item is completely distinct from the other items. An example is shown in Table 3. Networks were created with the Cytoscape 3.7.2 program.

Table 3 An example of the structure of forward and backward citation database entries

To analyze Vallarta's scientific research, we used a directed network with the references that Vallarta cites in his research as nodes.

The structure of the network is shown in Fig. 7. The three networks represent the evolution of his publications from the beginning (1924) to 1930 (Fig. 7a), there were none in 1931, 1932 (Fig. 7b) and 1933 (Fig. 7c). We added the amplification of a network section to observe its structure (Fig. 7d). From an early stage in his career, Vallarta worked alongside brilliant physicists of the time in the fields of quantum mechanics and relativity. In 1932, Vallarta published his last two papers on these areas and both are outside the central network (Fig. 7b). The largest network represents his final dissertation co-authored with Nathan Rosen, “The Relativistic Thomas–Fermi Atom” is the most cited work to date. It can be interpreted as a paper in which Vallarta and Rosen propose new ideas that deviate from the accumulation of traditional citations. That's why they are now gaining relevance.

Fig. 7
figure 7

Source: Developed by the authors with data from Mondragón and Barnés (1978)

Networks Representing Vallarta’s Citations from 1924 to 1930, 1924–1932 and 1924–1933. . (Attribute circle layout network (Cytoscape 3.7.2 program). For achieving a better visualization of the network evolution from 1930 to 1933, in this graph the quote from Bush (1931) was omitted. Here, Vallarta thanks Professor V. Bush and his assistants for the support with the integration of the differential equation, “a machine for the solution of differential equations developed at this Institute by V. Bush”. For this reason, the network belonging to Vallarta-Rosen1932b (the most cited paper) is integrated into the cosmic rays network, as will be seen in the following graph. The Bush article title is “The differential analyzer. A new machine for solving differential equations”)

As mentioned earlier, after meeting Compton in Mexico in 1932, Vallarta decided to pursue cosmic rays research. Compton (Nobel Prize winner in 1927) was in controversy with Robert Andrews Millikan, (Nobel Prize winner in 1923) who had a different idea of the origin and composition of cosmic rays. This debate attracted the attention of the international scientific community as well as the general public. Thanks to Vallarta, Lemaître (who knew Millikan and with whom he had discussed his work) decided to step in and they both came up with such important contributions on the matter that their papers were among the most extensively cited ones of that time and even today.

As it can be seen on Fig. 7b and c, Vallarta completed his first field and opened a new one in a remarkable way. Both the last article in the first field and the first one in cosmic rays contain references that are completely different from the ones he used. The last publication in the first area is the most cited, and the first paper on cosmic rays is the second cited so far. This paper is entitled “On Compton’s Latitude Effect of Cosmic Radiation” was co-authored with Lemaître and published in January 1933. This is surprising given that the paper with Rosen had been published a few months earlier in August 1932. These two articles contain new discoveries that are still referenced in the international scientific community.

Figure 8 shows the network of authors that Vallarta cited in his research paper (backward citations). We can see two stages of his scientific trajectory. The network in Fig. 8b shows that during his first stage as researcher, he cited a large number of authors and very little of his own research. He later made his own contribution, this paper was outside the circle of references he had been consulting for years. The Fig. 8a highlights his growth in the field of cosmic rays until 1940, the year in which he stopped publishing for four years and the United States participated in Wold War II. This situation resulted in Vallarta’s successful productivity decay since he could no longer make use of the computer that was directed toward military projects. In Mexico, a developing country, there was no such equipment, it was impossible to buy or construct because the physics and engineering research was beginning. When coming back to Mexico, Vallarta had to seek a different way of doing research. It was not until 1944 that his next paper was published, and the last one in 1962.

Fig. 8
figure 8

Source: Developed by the authors with data from Mondragón and Barnés (1978)

Comparison between Vallarta’s Two Periods of Scientific Productivity According to his Citations. . (Circular layout network (Cytoscape 3.7.2 program). The network in Fig. 8a is linked to the network in Fig. 8b only through the article in Bush (1931) as an acknowledgement for having the differential analyzer at his disposal to perform calculations)

The separation between the first and second network (Figs. 7 and 8 respectively) demonstrates that the set of references Vallarta used in his research during the first years of his scientific productivity (without considering the books he cited) was different from the fields in each network. It is worth mentioning that we changed the layout of the network in Fig. 8 with respect to Fig. 7 because the references cited once by Vallarta can only be viewed in the external domain, while references cited multiple times in different works can be seen in the internal area. Later, Table 4 will compare the parameters obtained from the Cytoscape program (3.7.2).

Table 4 Representative parameters of the network (backward and forward)

Vallarta published almost two dozen papers on quantum mechanics and hardly cited his own work in almost one decade. On the other hand, he published almost three dozen papers on cosmic rays in almost two decades. Evidently, he dedicated more years to cosmic rays. It explains the higher values of network parameters in the field of cosmic rays (see Table 4). Figures 9 and 10 amplify a section of each of the networks in Fig. 8. It is clear that the citation density of the inner region of the cosmic rays network is higher than the citation density of the first field.

Fig. 9
figure 9

Source: Developed by the authors with data from Mondragón and Barnés (1978)

Amplified section from Quantum Mechanics and Relativity network. 1924–1932 and 1944–1945. . (Circular layout network (Cytoscape 3.7.2 program)

Fig. 10
figure 10

Source: Developed by the authors with data from Mondragón and Barnés (1978)

Amplified section from Cosmic Rays network 1933–1940 and 1947–1962. . (Circular layout network (Cytoscape 3.7.2 program))

Figure 11a represents the citation network both backwards and forwards. The amplification of a region is included to make an approach towards the network structure (Fig. 11b). The node in the network with the most connections is Vallarta's article with the most citations.

Fig. 11
figure 11

Source: Developed by the authors with data from Mondragón and Barnés (1978) and Scopus

The Backward and Forward Citation Network From Vallarta’s papers (1924–1962). . (Vallarta's papers are in red, backward citations in blue and forward citations in green)

The values for the backward, forward, and total network Cytoscape parameters (undirected) are shown below. The backward network values tend to be higher than the forward ones. This is because the network is larger in its original structure. Its clustering coefficient is non-zero. That means there is an intersection between Vallarta's works. For example, he self-quoted. The forward network, presents a disconnection between his works, because authors who cited him did not cite each other. The entire network (forward and backward) has different structure and the relationship is not linear. There are even more connections between Vallarta's papers to the ones in the outer region.

Three different network distribution types can be identified:

  1. i)

    The Poisson distribution model (or Erdös-Réngy model) characterized by the fact that its nodes are uniformly distributed;

  2. ii)

    Exponential distribution, produced in networks in which each new node added has the same probability of being linked than the others (equal link); and

  3. iii)

    Scale-free distribution. The distribution between nodes does not occur equally, which is why Barabasi (2014) referred to it as preferential attachment. The new nodes added to the network will preferentially connect to the already existing nodes through a larger number of connections. This corresponds to Vallarta's productivity behavior.

Figure 12 shows the distribution of nodes of a network (backward and forward citation together) made of 499 nodes and 727 edges. The degree of the node is plotted on the x-axis on a logarithmic scale and the frequency on the y-axis is also plotted on the logarithmic scale. The in-grade and out-grade of each node of the directed network were added. According to the linear regression analysis performed in the database, the results indicate that there is a scale-free distribution because the distribution can be approximated by a line with a slope between − 1 and − 2. To be exact, y = − 1.18x + 1.8. The highest frequency (that is, the ones that appear the most in the network) is the node of degree 1 and appears 367 times. Nodes of degree 2 appear 48 times. Nodes of degree 3, 16 times. Nodes of degree 4, 12 times, and so on. This behavior corresponds to the scale-free distribution of general knowledge-production. This is in accordance with a knowledge flow network (Valverde et al., 2007).

Fig. 12
figure 12

Source: Developed by the authors with data from Mondragón and Barnés (1978) and Scopus

Scale-Free Network Graph. The Backward and Forward Citation Network From Vallarta’s papers (1924–1962).

After Vallarta’s death, the scientific community looked back on some of his papers, the most relevant works in his field of knowledge. Therefore, some of them have been cited to this day. We consider that these are works that touched the frontier of knowledge and may have participated in the construction of a new paradigm.

5 Analysis of Vallarta’s Scientific Production Abstracts (1924–1962) Using Tag-Clouds

Data mining techniques are used to analyze information clusters based on a processing methodology (Han et al., 2012; Tan et al., 2005; Zaki & Meira, 2013). One of the most extensively used techniques is Knowledge Discovery in Databases (KDD). Fayyad, Piatetsky-Shapiro and Smyth (1996) KDD consists of five phases: (1) selecting a data set; (2) pre-processing; (3) transforming data; (4) applying data mining methods; and (5) interpreting and evaluating. The data’s qualitative properties enable a differentiation into: structured data, such as a traditional data table in which information has a predefined arrangement; semi-structured data, in which information has a certain structure and is often nested, having differentiated data types; and non-structured data, like texts, videos or images.

In this section, the non-structured data mining methodology was applied to the abstracts of Vallarta’s papers in the book by Mondragón and Barnés (1978). The abstracts became a guide to the concepts used since; in general, they briefly and concisely describe the research formulation and results. They became an important cluster for data mining and discovering new knowledge. The tag-clouds thus produced a visual representation of the list of frequently used words constructed with specific processing criteria.

KDD methodology was used to define this data mining procedure. Algorithms were implemented in order to manipulate data in PHP language and a PostgreSQL data management system was used for storing and retrieving information. This system has been tested in other contexts such as the analysis of tweets and journalistic texts (Piña et al. 2018). The methodology can be described as follows:

  1. 1.

    Selection. The initial set of data was defined. In this case, the abstracts of 58 papers. They were digitalized from the original source and stored in a database for processing.Footnote 32 The length of the abstracts is between 213 and 2446 words with an average length of 1048 words.

  2. 2.

    Pre-processing. The texts were subjected to a cleaning process, eliminating punctuation, apostrophes, mathematical formulas, Unicode graphic symbols, and also special and escape characters. All the letters were transformed into lower case and accents, umlauts, and any diacritical marks on the base letter were removed.

  3. 3.

    Transformation. A general word count was carried out, ordering the words in descending order of length. As a result, word-frequency pairs were obtained. In this type of analysis, it is commonly observed that the words that occur more frequently are “empty words,” i.e., words such as articles, prepositions, conjunctions, and relative pronouns that by themselves, have no meaning. Nouns were observed to be among the representative words, which are of importance within the context of scientific work.

  4. 4.

    Method Application. Based on the ordered list, words were discarded according to a dictionary of “discarded” words, mainly consisting of “empty words” and words that contributed minimum information within the research context. With the first remaining “n” words, a list was constructed in order to form a tag-cloud.

  5. 5.

    Evaluation. Once the tag-clouds were completed, the obtained information was interpreted.

Figure 13a shows the global result obtained from the tag-clouds, ordering the 100 most representative words alphabetically. Figure 13b and c shows the results for each field. A word’s size and color visually represent the frequency in which it appeared in the 58 abstracts; for the sake of information, the sub index represents the absolute value of their appearance.

Fig. 13
figure 13

Source: Developed by the authors with data from Mondragón and Barnés (1978)

Tag-Clouds of the Abstracts of Vallarta’s Scientific Production. a All the Period from 1924 to 1962. b Cosmic Rays. c Quantum Mechanics and Relativity.

This methodology applied within a specific historical context automatically evidences that Vallarta’s main interest focused on cosmic rays throughout the 38 years he devoted to research. With regard to author names, the tag-clouds show that in Vallarta’s initial stage as researcher he referred more frequently to the theories of renowned physicists in the field of relativity and quantum mechanics. However, his attention later shifted to developing his own cosmic radiation theory. Figure 14 presents Bigram Tag Cloud analysis.

Fig. 14
figure 14

Source: Developed by the authors with data from Mondragón and Barnés (1978)

Bigram Tag-Clouds of the Abstracts of Vallarta’s Scientific Production.

The Bigram shows two large (length and number) cores of conceptual content. The left corresponds to cosmic rays, and the right corresponds to quantum mechanics and the theory of relativity. Both are integrated by the content of the general theoretical concepts of physics. The pair of isolated concepts mainly corresponds to cosmic rays, as this was his longest-lasting research field. At the top of the chart, the three words "analyzer-differential-Bush" are displayed together. It explains how important the computer was to Vallarta's research. This was a device that he recognized in his abstracts, a machine unique in the world. This was a tool that prevented him from continuing his investigation when he returned to Mexico.

6 Conclusions

The mapping tools—bibliometric analysis, network theory (backward and forward citation), and tag-clouds—used in this paper to study Vallarta, demonstrate the number of citations of his work increased significantly over the last decades. Accounting quantum mechanics and relativity papers, as well as cosmic rays papers; our results showed that Vallarta completed his first field and opened a new one in a remarkable way.

Vallarta’s two most cited papers were published in 1932 and 1933, when he was a researcher-professor at MIT, (he was 33) eight years after having completed his PhD and only four years after coming back from studying in Germany. He immediately engaged in topics about the frontiers of physics; reaching out to other fellow scientists, attending international events of great relevance, and publishing papers for important outlets, soon achieving recognition for his contributions. Moshinsky states that Vallarta’s first publications about quantum mechanics appeared a few months after this field emerged (Moshinsky, 1987, 46) so it is worth mentioning his most cited paper corresponds to this subject of study and was published early during his productive activity (“The Relativistic Thomas–Fermi Atom,” 1932). Similarly, Vallarta’s second most cited paper corresponds to one of his leading papers on cosmic rays (“On Compton's Latitude Effect of Cosmic Radiation,” 1933). The third most cited one was published in 1936, “On the Allowed Cone of Cosmic Radiation”. All three of them came out in Physical Journal, one of the most reputed physics journals at the time.

Using network theory to analyze the first and second papers mentioned above, initially they were found to be outside the main network, thus representing a new insight outside the usual discussion. It can be interpreted that´s the reason why they continue to receive the largest number of citations. An interesting result of this work is that “The relativistic Thomas–Fermi atom,” was never cited by Vallarta but his three most currently cited articles on cosmic rays; were the most cited by himself in his own papers. These results showed his achievements breaking through the frontiers of physics.

The network structure confirmed Vallarta’s evident and radical academic turn in 1932 from quantum mechanics and relativity to cosmic rays (in which the specialized literature was entirely new). It showed no intersection between the two domains that literature surpassing the edge of physics knowledge, had done yet. One interesting fact we found is that Vallarta cited a lot of his own papers on cosmic rays, as he was a pioneer in the field, unlike the ones on quantum mechanics and relativity. Tag-cloud analysis indicated he was more interested in cosmic rays, having published a large number of papers in that domain. It also revealed that Einstein and Schrödinger were the scientists he mentioned most often, suggesting that; early in his career, he referenced theories developed by other physicists but soon he began formulating his own theory.

Bigram displays two large cores of conceptual content representing Vallarta’s fields of study. The isolated concepts correspond mainly to cosmic rays, his longest-lasting research field, the words "analyzer-differential-Bush" also appeared showing how important a computer for his research was (only a few worldwide, none in Mexico). Mexican economy made it impossible to purchase such expensive technological equipment. It could not be built because physics and engineering research were just beginning. The three tools also allowed us to visualize how his productivity decayed after 1940, and the years after he came back to Mexico to promote the development of science, technology, and education, as well as becoming a science diplomat and setting aside the scientific production.

Definitely, World War II had an adverse effect on Vallarta’s work, which was then at its peak. The computer equipment necessary for his calculations was redirected to military projects so he lost access to it. This situation might have persuaded him to reconsider returning to the field of relativity, since the first paper that came out when he resumed publication in 1944 focused on this theme and was a collaboration with Barajas, Birkhoff, and Graef. But they never referenced the most cited article today, Vallarta and Rosen (1932).

Finally, the results of the backward and forward network analysis revealed a scale-free distribution confirming that the way in which Vallarta constructed knowledge is parallel to the way knowledge has been constructed throughout history (in accordance with a knowledge flow network). Forward and backward citation presents a power law behavior. His articles featured in the best research journals of the time are the most cited. The interdisciplinary nature of the mapping studies of scientist's contributions using scientometric tools showed an interaction between economic, political and social elements. Many questions arose from these studies whose answers require the formation of interdisciplinary research teams. Their results can help us understand the process of knowledge production. We are confident that the methodology used in this paper will help mapping the contributions of other scientists.