1 Introduction

It has been known almost for two hundred years that the occurrence of sunspots is cyclic, although not strictly periodic. The length of the sunspot cycle (SC) has varied between 9.0 and 13.7 years. The shape of the sunspot cycle has also changed. Waldmeier noticed the asymmetry of sunspot cycles, with the ascending phase being typically shorter than the declining phase, and that there is an anti-correlation between cycle amplitude and the length of the ascending phase of the cycle (Waldmeier, 1935, 1939).

Gnevyshev (1967) suggested that the solar cycle is characterized by two periods of activity and these lead to a double peak with the so-called Gnevyshev-gap (GG) in between (Gnevyshev, 1977). Feminella and Storini (1997) studied the long-term behavior of several solar-activity parameters and found that maxima occur at least twice: first, near the end of the rising phase and then in the early years of declining phase. Norton and Gallagher (2010) analyzed the sunspot cycle double peak, and the Gnevyshev gap between them, to determine if the double peak is caused by averaging of the two hemispheres that are out of phase (Temmer et al., 2006). They, however, confirmed previous findings that the Gnevyshev gap is a phenomenon that occurs separately in each hemisphere and is not due to a superposition of sunspot indices from hemispheres slightly out of phase.

Most of the even-odd cycle comparisons have concentrated on the mutual strength of preceding cycles. These are referred as the so-called Gnevyshev–Ohl rule, which is an expression of the general 22-year variation of cycle amplitudes and intensities, according to which even cycles are, on average, about 10 – 15% lower than the following odd cycles (Mursula, Usoskin, and Kovaltsov, 2001). There have, however, been some exceptions to this rule, the last one happened between cycle pair SC22–23 (Javaraiah, 2012, 2016).

Another common subject has been the northern–southern asymmetry in solar sunspots and other activity (see some of the recent publications such as: Temmer et al., 2006; Carbonell et al., 2007; Li, Gao, and Zhan, 2009; Chang, 2011; Javaraiah, 2016; Badalyan and Obridko, 2017). Less attention has been paid to statistical distributions of the sunspots, sunspot groups, and areas. There have, however, been some studies about the spatial (latitudinal) distribution of sunspots and sunspot groups using slightly different statistical methods (Li et al., 2003; Ivanov, Miletskii, and Nagovitsyn, 2011; Chang, 2011; Munoz-Jaramillo et al., 2015; Leussu et al., 2016a,b; Mandal, Karak, and Banerjee, 2017).

Li et al. (2003) analyzed the latitudinal distribution of sunspot groups over a solar cycle. They found that although individual sunspot groups of a solar cycle emerge randomly at any middle and low latitude, the whole latitudinal distribution of sunspot groups of the cycle is not stochastic and, in fact, can be represented by a probability density function of the Gamma-distribution type having a maximum probability at about 15.5 degrees of heliographic latitude. Ivanov, Miletskii, and Nagovitsyn (2011) use in their analysis the relative latitude of sunspot groups, i.e. \(\phi _{rel}=\phi -\phi _{o}\), where \(\phi _{o}\) is the mean heliographic latitude and \(\phi \) is the actual latitude. They show that the latitude distributions of sunspots for a given year can be approximately described by the normal distribution, with its variance being a linear function of the current level of solar activity. Thus, the latitude size of the spotted zone increases with increasing activity. Chang (2011) studied the latitudinal distribution of sunspots between 1874 – 2009 by calculating the area-weighted sunspot mean value for each month (he calls it center-of-latitude, COL). He showed that the COL for both northern and southern hemisphere can be fitted with a double Gaussian distribution function.

Munoz-Jaramillo et al. (2015) use 11 different sunspot group, sunspot, and active region databases to characterize the area and flux distributions of photospheric magnetic structures. They plot the fraction of the areas or fluxes per unit area as a function of their area or flux (in millionths of hemisphere, MH), and fit different distributions, power-law, log-normal, exponential, and Weibull distributions to the these databases. They found that Weibull (six databases) and log-normal (four databases) are best-fit distributions, but none can explain both the small areas (weak fluxes) and large areas (strong fluxes). They conclude that a composite distribution with Weibull for small areas (weak fluxes) and log-normal for large areas (strong fluxes) with transition regions in between can explain best each of their databases. Leussu et al. (2016a) studied especially the latitude evolution and the timing of the sunspot groups in butterfly wings by characterizing three different categories: the latitude at which the first sunspot groups appear, the maximum latitude of sunspot-group occurrence in each wing, and the latitude at which the last sunspot group appears. They derived several statistical measures based on these variables. Mandal, Karak, and Banerjee (2017) found that the latitude distribution of sunspots does not statistically follow a Gaussian distribution. Furthermore, they found that the distribution parameters, the central latitude, width and height show some hemispheric asymmetries and significant positive correlations with the cycle strengths.

In this study we analyze the spatial distribution and evolution of sunspot areas of Solar Cycles (SCs) SC12 – 23 and sunspot groups of SC8–23 for even and odd cycles, and find the best-fit probability density functions (PDF) for these distributions. This article is organized as follows. Section 2 presents the data and methods used. In Section 3 we study distributions and PDF fits of sunspot groups for even and odd solar cycles. In Section 4 we discuss the latitudinal distribution of sunspot area sizes and migration of the sunspots over the solar cycle. We give our conclusions in Section 5.

2 Data and Methods

2.1 Sunspot Group Data

In the distribution analysis we use the data set of sunspot groups in southern and northern butterfly wings for cycles SC8–23 by Leussu et al. (Leussu et al., 2016b). This data includes the time and latitude for sunspot groups for cycles SC8–23 and is seen as a butterfly pattern in Figure 1. Note that the cycles SC21–23 have only an integer degree resolution in latitudes.

Figure 1
figure 1

Butterfly pattern of the sunspot groups used in the analysis. The vertical lines correspond to cycle maxima.

2.2 Sunspot Area Sizes

In the aerial sunspot analysis we use the database of the Royal Observatory, Greenwich - USAF/NOAA Sunspot Data for the years 1874 – 2016 (https://solarscience.msfc.nasa.gov/greenwch.shtml). This database contains, among others, time, latitude, and area size (in millionths of solar hemisphere, MH) for individual sunspots for cycles SC12–23.

2.3 Distributions

2.3.1 Rayleigh Distribution

The traditional definition of a normal distribution probability density function (PDF) is

$$ f\,(x\,;\,\mu ,\sigma ^{2}) = \frac{1}{\sqrt{2\pi \,\sigma ^{2}}} e^{ - \frac{\left ({x-\mu }\right )^{2}}{2\,\sigma ^{2}}} , $$
(1)

where \(\mu \) is the mean (or expectation) value and \(\sigma ^{2}\) is the variance. Let us consider a bivariate distribution (U, V), where U and V are normal, independent distributions with zero mean. Then we have

$$ f_{U}\,(u\,;\,\sigma ^{2}) = \frac{1}{\sqrt{2\pi \,\sigma ^{2}}} e^{ - \frac{u^{2}}{2\,\sigma ^{2}}} , f_{V}\,(v\,;\,\sigma ^{2}) = \frac{1}{\sqrt{2\pi \,\sigma ^{2}}} e^{ - \frac{v^{2}}{2\,\sigma ^{2}}}. $$
(2)

Let us define \(X = \sqrt{U^{2}+V^{2}}\). Because \(f_{U}\) and \(f_{V}\) are perpendicular, their joint bivariate normal distribution, X, has a cumulative distribution

$$ F_{X}\,(x;\sigma ^{2}) = \int \!\!\!\int _{D_{x}}\,f_{U}\,(u;\sigma ^{2})\,f_{V} \,(v;\sigma ^{2})\,dA , $$
(3)

where \(D_{x}\) is the disc \(D_{x}=\left \{ \sqrt{u^{2}+v^{2}}< x\right \} \). Integration is easier in polar coordinates, where \(dA=r\,dr\,d\theta \). Writing the double integral in polar coordinates we get

$$ F_{X}\,(x;\sigma ^{2}) = \frac{1}{2\pi \,\sigma ^{2}}\int ^{2\pi }_{0}\!\! \int ^{x}_{0}\,r e^{ - \frac{r^{2}}{2\,\sigma ^{2}}}\,dr\,d\theta = \frac{1}{\sigma ^{2}}\int ^{x}_{0}\,r e^{ - \frac{r^{2}}{2\,\sigma ^{2}}}\,dr . $$
(4)

While the probability function is the derivative of the cumulative function, we finally get

$$ f_{X}\,(x;\sigma ^{2}) = \frac{d}{dx}\,F_{X}\,(x;\sigma ^{2}) = \frac{x}{\sigma ^{2}}e^{-\frac{x^{2}}{2\,\sigma ^{2}}} , x\geq 0 , $$
(5)

which is the Rayleigh PDF.

2.3.2 Rician Distribution

The Rice distribution, or Rician distribution, is the probability distribution related to the Rayleigh distribution but with non-zero mean (Rice, 1945). The PDF of the Rician distribution is defined as

$$ f_{Z}\left (z\right )=\frac{z}{\sigma ^{2}}\exp \left ( \frac{-\left (z^{2}+\nu ^{2}\right )}{\sigma ^{2}}\right )I_{0}\left ( \frac{z\nu }{\sigma ^{2}}\right ) , r\geq 0 , $$
(6)

where \(I_{0}\) is the modified Bessel function of the first kind with zeroth order (Rice, 1945; Taluktar and Lawing, 1991). Note that for \(\nu =0\) we get the Rayleigh distribution. Note also that \(\nu \) and \(\sigma ^{2}\) are not the mean and variance of the Rician distribution, but are related to the bivariate Gaussian distribution \(x \sim \mathcal{N}(\nu \,cos\left (\theta \right ),\,\sigma ^{2})\) and \(y \sim \mathcal{N}(\nu \,sin\left (\theta \right ),\,\sigma ^{2})\) such that \(z=\sqrt{x^{2}+y^{2}}\). The mean and variance of the Rician distribution are defined by

$$ \mu = \sigma \sqrt{\pi /2}L_{1/2}\left (- \frac{\nu ^{2}}{2\sigma ^{2}}\right ) $$
(7)

and

$$ var = 2\sigma ^{2}+\nu ^{2}-\frac{\pi \sigma ^{2}}{2}\,L^{2}_{1/2} \left (-\frac{\nu ^{2}}{2\sigma ^{2}}\right ) , $$
(8)

respectively. Here \(L_{1/2}\left (\cdot \right )\) denotes a Laguerre polynomial, which can be presented as

$$ L_{1/2}\left (\xi \right ) = \exp ^{1/2}\left [\left (1-\xi \right )I_{0} \left (-\frac{\xi }{2}\right )-xI_{1}\left (-\frac{\xi }{2}\right ) \right ] . $$
(9)

2.4 Statistical Methods

2.4.1 Negative Log-Likelihood

The likelihood function \(L\left (\theta \right )\) is defined as

$$ L\left (\theta \right ) = \prod ^{n}_{i=1}f_{\theta }\left (x_{i} \right ) , $$
(10)

if variables \(x_{i}\) are independent and from the same distribution \(f_{\theta }\). The set of parameters \(\theta \) of the distribution, which maximizes \(L\left (\theta \right )\) is called a maximum likelihood estimator (MLE) and is denoted as \(\theta _{L}\). It is often easier to maximize the log-likelihood function, \(\log L\left (\theta \right )\), and since the (natural) logarithmic function is monotonically increasing, the same value maximizes both \(L\left (\theta \right )\) and \(\log L\left (\theta \right )\). Because the log-likelihoods here are always negative, we calculate instead the minimum value for negative log-likelihood (NLogL) (Forbes et al., 2011).

2.4.2 Two-Sample T-Test

The two-sample T-test for equal mean values is defined as follows. The null hypothesis assumes that the means of the samples are equal, i.e. \(\mu _{1}=\mu _{2}\). An alternative hypothesis is that \(\mu _{1}\neq \mu _{2}\). The test statistic is calculated as

$$ T = \frac{\mu _{1} - \mu _{2}}{\sqrt{{s^{2}_{1}}/N_{1} + {s^{2}_{2}}/N_{2}}} , $$
(11)

where \(N_{1}\) and \(N_{2}\) are the sample sizes, \(\mu _{1}\) and \(\mu _{1}\) are the sample means, and \(s^{2}_{1}\) and \(s^{2}_{2}\) are the sample variances. If the sample variances are assumed equal, the formula reduces to

$$ T = \frac{\mu _{1} - \mu _{2}}{s_{p}\sqrt{1/N_{1} + 1/N_{2}}} , $$
(12)

where

$$ s_{p}^{2} = \frac{(N_{1}-1){s^{2}_{1}} + (N_{2}-1){s^{2}_{2}}}{N_{1} + N_{2} - 2} . $$
(13)

The rejection limit for two-sided T-test is \(\left |T\right | > t_{1-\alpha /2,\nu }\), where \(\alpha \) denotes significance level and \(\nu \) degrees of freedom. The values of \(t_{1-\alpha /2,\nu }\) are published in T-distribution tables (Snedecor and Cochran, 1989; Krishnamoorthy, 2006; Derrick, Deirdre, and White, 2016). Now, if the value of \(\text{p}<\alpha =0.05\), the significance is at least 95%, and if \(\text{p}< \alpha =0.01\), the significance is at least 99%.

3 Latitudinal Distribution of Sunspot Groups for Even and Odd Cycles

Figures 2a and 2b show the latitudinal distribution histograms of northern and southern (absolute value) hemisphere sunspot groups with their Rician distribution fits for even and odd cycles SC8–23 in the range, respectively. It is clear that the distributions are not normal because of the cutoff at zero and positive skewness. The NLogL of the Rician distribution fits for even northern/southern wing groups are 40761/40770, while NLogL for, e.g., log-normal distribution fits are 42347/42417. Almost as good fits as Rician are Rayleigh (NLogL = 40766/40788) and Weibull (40764/40774) distribution fits. The NLogL of Rician distribution fits for odd northern/southern wing sunspots are 46803/44526, while NLogL for log-normal distribution fits are 48770/46374. The Rician distribution parameters for even northern/southern wing sunspot are \(\nu = 8.262\) (standard error, SE=0.581)/8.730 (SE=0.443) and \(\sigma = 10.338\) (SE=0.234)/9.944 (SE=0.196) and the Rician distribution parameters for odd northern/southern wing sunspots are \(\nu = 9.822\) (SE=0.334)/11.072 (SE=0.208) and \(\sigma = 10.162\) (SE=0.167)/9.43713 (SE=0.119). The mean and variance for even northern/southern sunspot groups are 14.94/14.76 and 58.62/56.08, respectively, and the mean and variance for odd northern/southern sunspot groups are 15.52/15.58 and 61.77/58.00, respectively. This shows that the sunspot groups of even cycle wings are on the average at somewhat lower latitudes than sunspot groups of the odd cycle wings, i.e. about 0.6 degrees for the northern hemisphere groups and 0.8 degrees for the southern hemisphere groups. Two-sample T-test gives significance for the difference of the means between even and odd cycles with \(\text{p}<10^{-9}\) and \(\text{p}<10^{-16}\) for north sunspot groups and south sunspot groups, respectively. This means that significance is much better than 99% in both cases.

Figure 2
figure 2

a) The latitudinal probability density of northern and southern (absolute value) hemisphere sunspot groups with their Rician distribution fits for even cycles in the range SC8–23. b) Same as a) but for odd cycles.

4 Distribution of Sunspot Areas

4.1 Latitudinal Distribution of Sunspot Areas

Figures 3a and 3b show the density of all sunspots (independent of the area) as histograms and the total areas of the sunspots at each (integer) latitude for even and odd cycles between SC12 – 23, respectively. The distribution and areal strength of the sunspots are quite similar but do not coincide in the low latitudes. It seems that there are smaller sunspots at low latitudes of the Sun for both odd and even cycles, and also for both hemispheres. Note that the histograms are somewhat narrower for even cycles than for odd cycles in both hemispheres. The averages latitudes are 15.02 and −14.79 for the northern and southern hemisphere for even cycles, respectively, and 15.51 and −15.55 for the northern and southern hemisphere for odd cycles, respectively. Again the average latitudes are about 0.5 and 0.8 degrees lower for even cycles than odd cycles. The average sizes of sunspots are 165.5 and 159.1 MH for even and odd cycles, respectively.

Figure 3
figure 3

a) The number of all sunspots of even cycles in the range SC12 – 22 as histograms and their total areas (left vertical axis) for both northern and southern solar hemisphere. b) Same as a) but for all sunspots of odd cycles in the range SC13 – 23.

Figures 4a and 4c show the total areas of large sunspots (\(\text{area} \geq 500\) MH), medium size sunspots (\(100 \leq \text{area}< 500\) MH), and small sunspots (\(\text{area} < 100\) MH) and number as histograms at each latitude in Figures 4b and 4d for even and odd cycles, respectively. It seems that the largest sunspots are located at latitudes between absolute values 10 – 25 degrees for both hemispheres and both even and odd cycles. There also seems to be some (quasi) periodicity in the locations of the large sunspots, especially for odd cycles. Note that for both hemispheres for even cycles and southern hemisphere for odd cycles there is a gap in the histogram of large sunspots around latitude ±15 degrees. The small sunspots are most abundant and spread from zero to 40 degrees but their total area is much smaller than the total areas of medium sized and large sunspots. The total areas of all sunspots are \(1.78\times 10^{7}\) and \(2.01 \times 10^{7}\) MH for even and odd cycles of the range SC12 – 23, respectively.

Figure 4
figure 4

a) Total areas of large sunspots (\(\text{area} \geq 500\) MH), medium sunspots (\(100 \leq \mbox{area}< 500\) MH), and small sunspots (\(\text{area}< 100\) MH) of even cycles in the range SC12 – 22 and b) the number of each size as histograms for both northern and southern solar hemisphere for even cycles. c) Same as a) but for sunspots of odd cycles in the range SC13 – 23 and d) same as c) but for odd cycles.

4.2 Temporal Distribution and Migration of Sunspots

Figures 5a and 5b show the density of very large sunspot distributions (areas equal or over 1000 MH) for even and odd cycles between SC12 – 23, respectively. The time-resolution of the figures is 0.1 year. In the same figures we show the average and median latitudes of all sunspots (right-axis) as a function of time for the whole period SC12 – 23 (absolute value is taken for southern hemisphere sunspots). The most conspicuous feature for the even cycles is the lack of large sunspots for four years (exactly between 4.2 – 4.5 years) after the start of the cycle. Two-sided T-test shows that this gap has significantly different mean value (at level 99%) with \(\text{p}=0.0022\) than the one-year period before and after the gap. Furthermore, the gap seems to be located at about the time when the evolution of the average sunspot latitude crosses about 15 degrees solar latitude. We believe that this is related to the Gnevyshev-gap (Gnevyshev, 1967, 1977; Storini et al., 2003; Ahluwalia and Kamide, 2004; Bazilevskaya, Makhmutov, and Sladkova, 2006; Norton and Gallagher, 2010; Du, 2015; Takalo and Mursula, 2018). The distribution of odd cycles has only a small decrease in the distribution at 4.2 – 4.3 years, but it is insignificant according to two-sided T-test analysis (\(\text{p}=0.40\)).

Figure 5
figure 5

a) The density of the large sunspot distribution for even cycles in the range SC12 – 22 as a function of decimal year (unit=0.1 year). The blue and red curves show the mean and median latitude of these sunspots as a function of time (the latitude axis on the right). b) Same as a) but for large sunspots of odd cycles in the range SC13 – 23.

5 Conclusions

We have analyzed the spatial and temporal distribution of sunspot groups for even and odd cycles in the range SC8–23. The Rician distribution is the best-fit function for both even and odd sunspot-group latitudinal occurrence. The mean and variance for northern/southern wing sunspots are 14.94/14.76 and 58.62/56.08 for even cycles, respectively, and 15.52/15.58 and 61.77/58.00 for odd cycles, respectively. This shows that the sunspot groups of even cycle wings are on the average at somewhat lower latitudes than sunspot groups of the odd cycle wings, i.e. about 0.6 degrees for northern hemisphere wings and 0.8 degrees for southern hemisphere wings. Furthermore, the distributions for even cycles are slightly narrower than for odd cycles. The skewnesses for all (absolute value) even/odd cycle distributions are 0.589/0.500 and the kurtoses 3.303/3.147, which verify the above-mentioned results.

The spatial analysis of sunspot areas between SC12 – 23 shows that the small sunspots are at lower latitudes of the Sun than the large sunspots for both odd and even cycles, and also for both hemispheres. The total area of the sunspots is more symmetric for the hemispheres of the odd cycles than for those of the even cycles. The large area sunspots (\(\geq 1000\) MH) occur mainly between 10 and 25 degrees at both northern and southern hemisphere and for both even and odd cycles. For even cycles there is a gap at 15 (−15) degrees northern (southern) hemisphere latitudes in the occurrence of large sunspots, but for odd cycles the amount of large sunspots abruptly decreases after 15 (−15) degrees northern (southern) latitude.

The temporal evolution of sunspot areas shows a lack of large sunspots for four years (exactly between 4.2 – 4.5 years), i.e., about 40% after the start of the cycle. This is related to the Gnevyshev gap and is consistent with the earlier result by Takalo and Mursula (2018). The significance level of this gap for even cycles is at least 95% for all sunspots and still higher (99%) for large sunspots. Furthermore, the average of the sunspot areas in this interval for even cycles is 152 MH, while in the surrounding interval (year before and year after) it is 188 MH. We have also shown that this gap is temporally located at the same time as the average latitude of the migration of the sunspots crosses 15 (−15) degrees of northern (southern) heliographic latitude. For odd cycles the gap is narrower (4.2 – 4.3 years) and it is insignificant according to two-sample T-test for the means.