1 Introduction

Are players motivated by others’ expectations? In the framework of standard economics, the answer depends on whether a player’s net benefit increases when the outcome meets others’ expectations. That is, a player is motivated if his or her benefits from meeting expectations exceed his or her cost. In other words, a player is not motivated if the benefit is the same regardless of others’ expectations. Meanwhile, in reality, a player is likely to increase his or her motivation to win when cheers of encouragement are heard from spectators. If so, that support could improve the performance of players. However, economic researchers have yet to examine this notion. Hence, this paper aims to explore whether encouragement from spectators improves player performance using novel data under the quasi-experimental setting of speedboat races.

Players of professional sports possibly become popular not only through their performance but also through their appearance. More physically attractive players are thought to be more popular than other players, even when performance is equal. Accordingly, good-looking players bear on their shoulders the expectations of others. The present study aimed to examine the effect of varying relative attractiveness on individual performance by considering the incentives to meet others’ expectations. Further, we compared the effects of men’s attractiveness with those of women’s. This work is the first to analyse bettors’ expectations based on the perceived effects of facial attractiveness on performance and the related gender difference. Japan speedboat races provide a setting for a natural experiment for comparing men’s behaviour with women’s under the same condition (Booth and Yamamura 2018). We intended to analyse unique performance data of speedboat races, in which men and women racers compete in a highly competitive environment. Our data were in panel form; we gathered information on each racer’s place in races and recorded time across all races in which they competed. We analysed a total of over 35,000 women race observations and over 400,000 men race observations during the period 2015–2017. Professional speedboat racers in Japan number approximately 1600, including both women and men. Each race had different competitors. Using the racers’ photographs that were officially open to the public, we measured the attractiveness and the order of such attractiveness among six racers participating in a race. In this way, this study determined the relative attractiveness effect, which varies according to the situation. Then, we analysed how the relative attractiveness influences the racers’ performance and compared the case between men and women racers. In the boat race, the rewards depend on the grade of the race and one’s place in the race, but do not differ according to popularity among fans. Thus, we could investigate how individual racers have an incentive to meet the expectations of others, dictated by racers’ relative physical attractiveness, even when economic return and racers’ skills are controlled.

Our empirical strategy was as follows: we assumed that racers’ physical attractiveness can influence popularity among bettors’, but not the racers’ performance in the race. Hence, attractiveness was used as an excluded instrumental variable (IV) to control for the endogeneity bias of popularity in the estimation of the effect of racers’ popularity on performance. The strategy revealed the following. (1) In the first stage of IV estimation, racers who were more attractive than other competitors tended to be more popular, as reflected in betting odds, even after controlling for various variables that captured racer’s ability. (2) In the second stage of IV estimation, the more popular racers recorded better race times. (3) More popular men finished with a higher place in races, even though popularity had nothing to do with their gains. Meanwhile, such tendency was not observed for women.

The third finding above is of particular interest. Men have a great incentive to improve their performance in response to expectation from fans even though the return is constant, whereas women do not have such an incentive.

The remainder of the paper is constructed as follows. Section 2 briefly reviews related literature and describes the setting of speedboat racing in Japan. Section 3 presents the construction of data. Section 4 provides the analytical framework of our hypotheses and explains the estimation model. Section 5 gives the estimation results and our interpretation of the major findings. Section 6 provides our conclusions and implications for future research.

2 Related literature and the setting

2.1 Related literature

The expected return, such as earnings, from performance leads individuals to make an effort to improve their skill, accumulating human capital. Other factors influence earnings. The seminal work of Hamermesh and Biddle (1994) provided evidence that the degree of an individual’s attractiveness is positively related to their earnings. Various other researchers have subsequently analysed the influence of beauty in the labour market (Doorley and Sierminska 2015; Fletcher 2009; Parrett 2015; Price 2008).Footnote 1 Physical attractiveness derives sizable rents, considered as a beauty premium reflected in the level of earnings in the labour market (Hamermesh and Biddle 1994). This effect of beauty can be attributed to two reasons (Biddle and Hamermesh 1998). The first is that employers simply prefer to employ attractive-looking applicants for s job.Footnote 2 The second is the production-enhancing effect, which states that physical attractiveness directly increases productivity. For instance, attractive-looking attorneys more easily obtain social skills, which are effective for convincing judges and juries. Physically attractive workers possibly have better communication skills, which increase their wage through interaction with employers (Mobius and Rosenblat 2006). In the case of professional sports, the more physically attractive players who play well are more likely to have opportunities for additional earnings through side jobs. That is, beauty premiums given to attractive individuals depend on their work performance. Consequently, more attractive players make greater effort to improve their performance, which is called the effort-enhancing effect of beauty (Ahn and Lee 2014).

The case of professional sports players merits consideration because it enables scrutiny of the effect of players’ physical attractiveness after controlling for their performance. More attractive-looking American football players are paid greater salaries, and this premium persists even when controlling for player performance (Berri et al. 2011). In team sports, individual attractiveness influences team performance, a phenomenon reported in the office setting: a worker’s attractiveness is related to the firm’s performance (Pfann et al. 2000). Meanwhile, an individual’s beauty effects can be considered to depend on the team’s, and cannot be analysed individually. In individual sports, communication skills are proven ineffective at improving performance. Ahn and Lee (2014) pointed out that players with better looks can gain wider publicity, which in turn can lead to opportunities for additional earnings. They hypothesized that this effect of attractive looks increases as these players improve their performance; attractive players thus have strong incentive to improve their performance. They showed that the performance of more attractive women golfers tended to be better than those who scored lower in attractiveness. In other words, the physical attractiveness of women players boosts their popularity, which directly increases the incentive for better performance. They further argued that beauty premiums may not be entirely rents. However, their argument cannot be generalized because no study has examined whether the mechanism holds for male players.

A number of studies have shown the differences in behaviour between men and women under the same situation.Gneezy et al. (2003) provided evidence that women are less likely to show their real ability in more competitive environments compared with men, even if their performance is similar in non-competitive environments.Footnote 3 Using data on tennis players, Paserman (2010) found the clear difference in performance between men and women tennis players under great competitive pressure in a setting with large monetary rewards. Compared with women players, attractive men players are more able to improve their performance when additional earnings from a side work are large. Similarly, in speedboat races, men racers are more likely to adopt aggressive strategy compared with women, leading to improvement in men’s performance (Booth and Yamamura 2018).

Studies have analysed the effects of absolute physical attractiveness. However, the influence of attractiveness depends on the situation. The most beautiful girl in a high school that becomes a fashion model or an actress becomes average in terms of looks in her new world. That is, the degree of beauty is determined relatively, rather than absolutely. Physical attractiveness leads people to obtain better social skills. It has also been shown to foster workers’ confidence, leading to increased wage levels (Mobius and Rosenblat 2006). Social skills and confidence are formed during a long period of time, and so do not vary depending on the situation. Therefore, these effects are fixed in the short period. An individual’s relative attractiveness and its effects frequently change if his/her objects of comparison change. Research has deconstructed the effect into the fixed effect and the varying one.

2.2 Overview of speedboat racing in Japan

Himura (2015) is our principal source of information in this section. In speedboat racing in Japan, men and women racers receive exactly the same intensive training for 1 year in only one training school, the Yamato Boat School. Both men and women train under the same condition and must pass the final examination to be qualified as a professional speedboat racer. After becoming racers, they compete in races under the same rules and conditions.

At present, Japan has 1600 racers, composed of 1400 men and 200 women. There are 24 speedboat racing stadia mainly located in the western part of Japan. Generally, boat races are randomly held about 4 days per week in each stadium; 1 day sees 12 races. Six racers compete in a race and are generally composed of same-gender racers. Racers participate in races in various stadia. The racing circuit is regulated to be 600 m in length. A race involves three laps; the total race distance is 1,800 m. Races are classified into five grades. The highest grade is Super Grade (SG), followed by Grade I (GI), Grade II (GII), Grade III (GIII), and ‘Usual’, which is the lowest grade. Racers can receive rewards according to race finish order, from the winner (first place) to the bottom (sixth place). The reward for the first-place winner of SG races is around USD 300,000. The rewards for the first-place winner in the subsequent race grades are as follows: USD 100,000 (GI), USD 40,000 (GII), USD 10,000 (GIII), and under USD 10,000 (Usual). The rewards in SG races for the other winners are as follows: USD 150,000 (second place), USD 50,000 (third place), USD 20,000 (fourth place), USD 10,000 (fifth place), and under USD 10,000 (sixth place). These rewards decrease as the grades of racers decline. That is, there are wide variations in the reward offered according to race grade and place in the race.

In addition to race grades, racers are also classified into four grades, namely, A1 (top grade), A2, B1, and B2 (bottom grade). Racers’ grades are determined by their past performances: a higher winning ratio leads to a higher grade. The Japanese Motorboat Association determines race participants. In most SG and GI grade races, A1 racers are selected to participate.Footnote 4 Inevitably, there is a wide gap in total rewards between A1 racers and other grade racers during a season. In addition, A1 racers are likely to obtain opportunities for additional earnings, such as appearance in TV commercials, even though speedboat racing is not so popular a sport in Japan compared with other professional sports like football or baseball. That is, A1 racers possibly enjoy the ‘super star’ effect (Rosen 1981). However, apart from SG and GI races, racers of various grades, from A1 to B2, are evenly and randomly assigned to participate in Usual races. Consequently, in Usual races, A1 racers compete with lower-grade racers.

The probability of winning a race varies according to the competitors in the race. That is, the winning probability of an A2 racer is low in a race where the other five competitors are A1 racers. Meanwhile, the winning probability of an A2 racer is higher when the competitors are A2, B1, and B2 racers. The competitors in a race change in each race, and, as such, the relative position of a racer’s characteristics, such as his/her grade, is different in each race. The same holds true for the degree of a racer’s physical attractiveness. A racer who is the most attractive among competitors in one race may be the least attractive in another set of competitors in a different race. Therefore, physical attractiveness changes according to races even though a racer’s appearance is fixed. Fans of a specific racer may always bet on the racer regardless of the race, but apart from this case, popularity, which can be captured by betting odds, depends on the competing racers and thus varies across races. Popularity is thought to depend on fans’ discrimination based on racers’ physical attractiveness. Even if the fixed effect of attractiveness is removed by fixed effect estimation, the relative effect of attractiveness persists because of the relative position of racers’ attractiveness in each race.

The characteristics of the boat motors, such as maker and type, are the same in all races. However, a disparity in motor performance has become apparent, attributed to differences in maintenance and deterioration through races. Such differences in motor performance have a significant influence on racers’ outcomes in a race. Motor units are randomly assigned to racers by drawing of lots in every race. Meanwhile, a racer’s lane on the track is also considered a key factor affecting the probability of the racer’s winning.Footnote 5 Generally, racing in an inner lane overwhelmingly confers an advantage. Hence, a racer’s performance is determined by not only the racer’s ability but other conditions as well. Therefore, low-skilled racers assigned with a better motor unit and the inner lane have a greater chance to win a race, even when competing with higher-skilled and higher-grade racers in a race. Inevitably, the disparity in racers’ performance is lessened.

Before the races begin, when the window of time for betting has expired, photos of the faces of racers alongside their recent records are displayed on the electric bulletin board. Hence, bettors can make a decision to place a bet by looking at racers’ faces in addition to their statistics. Racers can know the betting odds directly before they start. So, the betting odds possibly influence racer’s performance.

3 Description of data

3.1 Race level data

The study used individual records for theperiod April 2014 to October 2016. The data were collected from the online database officially provided by the Japan Speed-boat Association.Footnote 6 We obtained racers’ performance measured by racers’ place in races and race time. We sought detailed information on the characteristics of the races, such as stadia location, natural conditions (weather, wind speed, wave), racers’ grades, lane assignments, and performance record of motor units assigned to racers.Footnote 7 The database does not include precise information on racers’ weight (capturing the condition of racers on the day of the race) and the grade of races. To obtain these data, we used another source, ‘Boat Advisor’, a database provided by a private firm.Footnote 8

The data were subsequently integrated into an original panel dataset for racers and the races in which they participated. We used racers’ place in races and time record as measures for racers’ performance. A total of 24 stadia were included in the study, which cover a wide variety of locations and statuses. For all races held in these 24 boat race stadia, all of the records of racers’ places in races are available. However, in 18 stadia, racers’ time records are provided only for racers placed in the first, second, third, and fourth places.Footnote 9 Hence, the sample size used for analysing the racers’ time is smaller than that for analysing the place in the race. We restricted the performance data of racers to the active list in the first half season of 2017 (from May to November in 2017). We evaluated their photos in May 2017, which were made open to the public in the official website of the Japan Speed-boat Association, for measuring the racers’ attractiveness. In a season, 1,300 men and 180 women racers are featured on the website.

During the study period, the average number of races in which each of these racers participated was 350 for men and 200 for women. The sample for estimation comprised all those races with complete information on racers’ records. Therefore, in total, the sample size reached 430,000 and 36,000 person-race observations for men and women racers, respectively. One advantage of this study is that the sample size is far larger than the other datasets used to examine the effect of physical attractiveness (Ahn and Lee 2014; Biddle and Hamermesh 1998; Hamermesh and Biddle 1994; Hamermesh and Parke 2005). Our sample size is also far larger than that in existing works that analysed gender differences in work performance (e.g. Dreber et al. 2011; 2014; Paserman 2010).

Table 1 gives the definition of the key variables used for estimations. In most of the variables, the mean values were similar between men and women. The fixed characteristics were controlled in the estimation. Hence, the key variables were the order of variables among six same-gender competitors in a race. That is, the variables associated with order captured the relative position among competitors. Racers’ grades were captured by the fixed effects in the estimation because these hardly changed during the studied period.

Table 1 Mean values and definitions of key variables used for estimation

Figure 1 illustrates the distribution of men and women racers’ grades. The percentages of women racers classified as A1 and A2 were remarkably lower compared with men racers. That is, as a whole, the racer grade for women tended to be lower than that of men. Racers in the highest grade, A1, are more likely to gain opportunities for additional earnings because they are possibly greatly supported and preferred by fans. That is, they possibly enjoy the super star effect (Rosen 1981) and so have a greater incentive to improve their performance in races.

Fig. 1
figure 1

Distribution of racers’ grades

3.2 Measurement of physical attractiveness

In the evaluation of attractiveness, we referred to existing methods (e.g. Ahn and Lee 2014; Biddle and Hamermesh 1998; Dreber et al. 2013; Mobius and Rosenblat 2006). On May 2017, we measured the attractiveness of racers from their frontal head-and-shoulder photographs wearing the same uniform, which are open to the public on the official website of the Japan Speed-boat Association.Footnote 10 From the site, we gathered the photos of 1296 men racers and 180 women racers. We recruited 55 men and 63 women students from Seinan Gakuin University to evaluate the attractiveness of racers by looking at the photos. To collect pure physical attractiveness scores not influenced by player performance, fame, and popularity, we selected evaluators who were not at all familiar with speedboat racing. They were presented with the photos in random order through the originally constructed evaluation system on a computer. The evaluators were asked to rate the physical attractiveness of each racer on a ten-point scale, with 1 indicating ‘Not at all attractive’ and 10, ‘Very attractive’. This rating scale was almost the same as that in Dreber et al. (2013). A total of 118 evaluators rated 1496 photos racers; each photo was rated 118 times.Footnote 11 Different evaluators had different standards for average attractiveness. To remove the influence of individually different standards, we used standardized scores. The standardized attractiveness score for a racer is calculated as follows;

$$ {\text{Standardized}} {\text{score}}_{mi} = \frac{{{\text{Score}}_{mi} - \overline{{{\text{Score}}_{mn} }} }}{{S_{mn} }}, $$

where m is the evaluator, i the racer, and n is the total number of racers (1496). Smis the standard deviation of evaluator m’s scores for the 1496 racers. \({\text{Score}}_{mn}\) is the mean value of evaluator m’s scores for the 1496 racers. That is,

$$ {\text{Score}}_{m} = \frac{{\mathop \sum \nolimits_{i}^{n} {\text{Score}}_{mi} }}{n} $$

Hence, \({\text{Standardized}} {\text{score}}_{mi}\) has a negative value if racer i’s attractiveness score evaluated by m is lower than the mean value of the score based on 1496 racers.

Hence, we gathered 118 standardized attractiveness scores for each racer, because there were 118 evaluators, and calculated the scores for the 1496 racers. The mean value of the standardized attractiveness scores was used as racer i’s attractiveness score. The correlation between the men and women students’ average scores was 0.75, indicating high correlation between different groups. Thus, the attractiveness standards were considered similar within the sample.

A racer’s attractiveness was then compared with that of the competitors’ in a race. Even when the standardized attractiveness scores of “Racer A” were not so high, “Racer A” had the most attractive appearance in a race if the five other racers’ attractiveness scores were lower. A racer’s attractiveness thus depends on the competitors’ attractiveness in a race. Therefore, we focused on the relative attractiveness in a race rather than absolute attractiveness. As shown in Table 1, for a racer’s Attractiveness, we used a place ranking of the attractiveness scores of the 6 racers competing in a given race, which ranges from 1 (least attractive) to 6 (most attractive).Footnote 12

4 Analytical framework

4.1 Hypothesis

Figure 2 illustrates the analytical framework of this study, including the three key components: racers’ attractiveness, racers’ popularity in the race, and racers’ performance in the race. The study aimed to examine how racers’ attractiveness influences race performance by considering the channel through which attractiveness affects racers’ popularity in races. The inverse of betting odds was used to measure a racer’s popularity relative to that of the five competitors in a race. Individuals bet money on a racer because they expect the racer to win the race and then earn benefits.Footnote 13 Apart from the expectation of winning, Forrest and Simmons (2008) suggested that bettors tend to have a wager on their home team. Bettors also tend to lay a wager on attractive racers. We can explain the reason why bettors prefer the relatively better looking racers despite having the same ability. One possibility is that bettors believe that there may be a correlation between absolute good looks and unobservable productivity. Bettors may be more likely to vote for a racer who is relatively good looking because they know the correlation. As long as bettors have an incentive to win the bet, they will want to put it on the racer who has the best chance of winning. Another possibility is that consumers tend to prefer certain attributes, such as consumer discrimination. Bettors may derive utility from the very act of rooting for a beautiful racer.

Fig. 2
figure 2

Analytical framework

As shown in Fig. 2, we tested whether the racers’ relative attractiveness in a race determines their popularity. Furthermore, we assumed that attractiveness is not correlated with race performance. Some researchers might not agree with our assumption because it is inconsistent with the effort-enhancing effect of beauty (Ahn and Lee 2014). However, in the setting of this study, a racer’s inherent attractiveness does not change and so does not depend on competitors. After controlling for this fixed effect of attractiveness, as explained in the previous section, we focused on attractiveness relative to competitors in a race. As such, a racer’s attractiveness varied in each race. The effort-enhancing effect of beauty is thought to be derived from a person’s inherent attractiveness, which is controlled by a fixed effect in this study. Moreover, differing from major sports like football, baseball, and tennis, Japan boat racing is a minor sport, and thus, racers’ side job earnings are very small. Nonetheless, ‘super stars’ in this field may be able to earn a large amount of money. To remove the effort-enhancing channel and super star effects, we conducted estimations by limiting the sample: we excluded A1 racers.

Popularity and performance may also have reverse causality. A racer’s popularity in a race is determined before the race, whereas the racer’s performance is the outcome of the race. Inevitably, there is a time lag between popularity and performance. Popularity in races is reflected in the inverted value of betting odds. Popularity depends on a racer’s performance in past races, which leads to bettors’ expectations. Using a rich dataset, we controlled for various factors, such as innate racers’ talent, racers’ absolute level of skill, and relative skill among competitors in the fixed-effects model, along with other information on race conditions. Specific unofficial information may relate to racers’ popularity and performance. For instance, bettors obtain race information and prediction through boat racing magazines. Consequently, endogeneity bias occurs. To control for the bias, IV estimation should be used.

In the estimation model, we controlled for endogeneity bias and various factors related to racers’ ability and performance. Hence, apart from the bettors’ expectation on winning probability, we analysed how popularity influences racers’ performance. Racers knowing their own betting ratio, thereby their popularity relative to competitors, may affect motivation. For instance, racers might have an incentive to meet fans’ expectations, which could improve their performance. Racers might also unconsciously feel pressure or tensed, which could lower their performance.

The effect of popularity on racers’ performance might not be different among racers if such popularity reflects only the subjective assessment of bettors on racers winning, which is based on bettors’ information. That is, the effect of popularity might be different according to racers’ character: a racer’s reaction to popularity depends on his/her character. For instance, under the highly competitive pressure in professional sports, women tend to display less aggressive behaviour (Booth and Yamamura 2018) and make mistakes (Paserman 2010). If so, women are less able to improve their performance when they are more popular in a race. Therefore, we opted to split the data of person-race observations into men and women races to compare the estimation results between men and women racers.

Based on the argument above, we propose three hypotheses as follows:

Hypothesis 1: racers’ attractiveness makes them popular regardless of their ability.

Hypothesis 2: racers have an incentive to improve their performance to meet the expectation of bettors when they are more popular in races.

Hypothesis 3: the effect of popularity on male racers’ performance is larger than that on female racers’ performance.

If these hypotheses hold, there may be causality expressed in the black solid arrows in Fig. 1. The effect of attractiveness investigated in this study may be called the expectation-enhancing effect. This effect is distinct from the productivity-enhancing effect of attractiveness provided by Biddle and Hamermesh (1998) because there is no need for social and communication skills to improve racers’ performance. Moreover, prize winnings for racers do not depend on the betting odds; only the benefit for bettors greatly depends on the odds. Hence, this study does not examine the effort-enhancing effect of attractiveness (Ahn and Lee 2014).

Racers’ confidence derived from their attractiveness possibly influences their performance (Mobius and Rosenblat 2006). However, confidence is thought to be formed through various experiences in the long term and can be accumulated stock. Therefore, we assumed that confidence does not change across races. Although confidence relates to the absolute value of attractiveness, which is captured by fixed effects in this study, we focused on the ranking of racers’ attractiveness in each race, which, we believe, is independent of confidence.

4.2 Empirical model

For testing the hypotheses, we used the following model, and estimated it with the IV method taking Attractiveness as the excluded IV.

$$ Y_{ik} = \, \alpha_{1} {\text{Popular}}_{ik} + \, X_{it} \prime B \, + \, Y_{it} \prime C \, + \, e_{i} + \, m_{k} + \, u_{ik} $$
(1)

Before proceeding to IV estimation, let us begin with the explanation of the estimation of Eq. (1) regarding Popularity as an exogenous variable. It is necessary to control for racers’ individual time-invariant characteristics, such as inherent ability and physical attractiveness. To this end, we employed the fixed-effects model in the baseline model. The dependent variable Y denotes performance of individual i on race k. Place in race and race time are both relevant information for capturing a racer’s performance. Place matters more to the racers and bettors because it translates directly into winnings. Racers want to travel faster than competitors to finish at a higher place. Hence, although there are two different measure proxies, the principal performance covariate is place in races, ranging from 1 (sixth place; top) to 6 (first place; bottom) in races in which six racers compete. In the estimation, the inverted log of race time, i.e. 1/log (race time), is used to imply that larger values show a higher performance.Footnote 14Popularity is the order of betting odds among six racers: 1 (least popular; largest odds) to 6 (most popular; smallest odds).Footnote 15α1, the coefficient of Popularity, is expected to have the positive sign because more popular racers are more likely to rank higher (better performance).

As found by Yamane and Hayashi (2015), the performance of adjacent competitors positively influences swimmers’ performances. Therefore, it is crucial to control for order in skill, which is regarded as a racer’s relative skill to win among competitors in a race. The row vector Xit captures their controls, such as relative racer’s skill and weight, relative motor unit performance, and dummy for lane. Unobservable individual time-invariant characteristics ei are controlled for through fixed effects estimation. The outcome of boat racers is greatly influenced by natural conditions because the races are on water and in an open air environment. mk refers to race conditions, such as wave, wind speed, and weather. To compare results between men and women, we opted to split the data of person-race observations into men and women races.Footnote 16

In the IV estimation of Eq. (1), we conducted closer examinations of the fixed-effects IV model for exploring causality. As explained in Sect. 3 and illustrated in Fig. 1, the endogeneity bias of Popularityik was controlled by the excluded IV Attractiveness. From Hypothesis 1 proposed in Sect. 3, in the first stage of the model, the coefficient of Attractiveness is expected to have the positive sign because more attractive racers are more popular. For testing Hypotheses 2 and 3, in the second stage, the coefficient of Popularity suggests whether the effect of popularity relative to that of competitors’ influences a racer’s performance. From Hypothesis 2, the coefficient of popularity is expected to have the positive sign. Hypothesis 3 leads to the prediction that popularity’s absolute value for male racers is larger than that for female racers.

5 Estimation results

5.1 Baseline fixed-effects model

Table 2 reports the results of the fixed-effects estimation of the baseline model where the dependent variable is place in race (from first to sixth). Various control variables were included in the estimations, although their results are not reported in the table. Results based on the men racers sample are in columns (1) and (2), whereas those on women are in columns (3) and (4).

Table 2 Baseline estimation (fixed effects): ‘Place in race’ as dependent variable. Estimation results of Eq. (1)

Compared with lower-grade racers, A1 racers are far more well known and popular, and have greater opportunity to earn additional earnings, for instance, by appearing in TV commercials. Therefore, the effort-enhancing effect may occur (Ahn and Lee 2014). A kind of ‘super star effect’ is considered to be in the error terms and correlated with both racers’ performance and popularity, causing endogeneity bias. To remove the effort-enhancing and super star effects, we limited the sample where we excluded A1 racers. Columns (1) and (3) in Table 2 are on the sample of A1, A2, B1, and B2 racers, whereas columns (2) and (4) are on sample of A2, B1, and B2 racers.

As for Table 2, Popularity shows the positive sign and statistical significance at the 1% level in all columns. This outcome is in line with our inference. The absolute value of coefficients was almost around 0.30 in all estimations. This suggested that a rise in popularity by rank among the six racers resulted in the rise of place in a race by 0.30. For example, a racer’s place in a race rises approximately by one place if the popularity rank improves from 6th to 3rd. Among the independent variables, Order of racers’ grades and Order of racers’ winning rates captured a relative racer’s skill using the racer’s and competitors’ race performance records prior to a race. A larger value of these variables indicated higher skills. In addition, Order of motor units’ performance captures relative quality of motor assigned to the racer. The larger the value of the Order of motor units’ performance, the better the quality of the motor unit. These variables showed the positive sign and were statistically significant at the 1% level. Order of racers’ weight captured racers’ relative physical condition. The larger the value of this variable, the heavier the racer is. Its coefficient exhibited the significant negative sign, implying that the heavier racer has a disadvantage in the race.

Motor unit performance and racers’ weight vary per race, whereas a racer’s inherent attractiveness and grade are fixed and captured by the fixed effects. Hence, the effects of Motor units’ performance and Racers’ weight could be estimated; they showed the positive and negative sign, respectively. They were almost statistically significant, consistent with their relative effects. Lane dummies (Lane_2 to Lane_6; Lane_1 is reference) showed the negative sign and statistical significance. This outcome is consistent with the argument that the racer in the inner lane has great advantage. Dummies for natural conditions, such as Wave, Wind speed, Fine weather, and Rain, were less likely to influence place in the race. All racers in a race are in the same natural conditions and so their relative performance (place in the race) is not influenced by them.

Table 3 presents the estimation results where the alternative racer performance variable, ‘inverted log of recorded race time’, is the dependent variable. Popularity showed the positive sign and statistical significance throughout. Other control variables showed similar results to those in Table 2, indicating the robustness of the estimation results. The exception was the variable of natural conditions, which reasonably showed statistical significance to the race time: the Wave, Wind-speed, and Rain dummies showed the significant negative sign, whereas the Fine weather dummy showed the significant positive one. That is, bad natural conditions deteriorate the time record, whereas good natural conditions (fine day) improve it.

Table 3 Baseline estimation (fixed effects) with ‘inverted log of recorded race times in races’ as dependent variable. Estimation results of Eq. (1)

5.2 Fixed-effects IV model

Table 4 presents the results of the fixed-effects IV model, where place in races is the dependent variable. Table 4 reports the IV estimation results of Eq. (1), including only the estimates on Popularity in the second stage Eq. (1); all the other control variables were included in the estimation. For the first-stage estimation, the results of Attractiveness (the excluded IV) and other control variables are reported.

Table 4 Estimations (fixed-effects IV model), ‘Place in race’ as dependent variable

In the first stage, it is essential to check the validity of excluded instruments, based on various tests for weak, under, and over-identification. However, inTable 4, the excluded IV is only Attractiveness, and so the model is just identified. Accordingly, we reported result of only a weak identification test.Footnote 17 The null hypothesis being tested is that the estimator is weakly identified in the sense that it is subject to bias that the investigator finds unacceptably large (Baum et al. 2007, p. 489). In columns (1) and (2), the F-statistic is larger than 16.4, corresponding to Stock–Yogo critical values at the 10% level, and thus we reject the hypothesis of weak instruments. With the exception of column (3) and (4), the results of the tests rejected the null hypotheses that the equation is weakly identified. In alternative specifications exhibited in Table 5, instead of Attractiveness, dummies to capture racers’ rank of Attractiveness among racers were used as exogenous instruments. Table 5 shows the results for weak, under-, and over-identification in male samples to indicate instrument validity. However, for female samples, the results did not lead to rejection of the null hypothesis of weak instruments. These results suggest the validity of the instruments for male but not female samples. The coefficient of Attractiveness yielded the positive sign and statistical significance in all columns. In the male sample, dummies for attractiveness significantly showed the positive sign and the coefficient’s values were larger in the higher-ranked racers; in the female sample, the dummies did not show robust results. Nonetheless, these observations support Hypothesis 1. In our interpretation, attractiveness from appearance differed between males and females. When evaluating attractiveness, people emphasized tough-looking faces for males but sweet and pretty faces for females. We should consider the role of attractiveness in this situation. In a situation of competition such as the boat race, racers compete to win. In this setting, a racer’s tough-looking face plays a greater role in increasing fans’ expectation of the racer winning if other demographic characteristics are equal. Put differently, a sweet and pretty face is unlikely to increase expectations.

Table 5 Estimations (fixed-effects IV model), ‘place in races’ as dependent variable. Exogenous IVs are dummies of attractiveness

In the second stage, the sign of Popularity was positive and statistically significant at the 1% level in columns 1 and 2 for the men’s sample. Popularity showed statistical insignificance in column 3 and for the women’s sample. In our interpretation, even after controlling for the fixed effects and endogeneity of Popularity, men racers improved their performance to meet the expectations of bettors. Meanwhile, the expectation does not lead women to improve their performance under the assumption that women cannot exhibit their usual performance under high pressure. This result is in line with the findings of Paserman (2010). That is, for women, positive Popularity effect seems to be neutralized by the detrimental effect of social pressure for women racers. Thus, Hypotheses 2 and 3 were supported.

From the sample of all men racers, the absolute value of the coefficient ofPopularity (0.43) was 1.5 times larger than 0.29 in Table 2, indicating reverse causality from Performance (result of the race) to Popularity (betting odds before the race). It is reasonable to think that a higher Performance causes higher Popularity. Therefore, the endogeneity is considered to cause the over-estimation bias in the fixed-effects model; controlling the bias could reduce the value of the coefficient of Popularity. As shown in Table 4, the fixed effect IV model showed larger coefficient’s values compared with the baseline model of Table 2. An explanation is that some other mechanism might be at work between these two variables.Footnote 18 We believe the following effect is important: individuals have the motivation to bet on a racer whose probability of winning is expected to be low because bettors aim at the jackpot (Griffith 1949). Hence, those whose performances were relatively low might be subjects of larger wagers.

As for the specification where the inverted log of race time is the dependent variable,Table 6 shows the results of key variables. All the control variables are equivalent to those in, Table 4. Overall, the results are similar to those in Table 4. Ariely et al. (2009) provided evidence on the same by conducting experiments that high monetary rewards can decrease performance. In professional sports, enthusiastic supporters’ expectation for players’ performance generates pressure. Dohmen (2006), in studying professional football players, reported that players’ performance is negatively affected by the presence of a supportive audience.Footnote 19 Contrary to the negative effect of supporters, pressure from the support of an audience and fans has a greater influence on the referee’s decision-making, influencing game outcomes. In sports games, social pressure from supporters for the home team leads referees to make decisions that favour the home team (Dohmen and Sauermann 2016; Garicano et al. 2005), whereas games with no audience do not show the referee’s ‘favourism’ (Pettersson-Lidbom and Priks 2010).

Table 6 Estimations (fixed-effects IV model), ‘inverted log of recorded race times in races’ as dependent variable

In amateur sports where there are no monetary rewards for winners, some players might have greater incentive to win, if cheered by a greater number of people. For instance, individual athletes ‘may be more motivated to achieve Olympic fame when the events are conducted in front of friends and family’ (Bernard and Busse 2004, p. 414). People support players because of shared identities, such as nationality or locality. The factor causing spectators to support players is not related to the probability of winning and to monetary gains from players’ winning. The same holds for our analysis: racers’ physical attractiveness is not related to racers’ ability or probability of winning. However, attractiveness causes spectators to support and expect much from the players even if the players’ ability and probability of winning are controlled. Bettors have a wager on attractive racers, all other things being equal. In turn, racers have an incentive to improve their performance, although the monetary rewards for racers do not depend on betting odds. In other words, the expectation-enhancing effects of physical attractiveness improve the men racers’ performance.Footnote 20 However, this effect seems to be neutralized by the detrimental effect of social pressure for women racers. As such, physical attractiveness indirectly gives men racers a great incentive to improve their performance, resulting in long-term skill accumulation, whereas in women racers, physical attractiveness does not contribute to performance and skill accumulation.

6 Conclusion

Monetary and non-monetary incentives cause individuals to change their behaviour. In the novel setting of boat racing where men and women compete in the same conditions, the unique data allowed us to scrutinize how non-monetary incentives lead to different outcomes between men and women. Our main research interest was to analyse how racers’ physical attractiveness gives bettors a non-monetary incentive to wager and how racers react to the expectation from bettors when the rewards of winning do not depend on their odds ratio (expectation). The major findings through the fixed-effects IV estimations were as follows. (1) Bettors were likely to wager on racers who were more attractive than their competitors. (2) Men racers displayed better performance as a larger number of bettors wager on them. However, this tendency was not observed for women racers. Thus, an implication is that men have a greater incentive to win even if their rewards are the same, as a larger number of bettors would suffer loss and be disappointed unless the racers win.

Men racers have a motivation to avoid the bettors’ loss and disappointment. In other words, men racers attempt to satisfy the bettors who bet on them, which in turn increase racers’ own utility. Thus, fans’ and supporters’ utility in included in men racers’ utility. This mechanism generates the expectation-enhancing effects of physical attractiveness. Meanwhile, apart from this effect, women racers tend to feel larger social pressure, which deters them from improving their performance. Males are more likely than females to show their real ability in more competitive environments (Gneezy et al. 2003). We can interpret this in line with our findings as showing that the expectations of others give males a greater incentive to demonstrate their maximum ability.

Finally, the remaining issues need to be addressed. That the effect of popularity on female racer’s performance was “not significant” did not necessarily mean that the effect is weak for female racers. Hence, to be precise, the gender gap of the expectation-enhancing effects was only a possibility that was not supported by the evidence. To provide evidence of a gender gap, it is necessary to statistically test the null hypothesis that the coefficients of the popularity measures are equal. However, there are difficulties in testing the hypothesis. Some results using the women’s sample did not show the validity of IV; therefore, results of the fixed-effects IV may not be as reliable as those using the men’s sample. Unbiased results can be obtained by using an appropriate experimental method for dealing with endogeneity bias. Furthermore, it would be worthwhile comparing the size of the fans’ expectation effect in the present study with that of other studies; however, limitations in our study prevented our doing so. We hope that future research will explore these issues further.