“There is nothing so practical as a good theory (Kurt Lewin).”

More and better customer data have long been marketers’ holy grail when such information was scarce. Firms are now investing significant resources into big data technologies and analytics (BDTA), following the assumption that they may drive superior performance (Lambrecht and Tucker 2015), enable business transformation (Davenport and Bean 2019), and facilitate disruptive business model innovations (Sorescu 2017). This is particularly evident in the service industry, where BDTA are changing the nature of the customer–firm connection, thereby disrupting existing value propositions (Huang and Rust 2017). Yet, the accelerating rate of big data investments is not always matched by an increased quality and effectiveness of marketing decisions (Shah et al. 2012), and senior managers report mixed perceptions of the extent to which big data contribute to a firm’s performance (Bean and Davenport 2019).Footnote 1 Hence, developing a better understanding of how big data investments can turn into a competitive advantage is a priority for scholars and practitioners alike (Marketing Science Institute 2018).

The current understanding of the process through which big data investments pay off emerges from three research streams. First, information systems (IS) scholars recently started to describe the organizational affordances enabled by big data investments (Dremel et al. 2020; Lehrer et al. 2018). Affordances are defined as possibilities for action offered by a given technology to a goal-directed actor with certain capabilities (Volkoff and Strong 2018). Relatedly, marketing scholars directed attention towards the revolutionary impact of digital service innovation for competitive advantage in data-rich environments. In fact, ubiquitous communications and big customer data lead to the emergence of better and more personalized services (Rust and Huang 2014), which firms can use to build deeper and more profitable customer relationships through data-driven marketing actions (Kumar et al. 2013; Marinova et al. 2017). Finally, studies from strategic management and IS, established a positive relationship between big data investments and organizational outcomes such as firm productivity (Müller et al. 2018), firm performance (Wamba et al. 2017), and firm value (Tambe 2014). The positive link between big data investments and performance is attributed to the competitive advantage brought about by BDTA unique resources and capabilities (Grover et al. 2018). In this study we integrate the key tenets from these three largely disconnected research streams into a single and coherent conceptual model, and subject it to a rigorous, empirical test. The contributions of our study compared to previous research are summarized in Table 1.

Table 1 Summary of our contributions compared to existing research

First, we conceptualize and operationalize three distinct big data marketing affordances, defined as specific marketing actions enabled by investments into BDTA. These are customer behavior pattern spotting, real-time market responsiveness, and data-driven market ambidexterity. In doing so, we contribute to the nascent big data affordances literature by developing new scales for big data marketing affordances, and complementing existing case study evidence with the first quantitative examination of affordance-based hypotheses in both marketing and IS research. Second, we identify big data marketing affordances as antecedents of service innovation. Indeed, the central role of service innovation was conceptually argued in previous research on data-driven marketing (Rust and Huang 2014). Our findings add to this work by empirically testing the relationships between big data marketing affordances and service innovation, and differentiating among direct, indirect, and conditional effects of big data marketing affordances on perceived big data performance. Third, we add to research on the relationship between big data investments and performance by simultaneously accounting for big data marketing affordances and service innovation, against previous literature that focused on the direct effects of big data investments (Wamba et al. 2017), and identify industry digitalization as a boundary condition for the effect of big data investments on perceived big data performance.

To balance conceptual rigor and managerial relevance, our work is grounded in the marketing context and validated with marketing decision-makers. A key assumption within marketing strategy research is that structured data on customers, competitors, and the market environment are intentionally collected and analyzed, following predetermined market research techniques and analytical tools, to serve premeditated information needs and organizational goals. None of these assumptions holds its validity in the case of BDTA (Balducci and Marinova 2018). Our affordance-based perspective complements the extant body of knowledge built around market intelligence and market orientation—the intentional and predetermined acquisition, dissemination, and use of structured market data for premeditated information needs—to offer a reference point shift for marketing decision-makers involved with BDTA.

Our study is articulated as a scale development followed by a rigorous test of the scales’ nomological validity (i.e., Homburg et al. 2015; Kuehnl et al. 2019). In line with this design, we first introduce the study’s conceptual background. Next, we empirically validate the conceptual and practical relevance of our three big data marketing affordances, and then develop new scales to measure them (Study 1). Building on this scale development effort, we test the nomological network of big data marketing affordances, service innovation, and perceived big data performance, including the contingency role of industry digitalization (Study 2). We use data from interviews and multiple surveys with more than 400 managers from two countries, and proactively address potential sources of endogeneity (Rutz and Watson 2019).

Conceptual background

Big data technologies and analytics

Big data are described as large-volume, high-velocity, and heterogeneous data that emerge from the increasing digitization of transactions, interactions, communications, and everyday experiences (McAfee et al. 2012). Big data technologies include data management technologies that enable several key processes (Gandomi and Haider 2015): acquisition and recording; extraction, cleaning and annotation; integration, aggregation, and representation. Big data analytics make it possible to model, analyze, and interpret big data (Gandomi and Haider 2015). Big data analytics are often described in terms of their approach and contribution to decision-making (i.e., descriptive, diagnostic, predictive, and prescriptive; Wedel and Kannan 2016), and based on their specific use domain (i.e., text, speech, web, network, and mobile analytics; Chen et al. 2012). Thus, BDTA encompass the whole process of turning big data into valuable insights, which goes through managing structured and unstructured data, creating ad-hoc data structures, and using specialized analytical algorithms (Balducci and Marinova 2018).

Affordance theory

The key theory supporting our conceptualization is affordance theory. The origins of affordance theory track back to Gibson’s (1979) work on the psychology of perceptions. According to this theory, goal-directed actors (i.e., animals, humans) interpret, relate to, and learn about objects in the environment (i.e., rocks, rivers, trees) in terms of the possibilities for action that these objects offer (i.e., their affordances), rather than by cognitively processing information on the physical characteristics inherent to the object and independent of the actor. For example, a tree can afford a bird to build its nest, a giraffe to eat, or a man to procure wood for burning or building. Thus, affordances depend on the relationship between actors’ goals and abilities and the characteristics of objects within their environment (Chemero 2003). This relational property of affordances makes them dynamic: as the environment and the goals/abilities of actors change over time, new affordances emerge, and old ones cease to exist.

IS scholars translated the concept of affordance to study the impact of technology on organizational design and performance (e.g., Hutchby 2001; Strong et al. 2014; Zammuto et al. 2007). At its core, the IS affordance perspective advances that technologies offer possibilities for action that emerge from their interaction with organizational actors, processes, structures, systems, and goals. An affordance is actualized when an actor realizes its potential for action. For an affordance to exist, it is not necessary for an actor to have actualized it, however an actor who could actualize the affordance must exist (Strong et al. 2014). Thus, a technology may offer many affordances, but only a subset of these may be actualized at any given point in time.

While the affordance theory lens can be applied at multiple levels of analysis (e.g., individual versus organizational), the emphasis of recent research has been on organizational affordances, defined as “the extent that the potential actions enabled are associated with achieving organizational-level concrete outcomes in support of organizational-level goals” (Strong et al. 2014, p. 74). For example, Zammuto et al. (2007) provide five examples of organizational affordances offered by information technologies: work process visualization, real-time innovation, virtual collaboration, mass collaboration, and simultaneous synthetic representation. More recent contributions shed light on new organizational affordances offered by BDTA: Lehrer et al. (2018) use qualitative case studies to conceptually describe two important clusters of big data affordances in service companies: service automation and human-machine service practices; Dremel et al. (2020) analyzed four big data affordances for automotive companies: establishing customer-centric marketing, provisioning vehicle-data-driven services, data-driven vehicle developing, and optimizing production processes.

Big data marketing affordances

In line with affordance theory, we define big data marketing affordances as actualized possibilities to perform specific marketing actions offered by BDTA, which could not be realized otherwise. While affordance theory has not been explicitly recognized within the marketing literature so far, several studies of the impact of BDTA on marketing activities make direct reference to marketing affordances (e.g., Erevelles et al. 2016; Germann et al. 2014; Rust and Oliver 2000; Sleep et al. 2019; Troilo et al. 2017; Wedel and Kannan 2016). Drawing on these previous conceptualizations and examples, we introduce three primary big data marketing affordances: customer behavior pattern spotting, real-time market responsiveness, and data-driven market ambidexterity. These affordances capture marketing actions enabled by BDTA for specific actor/time/space combinations (Lycett 2013). Thus, we define big data marketing affordances as actualized organizational affordances. This distinction is crucial because while affordances represent a mere potential for action, actualized affordances relate to specific actions that have taken place to produce organizational-level concrete outcomes (Strong et al. 2014).

Customer behavior pattern spotting

The first big data marketing affordance, customer behavior pattern spotting, captures the extent to which BDTA afford companies the ability to spot and predict patterns of customer behavior that would not be easily detectable otherwise. This affordance may lead to the identification and implementation of effective marketing responses to both desirable and non-desirable customer behaviors. Customer behavior pattern spotting is an essential contribution of BDTA to marketing actions, which does not depend of previous knowledge or hypotheses, as it predominantly concerns the “know what” rather than the “know why” of customer behavior (Erevelles et al. 2016). For example, Germann et al. (2014) describe how Walmart used BDTA to detect a sharp rise in Pop-Tarts sales shortly before a hurricane, or NFLShop.com identified an increasing trend of online purchases from women buyers in the Holiday season.

Real-time market responsiveness

Well before big data became a buzzword, Rust and Oliver (2000) anticipated the concept of real-time marketing as the new dominant paradigm for many service and product markets. Real-time marketing is based on the ability of marketers to satisfy the different and evolving needs and preferences of individual customers as and when they emerge (i.e. in specific actor/time/space contexts). More recently, Wedel and Kannan (2016, p. 102) described “the development of models to generate diagnostic insights and support real-time decisions from big data” as imminent. Real-time marketing works when companies embed customized offerings in “decentralized ‘intelligence’ capable of anticipating or reacting to customer needs, either overtly or covertly, or to environmental changes” (Rust and Oliver 2000, p. 54). Marinova et al. (2017) argued that the combination of big data streams, smart technologies, and analytical techniques can generate arrays of data-driven options for real-time marketing action, which complement or substitute the interaction between customers and front-line employees. In fact, BDTA make it possible to minimize response time latency, defined as the temporal separation between an event and the response to that event, by reducing the intervals between data capture/analysis/interpretation and action (Pigni et al. 2016). In line with this literature, we define real-time market responsiveness as the extent to which BDTA afford companies the ability to analyze events, make-decisions and enact a market response with minimal response time latency. For example, using live patient data and heart attack incidences over time, a smart medical device firm can predict in real-time when a patient is about to have a heart attack and respond with a series of actions and messages targeted to market stakeholders, such as the patient, her doctors, and the emergency services nearby (Marinova et al. 2017).

Data-driven market ambidexterity

Market ambidexterity means the ability to exploit current customers/markets while exploring new ones (Voss and Voss 2013). Previous studies suggest that BDTA are a powerful tool to leverage information on current customers and connect them to future customer needs and market opportunities (e.g., Erevelles et al. 2016; Huang and Rust 2017; Zuboff 2015). We refer to this affordance as data-driven market ambidexterity: the extent to which BDTA afford companies to identify synergies and connections between current and future customer needs, service offerings and strategic market opportunities that would not be established otherwise. BDTA make it possible for firms to access large volumes of various information about new customer needs and market domains more cheaply, and therefore reduce the traditional barriers associated to exploratory learning (March 1991). Thus, data-driven market ambidexterity affords firms the visualization of new services that have synergies with their current offerings, customer needs, competitive conditions, and environmental trends. For example, Amazon employs BDTA to constantly explore new business opportunities in complementary product categories, which can be developed internally or through MarketPlace partners; at the same time, the company uses BDTA to exploit its internal service process by targeting external customers with similar needs for information solutions (Amazon Web Services) or logistical solutions (“Fulfilled by Amazon”).

Service innovation

The role of service innovation is central to understand the link between big data investments, big data marketing affordances, and performance. As summarized by Rust and Huang (2014, p. 206): “Increasingly, and inevitably, all of marketing will come to resemble to a greater degree the formerly specialized area of service marketing, only with an increased emphasis on marketing analytics”. In line with previous studies, we conceptualize service innovation across three dimensions: service concept, service process, and customer experience (Barrett et al. 2015; Den Hertog 2000; Miles 2008).Footnote 2 New service concept refers to new ways to organize a solution to customer needs or problems, service process innovation consists of new or enhanced internal systems to deliver services to customers more efficiently, and customer experience innovation captures innovative firm-customer interfaces (i.e., touchpoints) that improve service interactivity and customization. Importantly, the notion of service innovation is not limited to the traditional services industries but is equally salient for the servitization of product-based strategies and business models (Barrett et al. 2015).

Study 1: Developing scales for big data marketing affordances

Rationale of the scale development process

As big data affordances are possibilities for action offered by BDTA to organizations, their number is potentially indefinite, as firms’ goals, capabilities, and competitive contexts are various and constantly changing (Majchrzak and Markus 2013). For this reason, affordance theory research typically targets sub-sets of affordances with a highly granular approach that captures the relevant possibilities for action within a given actor/time/space frame (Dremel et al. 2020; Lehrer et al. 2018; Majchrzak and Markus 2013; Strong et al. 2014).

In line with the need for a granular analysis, our aim was to identify a workable set of relevant big data marketing affordances for firms operating in today’s competitive environment that could provide a parsimonious yet comprehensive basis for our theorizing and empirical analyses. Existing work on technological affordances in organizations is either conceptual (e.g., Majchrzak and Markus 2013; Volkoff and Strong 2018) or based on case study evidence (e.g., Dremel et al. 2020; Lehrer et al. 2018), and no previous study focused on BDTA from a marketing affordances perspective. For these reasons, we could not rely on existing scales for measuring big data marketing affordances. We therefore developed new scales, which cover the dimensionality we derived conceptually, and are parsimonious with respect to the number of items for each dimension. In line with prior research (e.g., Homburg et al. 2015; Kuehnl et al. 2019), we followed established scale-development procedures (e.g., Churchill Jr. 1979; Gerbing and Anderson 1988). Table 2 shows the seven steps in our scale-development process.

Table 2 Overview of the scale development process for big data marketing affordances

Validation of the big data marketing affordances (Study 1a)

The first step in our scale development process was the validation of the three focal big data marketing affordances that we conceptually derived from previous literature (e.g., Erevelles et al. 2016; Pigni et al. 2016; Rust and Oliver 2000; Sleep et al. 2019; Troilo et al. 2017). To this aim, we first conducted six interviews with senior managers with extensive knowledge of BDTA, to gather their views on the relevance and validity of our three big data marketing affordances, to receive feedback on their conceptualization and definitions, and to solicit examples. This initial evidence suggested that managers recognize the face validity of our three big data marketing affordances, and can provide relevant examples (see Web Appendix 1). Also, these interviews did not indicate that we omitted important big data marketing affordances.

Next, to corroborate these preliminary indications, we collected a short survey asking informants to report up to five marketing actions enabled by BDTA in their firms. We sent a link to an online survey to 200 marketing managers who attended executive education in one of the authors’ business school, and obtained valid replies from 53 of them (26.5% response rate). We collected a total of 121 entries, for an average of 2.3 marketing actions per informant. We asked two doctoral students and one data scientist who were unfamiliar with our study to independently group responses that shared a common theme. Any diverging interpretation was discussed and reconciled. The top four themes obtained via this process (81% of all entries) largely overlap with our three big data marketing affordances and the service innovation constructs. Other themes include sales-related affordancesFootnote 3 (7.4%), and pricing-related affordances (2.5%); the residual entries (9.1%) are not underpinned by a single theme (Web Appendix 2). This evidence supports the parsimony and relevance of our three-affordance framework. Next, we generated an initial item pool for the three big data marketing affordances.

Item generation and initial reduction of items

We employed two sources of information for item generation: a review of relevant conceptual literature (e.g., Erevelles et al. 2016; Pigni et al. 2016; Rust and Oliver 2000; Troilo et al. 2017; Wedel and Kannan 2016), and semi-structured interviews with 40 managers in seven large international service firms.Footnote 4 From the review and the interviews, we generated an initial pool of 15 items for each of the three marketing affordances. Conscious of the need to make scales short enough to be usable in empirical research, we first shared the definitions of big data marketing affordances with three marketing faculty members and two data science practitioners, and asked them to assign each item to one of the three specific affordances. We conservatively excluded 17 items that did not receive a 100% consistent assignment, and further refined some items according to the experts’ suggestions.

Reduction of the number of items (Study 1b)

We submitted the remaining 28 items to a quantitative test with managers of 107 service firms from Italy in 2016.Footnote 5 The initial sampling frame was obtained from Bureau van Dijk-AIDA, a dataset of all registered firms operating in the country.Footnote 6 From an initial list of 8169 firms, we narrowed down to 263 firms that met two criteria: 1) they reported on their website current examples of big data projects and service innovations; 2) they had a company LinkedIn profile from which we could identify organizational roles linked to big data and innovation. A research assistant used LinkedIn to contact the selected informants. We obtained 122 responses (46% response rate), and discarded 15 due to missing data. 53% of informants were IT managers,Footnote 7 12% marketing managers, 12% business development managers, 7% CEOs, 7% CIOs; the remaining 9% included product managers, innovation managers, and others. In line with Kumar et al. (1993), we checked for informants’ knowledge (“How knowledgeable are you about the topics of the survey?”), involvement (“How involved are you with the activities and decisions mentioned in this survey?”), and confidence (“How confident are you in the quality of your responses to this survey?”). Informants scored high on 10-point scale (“1 = not at all”; “10 = a lot”) regarding big data knowledge (mean = 7.75), involvement (mean = 6.90), and confidence (mean = 7.97).

Scale parsimony and dimensionality

To develop parsimonious scales, we chose the items with the highest indicator reliabilities for each dimension. The elimination of semantically similar items did not lead to a loss of content validity and reliability. Importantly, the model with the 12 remaining items achieved better goodness-of-fit values than the model with 28 items.Footnote 8 Table 3 gives examples, for each item, of managerial quotes referring to customer behavior pattern spotting, real-time market responsiveness, and data-driven market ambidexterity from the interviews.

Table 3 Item-by-item quotes from the manager interviews for the three big data marketing affordances

An exploratory factor analysis (EFA) with the final pool of 12 items revealed the three expected factors with eigenvalues greater than 1 (see Web Appendix 3). In line with our conceptualization, a confirmatory factor analysis (CFA) with the final 12 items confirms the three dimensions of customer behavior pattern spotting, real-time market responsiveness, and data-driven market ambidexterity (see Web Appendix 4). Specifically, standardized factor loadings were all high and significant (p < .01) ranging from .72 to .97. Moreover, the final three-factor model showed good global and local fit statistics: Satorra–Bentler corrected chi-square (degrees of freedom) χ(51) = 109.58, comparative fit index (CFI) = .95, Tucker–Lewis index (TLI) = .93, root mean square error of approximation (RMSEA) = .10, standardized root mean square residual (SRMR) = .07. The Cronbach’s Alpha are ≥ .88, and the average variance extracted (AVE) ≥ .66. Examination of the Fornell–Larcker criterion, which requires that for every pair of factors the squared correlation should be smaller than each factor’s AVE, revealed that discriminant validity exists for each of the three factors. Moreover, comparisons of models with different numbers of big data marketing affordances dimensions confirmed the best model fit for the three-factor solution.Footnote 9 These findings corroborate the evidence from Study 1a in establishing the dimensionality of the three big data marketing affordances.

Validation and discriminant validity (Study 1c)

With Study 1c, we sought to validate the measurement properties of the three big data marketing affordances scales, using marketing and IT managers from various industries as key informants. In addition, we tested the discriminant validity of big data marketing affordances against three related big data constructs. The key is that affordances capture the actualized possibilities for action stemming from BDTA, in contrast to mere big data availability, or the presence of a data-oriented culture, and collaborative structures (i.e. marketing-IT collaboration).

Data collection and measures

The sampling frame for Study 1c consisted of 600 managers who attended executive education courses on data analytics, digital marketing, and/or service marketing at one of the authors’ business school between 2014 and 2017. The alumni office of the business school contacted the informants with a request to take part in the study, accompanied by a letter from the research team and a link to the web-survey. We received 135 responses (22.5% response rate), 14 of which were discarded due to missing data or because informants reported that their confidence in the big data topic was 6 or less (on a 1–10 scale). Of the remaining 121 valid informants, 56% were marketing managers and 44% IT managers.Footnote 10 Participants scored high on big data knowledge (mean = 7.03), big data involvement (mean = 6.43), and confidence (mean = 7.26). To account for potential common method bias, we used a variety of scales, pretested the questionnaire, reassured respondents about anonymity and confidentiality, and emphasized that there were no right or wrong answers, to help reduce the likelihood of self-presentation bias. We measured big data marketing affordances with the newly developed scales, big data availability based on the 3 V framework (Johnson et al. 2017; Laney 2001), data-oriented culture based on Davenport et al. (2012), and marketing-IT collaboration with a measure adapted from Kahn and Mentzer (1998). The Appendix reports all measurement items. An EFA reveals the six factors, and a CFA on all scales shows good fit indices (χ2(d.f.) = 360.70 (284); p < .01; CFI = .97; TLI = .97; RMSEA = .05; SRMR = .05).

Discriminant validity

We assessed whether the three big data marketing affordances are empirically distinct from the three related constructs. First, we compared the correlation between all six constructs with their AVEs. All pairwise correlations were smaller than the square root of the AVE for each construct (see Web Appendix 5). Further, combining any related construct with any of the big data marketing affordances significantly decreased the overall model fit (all Δχ(1) > 147.32, p < .01), indicating discriminant validity. Taken together, through Study 1a, Study 1b, and Study 1c we developed three new scales for big data marketing affordances, established scale dimensionality, and provided evidence for discriminant validity against related measures. Next, using new data, in Study 2 we further validate the newly developed scales (as reported in Web Appendix 3 and 4), and test their nomological validity in relation to a wider set of constructs, including antecedents, consequences and contingencies.

Study 2: Testing the nomological network of big data marketing affordances

In Study 2, we test the nomological network of big data marketing affordances. In doing so, we advance a set of hypotheses and provide preliminary empirical findings on the process linking big data investments and performance, which include indirect effects and boundary conditions. By investing in big data, firms create the conditions for the actualization of big data marketing affordances (i.e., new BDTA-enabled marketing actions). In line with affordance theory, big data marketing affordances improve a firm’s ability to develop and introduce valuable service innovations (Lehrer et al. 2018). Finally, service innovation is linked to perceived big data performance, so that overall, there is a process from the initial big data investments to perceived big data performance, mediated by big data marketing affordances and service innovation. Finally, industry digitalization moderates the positive effect of service innovation on perceived big data performance (see Fig. 1).

Fig. 1
figure 1

Conceptual model for the nomological test of big data marketing affordances

Hypotheses development

We propose that big data investments positively affect customer behavior pattern spotting, real-time market responsiveness, and data-driven market ambidexterity. Following the key tenet of affordance theory, BDTA change the socio-technical system of organizations, and interact with their capabilities and goals to enact specific new actions (Dremel et al. 2020). By investing in big data, firms actualize the three affordances for data-driven marketing as they become able to perceive the action possibilities offered by BDTA, experiment with such actions, socialize this experience across different teams and sub-units, and embed these actions in their structures and processes (Strong et al. 2014). Based on this logic, we expect that:

H1:

Big data investments are positively related to (a) customer behavior pattern spotting, (b) real-time market responsiveness, and (c) data-driven market ambidexterity.

As described in the literature and anecdotally pictured in the examples above, big data marketing affordances widen the space for service innovation opportunities that firms can pursue in both the short and the long term (Davenport et al. 2012; Huang and Rust 2017; Rust and Huang 2014). Customer behavior pattern spotting and real-time market responsiveness are more likely to lead to short-term innovation opportunities, while data-driven market ambidexterity may produce more long-term innovation benefits. Customer behavior pattern spotting may lead to incremental service innovations that leverage superior knowledge of users’ consumption patterns. For example, digital content may be reconfigured, customized, and/or prioritized based on customer behaviors. Also, real-time market responsiveness can lead to designing new service offerings that adapt and respond dynamically to the customer context, thereby providing timely and valuable new offerings and new customer experience. Finally, data-driven market ambidexterity may lead to new service concepts that leverage resource complementarities and other synergies for service innovations that only become apparent through BDTA. Thus, in line with affordance theory, we advance that the potential for service innovation that firms are able to afford via big data investments is realized via the three big data marketing affordances, such that:

H2:

The effect of big data investments on service innovation is mediated by (a) customer behavior pattern spotting, (b) real-time market responsiveness, and (c) data-driven market ambidexterity.

Previous research empirically supports a link between big data investments and firm performance (e.g. Müller et al. 2018; Wamba et al. 2017), but does not explore the intermediate processes accounting for this link. We hypothesize that the overall effect of big data investments on perceived big data performance is at least partially explained by service innovation and the big data marketing affordances. In line with previous research that proposed innovation as a mediator between outside-in capabilities, such as market intelligence, and performance (e.g., Han et al. 1998), we argue that big data marketing affordances generate more opportunities to use data in novel and useful ways to locate new service ideas. In turn, this enhances the firm’s service innovation competence, such that the benefits of big data investments and related affordances accrue through the launch of new and successful services in the marketplace (Rust and Huang 2014). Taking these arguments together, we advance that:

H3:

Service innovation mediates the effect of (a) customer behavior pattern spotting, (b) real-time market responsiveness, and (c) data-driven market ambidexterity on perceived big data performance.

H4:

The effect of big data investments on perceived big data performance is mediated by (a) big data marketing affordances and (b) service innovation.

There is strong consensus in the marketing strategy literature that the relationship between firm innovativeness and firm performance is contingent on salient industry characteristics (e.g., Dotzel et al. 2013; Mizik and Jacobson 2003). In particular, cumulated empirical evidence exists that firms in high-tech industries benefit more from innovativeness than their low-tech industries counterparts (Rubera and Kirca 2012). Based on this established theoretical foundation, we examine the moderating role of industry digitalization between service innovation and perceived big data performance within our nomological network. Specifically, we expect the effect of service innovation on perceived big data performance to be stronger for firms competing in industries with a high (versus low) degree of digitalization, for two main reasons. First, when industry digitalization is high, customers will be more sensitive to offerings that embed a significant digital innovation element, and will adopt them faster, thus improving innovation returns (Rubera and Kirca 2012). Second, in such contexts, BDTA become a stronger driver of differentiation, and the presence of premium innovative offerings is more likely. In contrast, when industry digitalization is low, we expect the rewards from service innovation to be reduced, due slower customer adoption and weaker digital competition. In line with this argument, recent studies support the notion that firms in information technology-intensive industries extract more value from BDTA assets, than firms in less technologically developed industry groups (Müller et al. 2018; Tambe 2014). We also expect this contingency to qualify the indirect effect of big data investments on perceived big data performance.

H5a:

The effect of service innovation on perceived big data performance is moderated by industry digitalization, such that the effect is stronger when digitalization is higher.

H5b:

The indirect effect of big data investments on perceived big data performance via big data marketing affordances and service innovation is moderated by industry digitalization, such that the effect is stronger when digitalization is higher.

Context and data collection

We used data from a survey among 110 Swiss firms at two points in time, separated by four months in 2018, to test our hypotheses. Big data investments and big data marketing affordances were measured at T1, and service innovation and perceived big data performance were measured at T2. We used a panel database of a professional marketing association to contact 707 senior managers responsible for marketing and/or marketing analytics with an invitation to an electronic survey. After one follow-up, we received 184 fully completed T1 questionnaires (26% response rate) from managers working in various industries.Footnote 11 Respondents have on average 8.54 years of experience with the firm, and score 6.11 on a seven-point scale measuring the degree of knowledge about big data processes in their firm. After four months, the 184 managers received a reminder and a gift certificate incentive, and 110 informants provided the requested information (60% T2 response rate). Participants scored high on big data knowledge (mean = 7.38), involvement (mean = 6.69), and confidence (mean = 7.11).

In line with the recommendations from Hulland et al. (2018), we used multiple a priori methods for dealing with common method bias. Namely, we temporally separated the measures of dependent and independent constructs, we employed different response scales for different variables, used physical separation among related scales within the questionnaire, and we validated the subjective measure of big data investments with objective measures related to the number of dedicated big data employees newly employed and IT investment allocations.Footnote 12

Measurement

The Appendix provides the measurement items for all constructs. We used the newly developed scales for big data marketing affordances and measured big data investments with three items tapping the extent to which firms invest resources in big data projects, consider big data as a priority in allocating resources, and invest in human talent for big data. These items emerged from our interviews and are consistent with the indicators of big data investments from the literature (McAfee et al. 2012; Wamba et al. 2017). We measured service innovation based on the three dimensions of service concept, service process, and customer experience (Barrett et al. 2015; Den Hertog 2000). We used a 3-item scale for each first-order factors, and aggregated them, with parceling, into a second-order factor capturing service innovation (χ2(d.f.) = 54.63 (24); p < .01; CFI = .96; TLI = .94; RMSEA = .11; SRMR = .07). We used a 5-item scale based on Wamba et al. (2017) to measure perceived big data performance (i.e., the organizational perceptions of the extent to which big data contribute to a firm’s performance by generating a positive return on investments). In line with Edeling and Himme (2018), we adopted Gandhi et al.’s (2016) measure of industry digitalization to categorize industries from weakly influenced (e.g., hospitality) to highly influenced (e.g., ICT) by digitalization on a six-point scale.

We controlled for several observed sources of heterogeneity across firms. Namely, we controlled for big data availability (e.g., Johnson et al. 2017) because the firm-specific availability of data might determine both big data marketing affordances and their outcomes; data-oriented culture (Davenport et al. 2012), because such culture could contribute to the effectiveness and efficiency of big data projects; and marketing-IT collaboration (Kahn and Mentzer 1998), because strategically involving the IT department within the marketing sphere fosters the successful implementation of BDTA projects within firms (Brinker and McLellan 2014; Sleep et al. 2019). To account for the impact of human capital and the key role of data scientists, we controlled for the number of data specialists. Finally, to account for industry and firm characteristics, we controlled for B2B versus B2C focus, for the proportion of pure services versus services including tangible goods in a firm’s offering, and for firm size (employee number) and age. Table 4 reports descriptive statistics, correlations, Cronbach’s alpha, and AVEs for all variables. A CFA on all multi-item scales indicated a good fit with the data (χ2(d.f.) = 836.84 (558); p < .01; CFI = .94; TLI = .92; RMSEA = .05; SRMR = .04).

Table 4 Inter-construct correlations and descriptive statistics in Study 2

Model development

Accounting for self-selection bias

As we could only include firms in the analysis for which T2 data were available, our estimates may be biased due to a self-selection process (e.g., less successful firms choosing not to participate in the second survey). To address this issue, we employed a two-stage selection model. In the first stage, we estimated a choice model with the availability of T2 data as the binary dependent variable. From this model, we compute the inverse Mills ratio (IMR) to account for a potential selection bias.

Accounting for unobserved heterogeneity

Beyond the observed covariates, unobserved factors could potentially influence perceived big data performance. For example, some firms might have a higher veracity in their data, or employees more open towards data-driven insights compared to other firms, yet we were not able to observe this. Not accounting for such unobserved factors can lead to statistically biased, inconsistent parameter estimates. To account for unobserved heterogeneity, we follow a semiparametric approach and represent the intercept term with a finite number of support points to account for heterogeneity across firms.

Accounting for endogeneity

Additional concerns about endogeneity may arise because some variables that are omitted might influence both perceived big data performance and key explanatory variables. First, a firm’s strategic choice to invest in big data is made in anticipation of future performance (Müller et al. 2018), thus big data investments could be endogenous. Second, all three big data marketing affordances depend on both firm decisions and the characteristics of the available data (Sleep et al. 2019), and the latter are unobservable for us. Third, service innovations not only depend on firms’ decisions but also on consumer acceptance and competitive actions (Gielens and Steenkamp 2007). Such external forces are unobservable for us. As finding valid instruments was not feasible, we used the latent instrumental variable (LIV) approach as introduced by Ebbes et al. (2005). In line with Herhausen et al. (2020), we used a two-step approach where we first partition the variance of each endogenous regressor into an endogenous and an exogenous component, and then use both the endogenous and exogenous components as predictors in the main model (see Web Appendix 6).

Model estimation and results

Estimation strategy

We started by estimating the IMR correction term to address the self-selection bias. The probit model in Web Appendix 7 shows acceptable fit indices (pseudo R2 = .20; Wald χ2 = 40.30, p > .01), and indicates that big data investments (β = 59, p < .01), big data availability (β = .47, p < .01), the B2B vs. B2C focus (β = 1.82, p < .01), and service mix (β = −1.30, p < .05) are significantly linked with T2 data availability. We included the correction term IMR in the final model estimation. To identify second-stage parameters, in line with recommended exclusion restrictions (Hausman 1978), we dropped the B2B vs. B2C focus and service mix variables from our final models, as they satisfy both relevance and exogeneity requirements (i.e., they do not have direct relationships with the dependent variables – all r < .08, all p > .38 – but explain firms’ choice to participate in the second survey).

Next, we estimated the LIV corrections for each independent variable (Wang et al. 2017). The Akaike Information Criterion (AIC3) suggested three-support point solutions for big data investments and customer behavior pattern spotting, and four-support point solutions for real-time market responsiveness, data-driven market ambidexterity, and service innovation.Footnote 13

Finally, we estimated Eqs. 4a to 4e simultaneously. We standardized all explanatory variables, used robust standard errors, and excluded non-significant controls from the equations to have the most parsimonious model. In particular, number of data specialists, firm size, and firm age were excluded from our model. The variance inflation factors ruled out any multicollinearity concerns (all VIF < 2.65). The AIC3 criterion for the final model suggested a two–support point solution to control for unobserved heterogeneity.Footnote 14

Hypothesis testing

We present the results from three models in Table 5, one model without endogeneity correction, the hypothesized model, and a model with an additional, non-significant, interaction effect of industry digitalization and customer behavior pattern spotting on perceived big data performance. Although the results are generally consistent across models, some LIV error terms and the IMR suggest the importance of accounting for endogeneity. Thus, we discuss the results of the hypothesized model that incorporates the endogeneity correction (Model 2).

Table 5 Results of Study 2

The results offer support for the positive effects of big data investments on customer behavior pattern spotting (γ = .34, p < .01), real-time market responsiveness (γ = .37, p < .01), and data-driven market ambidexterity (γ = .46, p < .01), in support of H1a, H1b, and H1c.

Real-time market responsiveness (γ = .24, p < .05), and data-driven market ambidexterity (γ = .30, p < .01), but not customer behavior pattern spotting (γ = −.14, ns), are positively related to service innovation. Consequently, both real-time market responsiveness (indirect effect: γ = .09, 95% CI = .01 to .17), and data-driven market ambidexterity (indirect effect: γ = .14, 95% CI = .04 to .24), mediate the relationship between big data investments and service innovation. Thus, H2b and H2c are supported, while H2a is not (Table 6).

Table 6 Indirect effects in Study 2

Service innovation has a positive effect on perceived big data performance (γ = .26, p < .01), and mediates the effects of real-time market responsiveness (indirect effect: γ = .06, 95% CI = .01 to .11), and data-driven market ambidexterity (indirect effect: γ = .08, 95% CI = .01 to .14) on perceived big data performance, supporting H3b and H3c. Customer behavior pattern spotting has a positive direct effect on perceived big data performance (γ = .26, p < .01), rejecting H3a.

We also find an indirect effect of big data investments on perceived big data performance via customer behavior pattern spotting (single mediation: γ = .09, 95% CI = .03 to .15), and an indirect effect of big data investments on perceived big data performance via real-time market responsiveness/data-driven market ambidexterity, and service innovation (serial mediation: γ = .06, 95% CI = .02 to .10), in support of H4a and partial support of H4b.Footnote 15

We further find a positive and significant interaction effect between service innovation and industry digitalization on perceived big data performance (γ = .25, p < .01), in support of H5a. Service innovation is positively related to perceived big data performance in industries with high digitalization (γ = .51, p < .01), but not in industries with low digitalization (γ = .01, ns).

The indirect effect of big data investments on perceived big data performance via the serial mediation of real-time market responsiveness/data-driven market ambidexterity and service innovation is also contingent on industry digitalization. Big data investments are positively linked to perceived big data performance in industries with high digitalization (γ = .22, 95% CI = .07 to .36), but not in industries with low digitalization (γ = −.02, 95% CI = −.07 to .04). Thus, H5b is supported. Given the direct effect of customer behavior pattern spotting on perceived big data performance, we explored whether industry digitalization also moderates this effect in Model 3 but did not find a significant interaction (γ = .02, ns). Thus, even firms in low-digitalization industries may benefit from big data investments via customer behavior pattern spotting (γ = .09, 95% CI = .01 to .16).

Robustness tests

Since we measured both service innovation and perceived big data performance at T2, we collected additional performance data to test for a lagged effect of service innovation. We were able to obtain data from a subsample of 51 participants two years after the initial data collection (46% response rate). The time-lagged perceived big data performance measure has a strong positive correlation with perceived big data performance in the original data collection (r = .56), and results of a simple regression analysis are in line with our main analysis: service innovation has a positive effect on lagged perceived big data performance (γ = .27, p < .01), and this effect is moderated by industry digitalization (γ = .27, p < .05).

Results from Study 2 provide a further validation of the three scales and generally support our expectations regarding the chain of relationships connecting big data investments to perceived big data performance, via big data marketing affordances and service innovation. We show that service innovation mediates between real-time market responsiveness and data-driven market ambidexterity and perceived big data performance; in contrast, customer behavior pattern spotting is directly related to perceived big data performance. We also find an overall indirect positive effect of big data investments on perceived big data performance, indicating that investing resources into big data may drive superior performance. Yet, this indirect effect is conditional on industry digitalization, as firms in highly digitalized industries benefit more from big data investments. Still, all firms benefit from customer behavior pattern spotting.

Discussion

Theoretical implications

Our study provides several important contributions to theory. First, we introduce affordance theory to the marketing strategy literature. Information technologies generate possibilities for marketing actions that reshape the competitive landscape. This implies a transformation of marketing towards the formerly specialized area of service marketing and an increased emphasis on marketing analytics (Rust and Huang 2014). To address this challenge, the marketing theory toolkit needs to be enriched with new concepts, logics, and relationships that account for the digitized world. This purpose is well served by affordance theory, which is emerging as the overarching theoretical framework to understand the impact of technological developments for organizations (Nambisan et al. 2017). Our newly developed measures of big data marketing affordances represent a step forward in big data marketing research, and allow for the first quantitative test of affordance theory in the literature. In addition, by widening the scope of big data affordances to the marketing domain, we address a limitation of previous studies of big data affordances that focused predominantly on technological affordances (Dremel et al. 2020; Lehrer et al. 2018). In fact, while our big data marketing affordances scales are anchored to BDTA, their conceptualization and nomological implications go beyond data management and analytics. Thus, our conceptualization and empirical findings connect recent theoretical developments from marketing and IS, and answer calls for a more solid platform of interdisciplinary research between these two fields (Rust and Huang 2014).

Second, we contribute to the literature on marketing analytics and service innovation in data-rich environments (e.g., Troilo et al. 2017; Wedel and Kannan 2016) by identifying different antecedents of service innovation among big data marketing affordances. We find that while real-time market responsiveness and data-driven ambidexterity lead to service innovation, customer behavior pattern spotting does not. This result deviates from our prediction and is worth attention. A possible explanation is that customer behavior pattern spotting is deployed in an exploitation mode, to optimize existing service processes and performance. This may be plausible especially if managers look for patterns of customer behaviors in well-known domains that are linked to short-term marketing metrics, such as for example reducing churn, increasing customer engagement, or improving communication campaigns. All these outcomes will benefit significantly from customer behavior pattern spotting but may produce rather negligible effects on service innovation. Conceptually, this implies different routes through which the effect of big data investments generates superior performance. Big data marketing affordances relate to perceived big data performance either via service innovation (for real-time market responsiveness and data-driven market ambidexterity), or directly (for customer behavior pattern spotting); thus, we provide a more nuanced picture the contribution of big data investments and marketing affordances to perceived big data performance compared to previous literature.

Third, our middle-range theorizing approach fills an important gap between general and more de-contextualized theories of the link between big data investments and performance, prevalent in IS and strategic management (resource-based view, knowledge-based view, and dynamic capabilities; e.g., Wamba et al. 2017; Grover et al. 2018), and the need for a tighter connection between theoretical formulation and empirical findings in the marketing context (Brodie et al. 2011). In fact, the mechanisms through which BDTA operate are radically different from the traditionally known sources of market intelligence, as big data emerge unintentionally, are mostly unstructured, and often redundant. Thus, the meaning and value of big data is defined ‘on the way out’ rather than ‘on the way in’ (Constantiou and Kallinikos 2015). Our affordance theory framework addresses this issue by promoting an action-based view of the value of BDTA for marketing, against previous explanations based on organizational information processing (Maltz and Kohli 1996), and the knowledge-based view (Erevelles et al. 2016).

Finally, to further strengthen this point, we also find that big data investments pay off more in highly digitalized industries, owing to the moderating role of industry digitalization between service innovation and perceived big data performance; this finding extends previous marketing literature that narrowly focused on the direct effects of big data investments on the bottom-line (e.g., Wamba et al. 2017). This additional result contributes to the emerging literature describing the process, and conditions under which, big data investments and digital transformation efforts pay off for companies (Müller et al. 2018).

Managerial implications

Managers aiming to exploit the potential of BDTA find limited inspiration from extant research rooted in traditional marketing paradigms. Through our affordance theory lens, we offer marketing decision-makers some recommendations to reap the benefits of big data investments.

Shift your efforts from data to actions

Managers must be aware that what makes big data pay off is not their large volume, high velocity, or broad variety but rather the organizational ability to actualize the action potential offered by BDTA (Shah et al. 2012). We introduce three big data marketing affordances that may guide the extraction of valuable marketing insights from data and the implementation of corresponding actions. In fact, customer behavior pattern spotting allows organizations to increase the effectiveness and efficiency of marketing activities; real-time market responsiveness affords the ability to respond to market events with minimal time latency; data-driven market ambidexterity improves a firm’s ability to identify new market opportunities leveraging on their current offers. For this reason, managers must identify those actors within their organization who are in the best position to actualize BTDA marketing affordances, and empower them with the responsibility to take actions.

Anticipate your return on investment pathway

Our findings show that these three affordances provide two alternative routes to performance. The first route leads to the improvement of the efficiency and effectiveness of actual marketing activities. Managers who wish to follow this path should invest in BDTA that afford the identification of patterns of customer behaviors, thereby allowing organizational members to optimize budget allocations across marketing activities. The second route aims to innovate in terms of new service concepts, processes, or customer experience. Managers who are prone to achieve this objective should focus their efforts on BDTA that afford the implementation of individualized marketing actions nurtured by data collected in real time, and on BDTA that identify market opportunities built on the synergies between new and existing service offers.

Sync your big data strategy with your industry digitalization stage

Our findings point to the contingent effect of industry digitalization. In fact, if an industry is at an earlier stage of digitalization, managers should focus on improving efficiency and effectiveness via customer behavior pattern spotting. Managers competing in industries at a later stage of digitalization should complement such affordance with data-driven service innovations that build on increased marketing action potential allowed by real time market responsiveness and the creation of new opportunities allowed by data-driven market ambidexterity.

Expect the unexpected insight

Big data investments change the way to acquire, disseminate, and use market intelligence. In fact, big data investments enable to complement structured data intentionally collected through more traditional marketing research activities for premeditated information needs with a large variety of unstructured data. To benefit from such opportunities, managers must design and implement actions to leverage on emerging and unexpected patterns of customer behaviors, to allocate decision-making responsibilities to exploit real-time responsiveness to unexpected market events, and to favor new opportunities stemming from the synergies between existing and future services.

Adopt a servitization perspective in every industry

Due to digitization and the availability of BDTA, the boundaries between services industry and the goods sector are blurring. For example, the Internet-of-Things enables product-driven manufacturing firms to collect big data on customer behaviors and leverage these data to design innovative services. Although we focus on service innovation, we believe that our implications generalize beyond the service industry. Thus, we also recommend managers in manufacturing firms to adopt a service-based perspective in order to reap the benefit of BDTA.

Limitations and future research directions

Our study offers theoretical, empirical, and managerial contributions as highlighted in the previous sections, yet it is also subject to some limitations that need to be acknowledged. First, we collected self-reported and some close-to-factual data, yet we were not able to employ secondary data to measure variables such as big data investments or big data performance. Future research may use multi-source data to further corroborate our findings. In particular, it would be desirable to disentangle the dimensions of big data performance in future studies. Second, our samples of informants come from two European countries (Italy and Switzerland), with relatively similar business environments, broadly comparable to those of developed Western countries. Thus, our findings do not immediately generalize to other geographical settings with different business contexts around big data investments and use (i.e., China). Third, we only consider three types of big data marketing affordances based on previous conceptual literature; while we provide empirical evidence of their relevance and significance for managers, future research should consider new and emerging affordances and their antecedents and consequences, in order to capture their dynamic nature across actor/time/space combinations. For example, our data point to sales-related affordances as an attractive area for future research.

In addition to addressing the limitations highlighted above, future research could unearth the organizational processes through which firms actualize big data marketing affordances. This could be done by including a set of contingencies in the relationship between big data investments and big data marketing affordances, and between the latter and service innovation. Another avenue for future research based on our framework is to analyze big data investments and affordances at the individual level (i.e., frontline employee, salesperson) or the team level (i.e., service or sales teams) to complement our organizational-level approach (Marinova et al. 2017). This effort will be consistent with the multi-level nature of customer service, affordances, and innovation already highlighted in the respective literatures. Finally, while we focus on the firm’s perspective, future work can look at the possibilities for action that BDTA offer to individual customers, and how these affect customers’ behavior, engagement, and other relevant marketing outcomes. To conclude, our study can inspire researchers and doctoral students from marketing and IS to develop a joint platform for theoretical and empirical research. We show how marketing researchers may adopt affordance theory as a theoretical lens to investigate BDTA, artificial intelligence and other related technologies in relation to marketing strategy outcomes; in parallel, we offer IS researchers the opportunity to incorporate our constructs and measures in future quantitative studies of big data affordances in organizations.