Persona Studies 2018, vol. 4, no. 2 47 ARE PERSONAS DONE? EVALUATING THE USEFULNESS OF PERSONAS IN THE AGE OF ONLINE ANALYTICS JON I SALMI N EN, BE RN AR D J. JANS E N, JISU N AN, HAE WOON KWA K A N D SOON-GY O JU NG ABSTRACT In this research, we conceptually examine the use of personas in an age of large-scale online analytics data. Based on the criticism and benefits outlined in prior work and by practitioners working with online data, we formulate the major arguments for and against the use of personas given real-time online analytics data about customers, analyze these arguments, and demonstrate areas for the productive employment of data-driven personas by leveraging online analytics data in their creation. Our key tenet is that data-driven personas are located between aggregated and individual customer statistics. At their best, digital data-driven personas capture the coverage of the customer base attributed to aggregated data representations while retaining the interpretability of individual-level analytics; they benefit from powerful computational techniques and novel data sources. We discuss how digital data-driven personas can draw from technological advancements to remedy the notable concerns voiced by scholars and practitioners, including persona validation, inconsistency problem, and long development times. Finally, we outline areas of future research of personas in the context of online analytics. We argue that to survive in the rapidly developing online customer analytics industry, personas must evolve by adopting new practices. KEY WORDS Data-Driven Personas; Online Analytics; Customer Segmentation INTRODUCTION The abundance of social media data has increased the difficulty of making sense of data (Järvinen 2016; Saggi & Jain 2018; Salminen, Milenković & Jansen 2017), while at the same time making it possible to automatically infer customers attributes from social media that were previously accessible only by survey research. Researchers have begun utilizing publicly available social media posts to infer customer attributes, including personality traits, political orientation, brand liking, and needs and wants (Ardehaly & Culotta 2015; Del Vecchio, Mele, Ndou, & Secundo 2017; Jung, An, Kwak, Salminen, & Jansen 2017; Volkova, Bachrach, & Durme 2016). Free expression in social media provides opportunities to learn about the needs and traits of groups and individuals (Owusu et al. 2016). Overall, the development of computational techniques and the availability of online data has resulted in an increased interest in data- driven personas: (a) to describe the content consumption patterns of diverse online audiences Salminen et al. 48 (Salminen, Şengün, et al. 2018), and (b) to use online data, namely online analytics and social media posts, in persona generation (An, Kwak & Jansen 2017; Zhang, Brown & Shankar 2016). However, at the same time, the usefulness of personas has been questioned. There is a plethora of alternative online analytics tools (e.g., Google Analytics, SimilarWeb), services (e.g., comScore, HitWise) and metrics (Boghrati et al. 2017; Clarke & Jansen 2017; Järvinen & Karjaluoto 2015) that one can employ to understand customers. Moreover, companies have gained access to individual-level data that performs well for many marketing purposes, including customer relationship management (Zerbino et al. 2018), providing tailored experiences and recommendations (Ronen, Yom-Tov & Lavee 2016), one-to-one targeting of online ads (Miralles-Pechuán, Ponce & Martínez-Villaseñor 2018), and enabling the creation of sophisticated customer segments (Jansen et al. 2017; Rundle-Thiele, Dietrich & Kubacki 2017). Although prevalent in many fields, the use of automated techniques is especially pertinent in the field of digital marketing which is shifting toward a higher degree of personalization, micro- targeting, and one-to-one marketing (Bleier & Eisenbeiss 2015). Thus, there are concerns about personas providing real value in such an environment. In this research, we evaluate the use and usefulness of personas given the easy availability of online individual customer data for digital marketing use cases. Traditionally, personas are used in replacement of actual one-to-one data about customers (Howard 2015), as mental models that help keeping customers in mind in the absence of having real customers available when making decisions about design (Nielsen 2013), software development (Pruitt & Grudin 2003), and marketing (Russell & Toklu 2011). However, with the widespread availability of online analytics data and numerous programs, techniques, and platforms to process the gathered data, a research question arises: Are personas still valid as a marketing tool in the era of online analytics? We address this question through a conceptual inquiry and analysis, drawing from two streams of literature: (a) persona studies (for drawing the benefits and shortcomings of personas) and (b) digital marketing research (for drawing the use of digital marketing). We aim to conceptually combine these streams to better understand the role of personas in the era of online analytics and digital marketing. Therefore, this work is conceptual research discussing the value of personas in the era of online analytics. Our purpose is to review the related literature for better positioning personas given the context of online analytics, which is currently lacking in the extant persona literature. With our inquiry, we aim to avoid the advocacy issue mentioned by Matthews, Judge & Whittaker (2012, p. 1220): “[persona] literature […] generally takes a position of advocacy and lacks objectivity.” That is, we explore both the possible strengths and the shortcomings of personas in an objective manner. First, we identify the benefits to which personas are associated in the literature. Second, we summarize the traditional criticisms of personas. Third, we formulate new critical arguments against the use of personas given the widespread availability of online analytics data. We then address these arguments, examining their relevance to traditional and digital data- driven personas, and discuss the ability of technology to solve the major shortcomings of personas. Finally, we conclude by presenting ideas for the future of personas in the era of online analytics. Our key contributions are twofold. First, we present novel criticism relating to the use of personas for customer-related decision making, addressing concerns from both digital analytics practitioners and academic scholars. Second, we reposition personas in the age of online analytics by conceptually distinguishing between traditional data-driven personas and digital data-driven personas, and analyzing the implications of both for solving persona challenges. We, Persona Studies 2018, vol. 4, no. 2 49 therefore, establish and discuss the role of personas in the context of online analytics, which has not previously been done in the persona literature. METHODOLOGICAL APPROACH Being a conceptual and survey work, this manuscript relies on prior literature as a source of evidence. To this end, we perform a literature review to identify the key benefits and criticisms of personas in the academic literature. To find the works of this domain, we conducted searches on Google Scholar and Science Direct with the base keyword “personas” and then scanned the abstracts of the articles for cue words indicating weaknesses, downsides, or critiques of personas. In this literature query, we focused on finding critical papers, as most persona studies list the benefits but more rarely include criticism (Matthews, Judge & Whittaker 2012). The literature on persona benefits is, therefore, easier to find. In total, we identify 32 relevant papers, many of which are conference articles in the field of computer science (sub-fields e.g. human-computer interaction, design, and software development). We read these articles in detail and use them to discover related articles discussing the strengths and weaknesses of personas (a technique referred to as snowball sampling (Provan & Milward 1995). WHY ARE PERSONAS USEFUL? Cooper (1999) introduced personas as a design technique for understanding and communicating the goals and needs of different user types. Since then, personas have been employed by designers, software developers, and marketers, among other decision-maker groups (Nielsen 2013; Pruitt & Adlin 2006; Mulder & Yaar 2006). Personas crystallize a specific user type, often focusing on core users in the absence of an immediate contact to the end user (Floyd, Jones & Twidale 2008). A user can refer to a user of a software system, such as a website or mobile application, a player of a game, customer of a product or service, audience of online content, target segment for marketing campaigns, a patient of public health services, and so on (Pruitt & Grudin 2003; LeRouge et al. 2013; Ma & LeRouge 2007; Scott 2007; Dong, Kelkar & Braun 2007; Nacke, Drachen & Göbel 2010; Vahlo & Koponen 2018). In the remainder of this work, we use the concept of ‘customers’ when referring to these groups. In this research, we examine personas specifically in the context of online analytics, adopting the perspective that personas are created to provide real value for their end users (Cooper 1999; Gudjónsdóttir 2010), such as achieving more user-friendly designs or more empathetic advertising texts. Evaluating personas, therefore, has to take place in relation to their value in use (Kaartemo, Akaka & Vargo 2017). The benefits of personas according to the literature are summarized in Table 1. The communicational benefits arise from summarizing customer information into an intuitive format of representation that can be communicated with little effort (Holtzblatt, Wendell & Wood 2005) within organizations, teams, departments, and with external stakeholders (Matthews, Judge & Whittaker 2012). In theory, personas provide an engaging description of the end users’ needs and wants, in the form of another human being that is more memorable than numbers (Goodwin 2009; Hill et al. 2017). At their best, personas become shared mental models that individuals rely upon when making decisions (Nielsen 2013) that concern the specific user type (Cooper 1999). This enables the decision makers to discuss experiences and backgrounds different from their own and realize that the customer preferences may deviate from their own preferences (Miaskiewicz & Kozar 2011). Salminen et al. 50 Table 1: Benefits associated with the use of personas Category Description Communication Personas facilitate user-oriented communication within and between teams in the organization. Psychology Personas enhance the immersion required for designing ‘for a person’ instead of fuzzy and complex target groups. Transformation Personas challenge existing assumptions about customers and orientate trade-off decisions when customers have conflicting needs. Focus Personas help focus design decisions on user goals and needs rather than on system attributes and features. The psychological benefits are rooted in identification with the personas (Miaskiewicz, Sumner & Kozar 2008), whereby decision makers can obtain an empathic understanding of users, immersing themselves in real situations experienced by others. Decision makers can use this ability to predict customer behavior under different circumstances (Pruitt & Grudin 2003). This mental modeling relies on human beings’ innate ability of empathy and immersion (Krashen 1984), and is, therefore, a powerful agent for enhanced motivation and purpose. At best, personas can give a higher sense of meaning to one’s work. Consider the psychological difference between creating a software product to the nameless target group of 24–35 year-old women, compared to creating a mobile application for Jane, a stressed single mum who wants to better manage her time. Transformational benefits relate to challenging the established perceptions about the users within the organization (Miaskiewicz & Kozar 2011). Because the creation of personas is based on gathering real evidence of the users (Pruitt & Grudin 2003), the results can deviate from the existing preconceptions and truisms within the organization. When there is friction between the perceived and real goals of the users, accurate persona representations can reduce this perceptional gap by conveying factual information about users’ needs and wants (Pruitt & Adlin 2006). If the organization and the decision makers are open to re-aligning their perceptions, the persona exercise can prevent and rectify false conceptions of the end users (Matthews, Judge & Whittaker 2012). Finally, personas can facilitate focusing on the most important audiences (Miaskiewicz & Kozar 2011). This helps decision makers with strategic agenda to prioritize certain customers over others, and thus resolve conflicting needs and wants among the customer base. For example, if Persona A wants Feature set x, while Persona B wants Feature set y; by considering the overall strategy of which users the organization wants to serve (a practice known as customer portfolio management) (Johnson & Selnes 2004), we can define the optimal product features to focus on (Cooper 1999; Ma & LeRouge 2007). Thus, personas help to prioritize product requirements and help determine if the right problems are being solved while curbing the self-centering bias of the decision makers (Matthews, Judge & Whittaker 2012). Persona Studies 2018, vol. 4, no. 2 51 CRITICISM OF PERSONAS The literature includes a lot of substantial criticism ofpersonas. To better understand and dissect this criticism, we have categorized it into three sections that are roughly compatible with the typical lifecycle of a persona project: first personas are created (creation of personas), then assessed by decision makers (evaluation of personas), and finally applied in real scenarios and use cases (use of personas). Various critical arguments arise in the literature at each of these stages. These arguments are summarized in Table 2 and discussed afterward. Table 2: Established criticism of personas from prior literature. Category Key issues Authors Creation of personas a) persona creation takes a long time b) personas are expensive to create c) personas can be biased by their creators d) personas are based on non- representative data Pruitt and Grudin (2003); Vincent and Blandford (2014); Hill et al. (2017) Evaluation of personas e) personas lack credibility f) personas are not accurate or verifiable g) the information in personas is not relevant for decision makers h) personas are inconsistent Chapman and Milham (2006); Bødker et al. (2012); Matthews et al. (2012) Use of personas i) not using the created personas j) using personas for politics and power play k) using personas to justify preconceptions l) personas change in time Rönkkö et al. (2004); Rönkkö (2005); Chapman and Milham (2006) Hill et al. (2017) point out that (a) creating quality personas takes considerable time and effort. According to Vincent and Blandford (2014), persona creation can take months. In a similar vein, Pruitt and Grudin (2003) advocate an in-depth research effort for persona creation, typically lasting months. It is seen that for personas to be accurate, considerable investigative work is required. This tends to result in (b) persona projects being expensive, in the range of tens of thousands of American dollars (Marsden & Haag 2016; Miaskiewicz, Sumner & Kozar 2008). Consequently, as Rönkkö (2005) found, the amount of effort may lead to questioning the return on investment of persona projects. Moreover, the high cost of persona creation tends to exclude them from the reach of small businesses and startups, as pointed out by Salminen, Jansen, An, Kwak, and Jung (2018). There are also concerns about the validity of personas, or how well the personas match the reality. A commonly mentioned validity concern is that (d) personas are based on insufficient or non-representative data (Chapman & Milham 2006). Personas that are built based on relatively few qualitative interviews may not represent the underlying user groups in a statistically valid manner. For example, the resulting personas may suffer from bias when the interviewed subjects are chosen based on availability rather than representativeness of the entire customer base. Overall, (c) personas risk inheriting organizational tensions and individual biases, including political and strategic ambitions of their creators (Hill et al. 2017; Massanari 2010; Rönkkö 2005). Vincent and Blandford (2014, p. 1098) argue that “[persona creation] has Salminen et al. 52 been adapted, depending on what people want to accomplish and why.” Since persona representations are not easily verifiable, a deliberate selection bias can take place. This bias can also be involuntary so that the creators of the personas are projecting their own prejudices unconsciously in the persona description. Additionally, since persona creation work is typically qualitative, (e) personas lack the credibility of numbers and are, to some, interpretative and subjective instead of rigorous and believable (Chapman & Milham 2006). Even when using the best practices of qualitative inquiry, number-oriented decision makers may consider personas as ‘nice narratives’ instead of serious decision-making instruments, resulting in resistance for the use of personas (Massanari 2010). For example, the participants in the study of Matthews et al. (2012) found personas misleading, abstract, and unrealistic. In a similar vein, Bødker et al. (2012, p. 93) report that “as soon as the project started [...] it became a concern that the 12 personas seemed distant from actual citizens, very general and difficult to activate.” There can be many reasons for such interpretations. For example, the subjective experiences and impressions of decision makers may conflict with personas (Marsden & Haag 2016). Furthermore, other information about users, such as direct customer feedback, can conflict with persona information, because the full complexity and range of the customer base deviates from the idealized personas (Chapman & Milham 2006). In such cases, decision-makers need to consider the credibility of personas against other sources of data. This may result in a willingness to hold on to one’s existing beliefs (Delfabbro 2004), trusting one’s own observations and insights instead of more abstract personas. Moreover, there is no objectively right or wrong answer on which information to include in the persona profile (Bødker et al. 2012; Chapman & Milham 2006). Some decision-makers, for example, might prefer data relating to customer journey while others are more interested in psychographics. Therefore, (g) information selection for personas is arbitrary and may not be of use in a given scenario or use case. The information should be based on the information needs of the end users of personas (Sinha 2003). These needs vary across industries and use cases, even between job roles within the same organization. For example, “marketing personas” would include information such as consumptions patterns and consumer motivations, goals, and likes and dislikes (Thoma & Williams 2009), whereas online content producers would prefer information on content consumption patterns (Nielsen et al. 2017). Additionally, (h) personas are said to be inconsistent, meaning that they are created by combining information from several unrelated data sources, without ensuring that the individual pieces of information are commensurable (Matthews, Judge & Whittaker 2012). Bødker et al. (2012) refer to personas as “Frankenstein’s monsters”, postulating that they can be patched up from any information available. Due to the above reasons, (f) the accuracy of personas is difficult to validate. If decision makers in an organization are aware of this risk, they will not trust the persona representations and will downplay their use in real decision-making situations (Chapman & Milham 2006). The lack of trust is aggravated when decision makers do not personally participate in the persona creation (Long 2009), and, thus, the lack the psychological ownership of the persona artifacts (Bødker et al. 2012). Overall, these adverse dynamics assert strain on the credibility of personas. If the attitude of the decision makers is not favorable to the use of personas, the potential benefits remain unachieved. Finally, we identified five threats for the actual use of personas. First, there are situations where (i) personas are fully developed but then left without meaningful use. For example, Rönkkö et al. (2004) report a case where a considerable amount of time was used to develop personas that were never implemented. In a similar vein, Matthews, Judge and Whittaker (2012) found that personas had little impact on the actual design work. Friess (2012) Persona Studies 2018, vol. 4, no. 2 53 conducted an ethnographic study among designers and found a serious imbalance between the creation and use of personas in real decision-making situations. Chapman and Milham (2006) see the lack of validation resulting in political conflicts where (j) choosing a persona is a question of opinion instead of a fact. For example, the created personas may be rejected in favor of the manager’s pre-existing beliefs about the customers (Chapman & Milham 2006; Pruitt & Grudin 2003), to frame the strategic and operational discussions in the organization (Rönkkö 2005), or (k) interpreted to confirm existing beliefs rather than to seek new explanations (Salminen, Jung, et al. 2018). Rönkkö (2005) also found that personas were used as a form of after-the-fact justification, so design choices based on other inputs were later communicated to other team members as if they were based on the personas. Rönkkö et al. (2004, p. 115) describe a case where the persona evolved in time and was eventually seen differently by stakeholders of the company: “The [...] persona who originated as a middle-age businessman ended up as a less clearly definable figure, e.g. a younger careerist of both the male and the female sex, and a diversity of different professions whose common characteristics was mobility, e.g. salesman, plumber, nurse, policeman, veterinary.” This practice in effect nulls the acclaimed benefit of aligning user understandings within the organization. Finally, (l) use of personas is hindered by changing customer behavior; as the customer behavior changes, personas should be updated to reflect these changes (Jung et al. 2017). However, being that data collection is typically expensive, the updating may not be possible, and personas risk expiring rapidly in real use. Overall, the above challenges risk creating situations where the persona benefits remain largely theoretical and do not materialize in real use cases (Friess 2012; Marsden & Haag 2016). In addition, there are novel concerns arising from the use of personas in comparison to utilizing online analytics data to understand customers. NEW CRITICISM We now move toward introducing the context of online analytics. In addition to the established criticism laid out in the previous section, there have been newly-found criticisms of personas, arising from the availability of online analytics data, metrics, and techniques for business purposes. To demonstrate the logic of this new criticism, we present the following quotations retrieved from recent online writings by practitioners dealing with online customer data: “Whereas personas were once a good starting point to identify ‘buckets’ of customers, the limitations of persona-based marketing have become apparent as the consumer decision-making journey veered from its predictable linear path and increased in complexity.”i “Personas tend to be exhaustive where it’s not needed (demographics, names, pictures are not necessary most of the times), while they fail to summarise the complex variety of needs and usage scenarios that real users express in real life situations.”ii “The idea of a persona or an average customer was the typical way that marketers would think about their customer base. But now with advancements in technology, with modeling, with more available skill sets, they are able to understand and predict future behavior at more granular levels, and it’s a dramatic shift that’s happening.”iii “You’re still clinging to generic user personas in the age of Big Data? LOL."iv Salminen et al. 54 The above critical statements are chosen to illustrate the anti-persona sentiment taking place among some of the data-oriented business professionals. It is not a comprehensive sample, nor do we argue that all business professionals proficient with online analytics data and tools would perceive personas as useless. However, there are several recent blog posts that dispute the usefulness of personas and therefore it is worthwhile to bring this criticism to scholarly attention and analyze it objectively in the proper frame of context. The crux of the criticism can be summed up in three categories: increased complexity (Quotes 1 and 4) – personas are not able to capture the diversity and nuances of the increasingly large online audiences. Typically, one would create only a few personas (less than 10) to describe the core users which may not be enough in the era of online analytics and fragmented consumer behavior. redundant information (Quote 2) – personas are overly focused on superficial demographic information instead of focusing real needs and wants of the customer base. Also, the information presented tends to be static and not dynamic. lack of prediction (Quote 3) – personas are descriptive and not predictive; they cannot be used for prediction, unlike other analytics tools. To understand this criticism, we must bear in mind that personas are inherently connected to decision making about customers. In this sense, they are analytical tools and fall under the scope of other analytics solutions when applied in practical use cases. As such, practitioners using personas are questioning them in comparison to other analytical tools. For example, it is now commonplace to target individual users within digital marketing. Online advertising platforms, such as Facebook Ads and Google AdWords, have constructed social graphs and knowledge graphs (Venkataramani et al. 2012) with each user a node with descriptive properties that can be used for advertisement targeting. Moreover, there have been substantial algorithmic advances that enhance targeting and optimization of advertising (Graepel et al. 2010; Wang & Yuan 2015). Techniques such as multi-armed bandits define the search space and find the best matches given an overall target group or population (Chatwin 2013). Given that users can be targeted and analyzed individually, what purpose is there for aggregated data representations, such as personas? To answer this question, we examine the ability of data- driven personas. EVALUATING DATA-DRIVEN PERSONAS IN THE LIGHT OF CRITICISM: DISTINGUISHING BETWEEN TRADITIONAL AND DIGITAL DATA-DRIVEN PERSONAS Cooper’s (1999) initial idea was for personas to be data-driven, i.e., based on real insights about the users. Later, other scholars working with personas have confirmed this view of personas originating from comprehensive investigative data collection among real customers (Chapman et al. 2008; Howard 2015; Pruitt & Grudin 2003). However, the conceptual difference between manual and digital data-driven personas has been poorly established in the prior literature, even though this distinction is central for understanding the role of personas amidst digital data. Therefore, we separate between traditional data-driven personas (TDDPs) and digital data-driven personas (DDDPs) and focus on analyzing their strengths for solving the criticism proposed by scholars and practitioners. What are DDDPs, then? The primary descriptor of DDDPs is that they bridge persona creation between quantitative data and computational techniques. Several examples can be Persona Studies 2018, vol. 4, no. 2 55 found in the literature. For example, Chapman et al. (2008) use conjoint analysis to reduce a data-set to persona-like representations. Zhang, Brown, and Shankar (2016) use click-stream data to generate personas of website visitors. (An, Kwak & Jansen 2017) develop a methodology and system for completely automatic persona generation using YouTube audience statistics. These novel approaches illustrate ways to combine online analytics data and personas in a way that draws from the benefits of large-scale online data and retains the core benefits of personas in showing the data as another human being. Online analytics data and computational techniques for processing the data provide at least four major advantages for persona creation (Salminen et al. 2017): (1) the possibility to automatically collect large volumes of data through application programming interfaces (APIs), (2) the availability of behavioral data (not only survey and interview responses), providing better grounds for statistical methods (Chapman et al. 2008), (3) scalability, meaning that data analysis algorithms and automatic systems can process millions of user interactions from millions of content pieces, (4) near real-time responsiveness, enabling customer insights to change as the underlying data changes. These features also make DDDPs different from mere “quantitative personas” suggested in the literature (Mesgari, Okoli & de Guinea 2018). The concept of quantitative persona captures the statistical aspect of data analysis, but it does not correctly capture the aspects of automation and large-scale data analysis associated with the use of online analytics data characterized by high volume, velocity, veracity, and variety (Storey & Song 2017). At its best, automatic data collection and analysis is cost-efficient and behaviorally accurate across the whole user base, providing excellent foundations for the creation of data- driven personas. Additionally, DDDPs do not necessarily need to be either quantitative or qualitative, but they can draw from both types of data, as demonstrated by Salminen, Şengün, et al. (2018) with their hybrid personas created using quantitative online analytics data and qualitative insights. How are DDDPs, then, able to solve persona challenges? And how do they compare against TDDPs? We perform a conceptual analysis evaluating these questions for each point of criticism laid out in the previous sections. After this, we discuss the findings and evaluate the benefits. Table 3 provides a comparison of the ability of TDDPs and DDDPs to address the criticism. If we contrast DDDPs against the criticism of personas laid out in the previous sections, we find that they have the potential to solve many acute problems. First, automation enables rapid persona creation. Whereas the creation of personas using manual methods, such as ethnography and surveys, can take several months, digital persona generation system are able to run the required calculations in the matter of a few hours (Jung et al. 2017). Second, the personas can be generated by inferring latent patterns of users’ behavior, e.g. video viewing or website browsing (An, Kwak & Jansen 2017). This technique is robust against personal biases of human creators and produces personas that are based on behavioral data instead of self-stated data that has been shown vulnerable to respondent bias (Fisher 1993). Third, representativeness of the sample is not an issue, when the persona generation is based on the whole user base. For example, Salminen, Şengün, et al. (2017) generated personas from YouTube data of a major online news media company consisting of millions of viewers. Using online analytics data potentially solves the trade-off of relying on either qualitatively rich but non-verifiable data or using numbers that are accurate but lose the immersion of another human being. Salminen et al. 56 Table 3: Evaluation of TDDPs and DDDPs against persona criticism. Applies to Category Key issues TDDPs DDDPs Creation of personas a lot of effort and time needed to create quality personas Yes, manual data collection and analysis No, data collection and analysis can be automated personas are expensive to create Yes, because they require manual labor No, because persona generation can be automated and replicated personas can be biased by their creators Yes, because information selection is made subjectively No, because algorithms decide the information shown personas are based on non- representative data Yes, because sampling is limited No, because one can sample the whole customer base Evaluation of personas personas lack credibility Potentially yes Potentially yes personas are not accurate or verifiable Yes, qualitative analysis is difficult to replicate systematically No, the results can be statistically evaluated the information in personas is not relevant for decision makers Potentially yes Potentially yes personas are inconsistent Potentially yes Potentially yes Use of personas not using the created personas Potentially yes Potentially yes using the personas for politics and power play Potentially yes Potentially yes using personas to justify one’s preconceived notions Potentially yes Potentially yes personas change in time Yes, because data collection and analysis would need to be repeated No, because personas can be updated to reflect changes in data Personas in online analytics context increased complexity Yes, because the number of personas is limited No, because the personas can capture a large number of patterns redundant information Potentially yes Potentially yes lack of prediction Potentially yes No, because the underlying data can be used for prediction Persona Studies 2018, vol. 4, no. 2 57 Fourth, the “file drawer effect” (Rönkkö et al. 2004) becomes an aggravated issue for TDDPs as time goes by because of the underlying customer behavior changes over time but the manually collected data does not change to reflect these changes. DDDPs are responsive to those changes because the personas can be periodically updated to reflect the most recent behaviors. If the data is directly obtained from the APIs of online platforms, this update process can be completely automated, as demonstrated by (An, Kwak & Jansen 2017). Regarding the DDDPs in the context of online analytics, several studies support our argument that DDDPs can handle the increasing complexity of audiences and customer bases. For example, (Liapis et al. 2015) are able to infer dozens of behavioral gaming patterns from their dataset. Salminen, Şengün, et al. (2018) summarize the audience of a large social media channel, consisting of viewers from more than 200 countries, to five representative personas. (Kwak, An & Jansen 2017) identify hundreds of unique content consumption patterns among channel users that they use to generate personas. An example of prediction is demonstrated by Jung, Salminen, An, Kwak, and Jansen (2018). They use online analytics data for persona generation, and then predict the interest of a persona to a given video using an underlying topic matching algorithm. Even though some prediction tasks could be done using TDDPs (e.g., “Would Martin like our content?”), the method of doing so inevitably involves a prominent level of subjectivity, whereas using DDDPs the algorithm treats the prediction as a numerical problem. LIMITATIONS OF DDDPS From the evaluation in Table 3, we see that DDDPs have the greatest potential for solving persona creation and digital context problems. Some of the evaluation issues can also be addressed. In contrast, they do not seem to provide a considerable advantage for use of personas, as the application is subject to organizational and individual biases. Moreover, consistency has been found an issue also in DDDPs (Salminen, Nielsen, et al. 2018) as in TDDPs (Bødker et al. 2012). In fact, inconsistency might be even heightened in automatic persona generation, as there is a subjective safeguard for making sure that the information pieces selected by the machine are topically consistent (An, Kwak & Jansen 2017). Moreover, even in the context of DDDPs, information selection remains a challenge, as Salminen, Jung, et al. (2018) observed in their user study. Despite the progress made in developing DDDPs, there remain many open challenges, such as reliance on current audience data, lack of depth, and a discerning lack of basic attributes in many of the approaches (i.e., not generating persona profiles but behavioral archetypes). Another major limitation of the current DDDPs methodologies is that none of them include deeper information and insights about the users, such as customer pain points, motivations, needs and wants that are essential for the depiction of full, rounded personas (Nielsen 2013). Understanding the deeper motivations of customers is an essential question for marketers (Dichter 1964). Furthermore, while the existing DDDPs may be efficient in modeling current audiences, the decision makers might be interested in potential customers (Thoma & Williams 2009). This interest can be explained by the expansive goals of a typical marketing organization; that is, marketers are pressured to find novel audiences and markets. However, when the DDDPs are generated from current audience data (Jung et al. 2018), such information is not readily available. Thus, we conjecture that the greater the need for reaching new audiences, the riskier it is to use DDDPs for decision making. Moreover, the use of existing online analytics data can lead to confirmation bias. For example, a decision maker may only target Women, age 25–34 with his efforts, so when DDDPs reveal to him that the group is indeed his core customers, he Salminen et al. 58 carries on saying “I was right,” maintaining his targeting and never, in fact, trying out other target groups. In other words, DDDPs must be interpreted carefully. Furthermore, a major limitation of DDDPs is the lack of participation of the team using the personas. The process of persona creation has been found valuable per se, as participation increases the decision makers’ interest in and understanding of personas (Molenaar 2017). This aspect is lacking in DDDPs that are created by algorithms “in distance”, representing a potential threat for adoption and active use of the persona among decision makers (Matthews, Judge & Whittaker 2012). It is, however, possible to create so-called hybrid personas (Miaskiewicz, Sumner & Kozar 2008; Salminen et al. 2017) that combine quantitative and qualitative aspects. Finally, like for TDDPs, another major limitation for DDDPs is that the benefits postulated in the existing persona literature remain potential, depending on the decision makers’ actual willingness to use the personas. Table 4 evaluates the applicability of persona benefits to TDDPs and DDDPs. Table 4: Applicability of persona benefits to TDDPs and DDDPs Applies to Category Description TDDPs DDDPs Communication Personas facilitate user-oriented communication within and between teams in the organization. Potentially yes Potentially yes Psychology Personas enhance the immersion required for designing ‘for a person’ instead of fuzzy and complex target groups. Potentially yes Potentially yes Transformation Personas challenge existing assumptions about customers and orientate trade-off decisions when customers have conflicting needs. Potentially yes Potentially yes Focus Personas help focus design decisions on user goals and needs rather than on system attributes and features. Potentially yes Potentially yes The benefits of personas are a question of value in use (Kaartemo, Akaka & Vargo 2017). That use varies by use case and user of personas. Generally, the same challenges in transcending the theoretical value of personas to practice apply to both TDDPs and DDDPs. However, as alternatives to numerical online analytics data, personas do have some distinct advantages. For example, dealing with numbers poses cognitive challenges for individuals who often cannot recall many numbers at a time (Miller 1956), whereas human attributes are more easily remembered (Mulken, André & Müller 1998). Therefore, DDDPs seem to provide an ample alternative for presenting numerical data, even though they are not perfect. Persona Studies 2018, vol. 4, no. 2 59 DISCUSSION In this research, we examined the role of personas given the widespread availability of online analytics data, and its perceived useful for business purposes. We reviewed scholarly criticism of personas and extended it with fresh perspectives from practitioners working with online analytics data. Although our inquiry is conceptual in its nature, these quotations served as a basis for understanding the challenges and disadvantages of personas in the modern customer analytics environment. To better address the question of personas’ usefulness in this environment, we dissected the concept of personas into TDDPs and DDDPs. This separation was crucial to make the analysis because the two types of personas are different in their ability to address the challenges associated with personas in the context of online analytics. The main contributions of this research include a) identifying new critical arguments against personas in the context of online analytics, previously not discussed in persona studies, and appending this criticism to the continuum of established persona criticism; and b) conceptually differentiating between traditional and digital data-driven personas and separately analyzing their ability to address the established and novel criticism of personas, with the finding that DDDPs possess considerable strengths in regard to both types of criticism. Regarding the answer to our research question, namely can personas provide value in the age of online analytics, we answer that the real value of DDDPs is provided by giving faces to data, as an alternative way to present online analytics information. Ultimately, however, the usefulness of personas comes down to specific use cases (Cooper 1999). In general, personas are useful for tasks requiring a qualitative understanding of customers and numbers are useful for getting a general overview and, in the case of machine-based decision-making (i.e., marketing automation), making individual level optimization. To this end, we postulate that individual data is optimal for automated decision making, whereas aggregate data such as personas work best for human decision making, especially relating to decisions at the strategic level. The claimed irrelevance of personas seems to be based on the confusion of their use in an age of online analytics data. It seems that the criticism presented by online analytics practitioners is based on understanding personas as TDDPs, while overlooking the potential of DDDPs. This insight further supports the purpose of conceptually separating these two types of personas, and clearly communicating their differences to end users of online analytics data. Data-driven personas are not a novel idea, as the purpose of persona creation has always been to use real customer information to generate realistic user characterizations (Cooper 1999). However, using computational techniques, such as machine learning, provide tremendous opportunities toward this end (Salminen, Jansen, et al. 2018). At the same time, there are a plethora of open research questions to answer. Moreover, the persona research related to DDDPs tends to be fragmented, while it would make sense for researchers to collaborate and validate the works of one another to make evolutionary progress in this field. The core benefits of using personas for design, system development, and marketing have not changed. However, for tasks such as targeting or recommendation engines, individual level analytics are likely to perform more efficiently. While these methods are likely to excel in those use cases, their application to other use cases, such as strategic decision making, is more limited. As with any analytical technique, the use of personas is relative to the problem one seeks an Salminen et al. 60 answer to. In this light, the question “Are personas useful?” should be rephrased as “When are personas useful?” Finding the answer requires persistent conceptual and empirical research and exploring new contexts such as online analytics. CONCLUSION In conclusion, personas remain a viable option even in the era of online analytics data. It is possible to combine automatic data collection and other computational techniques to create accurate persona profiles that can also be used for advanced purposes such as prediction. However, some of the challenges of TDDPs are inevitably inherited such as the end user relating to the actual use of personas and their perceived credibility. Therefore, more research and development work is needed to overcome these challenges and to show the tangible value of personas in actual use. i See http://www.cmo.com/features/articles/2017/1/16/why-personas-dont-work-and-what- innovators-are-doing-differently.html#gs.zxQ5E9s ii See https://www.humaneinterface.net/article/are-personas-really-useful iii See http://knowledge.wharton.upenn.edu/article/160811b_kwradio_fader-mariychin-mp3- zodiac/ iv See http://thecontextofthings.com/2016/08/25/user-personas/ END NOTES WORKS CITED An, J, Kwak, H & Jansen, BJ 2017, ‘Personas for Content Creators via Decomposed Aggregate Audience Statistics’, in Proceedings of Advances in Social Network Analysis and Mining (ASONAM 2017), Sydney, Australia. Ardehaly, EM & Culotta, A 2015, ‘Inferring latent attributes of Twitter users with label regularization’, retrieved 29 October 2018 from http://cs.iit.edu/~culotta/pubs/ehsan15inferring.pdf. Bleier, A & Eisenbeiss, M 2015, ‘Personalized Online Advertising Effectiveness: The Interplay of What, When, and Where’, Marketing Science, vol. 34, no. 5, retrieved 18 May 2016, from http://pubsonline.informs.org/doi/abs/10.1287/mksc.2015.0930. Bødker, S, Christiansen, E, Nyvang, T & Zander, P-O 2012, ‘Personas, people and participation: challenges from the trenches of local government’, Proceedings of the 12th Participatory Design Conference: Research Papers - Volume 1 (PDC '12), K Halskov, H Winschiers- Theophilus, Y Lee, J Simonsen, and K Bødker (Eds.). ACM, New York, NY, USA, pp. 91- 100. DOI=http://dx.doi.org/10.1145/2347635.2347649 Boghrati, R, Hoover, J, Johnson, KM, Garten, J & Dehghani, M 2017, ‘Conversation level syntax similarity metric’, Behavior Research Methods, vol. 50, no. 3, pp. 1055-1073, doi: 10.3758/s13428-017-0926-2. Chapman, CN, Love, E, Milham, RP, ElRif, P & Alford, JL 2008, ‘Quantitative Evaluation of Personas as Information’, Proceedings of the Human Factors and Ergonomics Society Annual Meeting, vol. 52, no. 16, pp. 1107–1111. http://cs.iit.edu/%7Eculotta/pubs/ehsan15inferring.pdf http://pubsonline.informs.org/doi/abs/10.1287/mksc.2015.0930 Persona Studies 2018, vol. 4, no. 2 61 Chapman, CN & Milham, RP 2006, ‘The Personas’ New Clothes: Methodological and Practical Arguments against a Popular Method’, Proceedings of the Human Factors and Ergonomics Society Annual Meeting, vol. 50, no. 5, pp. 634–636. Chatwin, RE 2013, ‘An overview of computational challenges in online advertising’, American Control Conference (ACC) 2013, IEEE, pp. 5990–6007, retrieved 29 October 2018 from http://ieeexplore.ieee.org/abstract/document/6580778/. Clarke, TB & Jansen, BJ 2017, ‘Conversion potential: a metric for evaluating search engine advertising performance’, Journal of Research in Interactive Marketing, vol. 11, no. 2, pp. 142–159. Cleary, PD 1997, ‘Subjective and Objective Measures of Health: Which is Better When?’, Journal of Health Services Research, vol. 2, no. 1, pp. 3–4. Cooper, A 1999, The Inmates Are Running the Asylum: Why High Tech Products Drive Us Crazy and How to Restore the Sanity. Sams - Pearson Education, Indianapolis, IN. Del Vecchio, P, Mele, G, Ndou, V & Secundo, G 2017, ‘Creating value from Social Big Data: Implications for Smart Tourism Destinations’, Information Processing & Management. vol. 54, no. 5, pp. 847-860, doi: https://doi.org/10.1016/j.ipm.2017.10.006 Delfabbro, P 2004, ‘The stubborn logic of regular gamblers: Obstacles and dilemmas in cognitive gambling research’, Journal of Gambling Studies, vol. 20, no. 1, pp. 1–21. Dichter, E 1964, Handbook of consumer motivations: the psychology of the world of objects. McGraw-Hill. Dong, J, Kelkar, K & Braun, K 2007, ‘Getting the most out of personas for product usability enhancements’, Usability and Internationalization. HCI and Culture. UI-HCII 2007. Lecture Notes in Computer Science, N Aykin (eds), vol 4559. Springer, Berlin, Heidelberg, doi: https://doi.org/10.1007/978-3-540-73287-7_36. Fisher, RJ 1993, ‘Social Desirability Bias and the Validity of Indirect Questioning’, Journal of Consumer Research, vol. 20, no. 2, pp. 303–315. Floyd, IR, Jones, MC & Twidale, MB 2008, ‘Resolving Incommensurable Debates: A Preliminary Identification of Persona Kinds, Attributes, and Characteristics’, Artifact, vol. 2, no. 1, pp. 12–26. Friess, E 2012, ‘Personas and Decision Making in the Design Process: An Ethnographic Case Study’, in Proceedings of the SIGCHI Conference on Human Factors in Computing Systems, CHI ’12, ACM, New York, NY, USA, pp. 1209–1218, retrieved from http://doi.acm.org/10.1145/2207676.2208572. Goodwin, K 2009, Designing for the Digital Age: How to Create Human-Centered Products and Services. Wiley, Indianapolis, IN. Graepel, T, Candela, JQ, Borchert, T & Herbrich, R 2010, ‘Web-scale bayesian click-through rate prediction for sponsored search advertising in microsoft’s bing search engine’, in Proceedings of the 27th international conference on machine learning (ICML-10), pp. 13– 20, retrieved September 20, 2017, from http://machinelearning.wustl.edu/mlpapers/paper_files/icml2010_GraepelCBH10.pdf. Gudjónsdóttir, R 2010, Personas and scenarios in use, Skolan för datavetenskap och kommunikation, Kungliga Tekniska högskolan, Stockholm. Hill, CG, Haag, M, Oleson, A, Mendez, C, Marsden, N, Sarma, A & Burnett, M 2017, ‘Gender- Inclusiveness Personas vs. Stereotyping: Can We Have it Both Ways?’, CHI '17 Proceedings of the 2017 CHI Conference on Human Factors in Computing, pp. 6658– 6671, retrieved May 28, 2017, from http://dl.acm.org/citation.cfm?doid=3025453.3025609. Holtzblatt, K, Wendell, JB & Wood, S 2005, Rapid Contextual Design: A How-to Guide to Key Techniques for User-centered Design, Elsevier. Howard, TW 2015, ‘Are Personas Really Usable?’, Communication Design Quarterly Review, vol. 3, no. 2, pp. 20–26. Jansen, BJ, An, J, Kwak, H, Salminen, J & Jung, S-G 2017, ‘Viewed by Too Many or Viewed Too Little: Using Information Dissemination for Audience Segmentation’, Proceedings of the http://ieeexplore.ieee.org/abstract/document/6580778/ https://doi.org/10.1016/j.ipm.2017.10.006 https://doi.org/10.1007/978-3-540-73287-7_36 http://doi.acm.org/10.1145/2207676.2208572 http://machinelearning.wustl.edu/mlpapers/paper_files/icml2010_GraepelCBH10.pdf http://dl.acm.org/citation.cfm?doid=3025453.3025609 Salminen et al. 62 Association for Information Science and Technology, vol. 54, no. 1, pp. 189–196, doi: https://doi.org/10.1002/pra2.2017.14505401021. Järvinen, J 2016, ‘The use of digital analytics for measuring and optimizing digital marketing performance’, Jyväskylä Studies in Business and Economics 170, retrieved September 8, 2017, from https://jyx.jyu.fi/dspace/handle/123456789/51512. Järvinen, J & Karjaluoto, H 2015, ‘The use of Web analytics for digital marketing performance measurement’, Industrial Marketing Management, vol. 50, pp. 117–127, https://doi.org/10.1016/j.indmarman.2015.04.009. Johnson, MD & Selnes, F 2004, ‘Customer portfolio management: toward a dynamic theory of exchange relationships’, Journal of Marketing, vol. 68, no. 2, pp. 1–17. Jung, S, Salminen, J, An, J, Kwak, H & Jansen, BJ 2018, ‘Automatically Conceptualizing Social Media Analytics Data via Personas’, Proceedings of the Twelfth International AAAI Conference on Web and Social Media (ICWSM 2018), retrieved 29 October 2018 from https://www.aaai.org/ocs/index.php/ICWSM/ICWSM18/paper/view/17810/17089. Jung, S-G, An, J, Kwak, H, Ahmad, M, Nielsen, L & Jansen, BJ 2017, ‘Persona Generation from Aggregated Social Media Data’, in Proceedings of the 2017 CHI Conference Extended Abstracts on Human Factors in Computing Systems, CHI EA ’17, pp. 1748–1755, retrieved 29 October 2018 from http://www.bernardjjansen.com/uploads/2/4/1/8/24188166/jansen_personas_chi20 17.pdf. Jung, S-G, An, J, Kwak, H, Salminen, J & Jansen, BJ 2017, ‘Inferring social media users’ demographics from profile pictures: A Face++ analysis on Twitter users’, in Proceedings of 17th International Conference on Electronic Business, Dubai, retrieved 29 October 2018 from https://aisel.aisnet.org/iceb2017/22/. Kaartemo, V, Akaka, MA & Vargo, SL 2017, ‘A Service-Ecosystem Perspective on Value Creation: Implications for International Business’, in Value Creation in International Business, Palgrave Macmillan, Cham, pp. 131–149, retrieved 11 February 2018, from https://link.springer.com/chapter/10.1007/978-3-319-39369-8_6. Krashen, SD 1984, ‘Immersion: Why it works and what it has taught us’, Language and Society, vol. 12, no. 1, pp. 61–64. Kwak, H, An, J & Jansen, BJ 2017, ‘Automatic Generation of Personas Using YouTube Social Media Data’, Proceedings of the 50th Hawaii International Conference on System Sciences, pp. 833–842, retrieved 29 October 2018 from http://www.bernardjjansen.com/uploads/2/4/1/8/24188166/jansen_personas_hciss2 016.pdf. LeRouge, C, Ma, J, Sneha, S & Tolle, K 2013, ‘User profiles and personas in the design and development of consumer health technologies’, International Journal of Medical Informatics, vol. 82, no. 11, pp. e251-268. Liapis, A, Holmgård, C, Yannakakis, GN & Togelius, J 2015, ‘Procedural Personas as Critics for Dungeon Generation’, in AM Mora & G Squillero (eds), Applications of Evolutionary Computation, Springer International Publishing, Cham, pp. 331–343, retrieved 14 March 2018 from http://link.springer.com/10.1007/978-3-319-16549-3_27. Long, F 2009, ‘Real or imaginary: The effectiveness of using personas in product design’, in Proceedings of the Irish Ergonomics Society Annual Conference, Irish Ergonomics Society Dublin. Ma, J & LeRouge, C 2007, ‘Introducing User Profiles and Personas into Information Systems Development’, AMCIS 2007 Proceedings, retrieved 29 October 2018 from http://aisel.aisnet.org/amc is2007/237. Marsden, N & Haag, M 2016, ‘Stereotypes and Politics: Reflections on Personas’, in Proceedings of the 2016 CHI Conference on Human Factors in Computing Systems, CHI ’16, pp. 4017– 4031, retrieved 18 March 2018, from http://doi.acm.org/10.1145/2858036.2858151. Massanari, AL 2010, ‘Designing for imaginary friends: information architecture, personas and the politics of user-centered design’, New Media & Society, vol. 12, no. 3, pp. 401–416. https://doi.org/10.1002/pra2.2017.14505401021 https://jyx.jyu.fi/dspace/handle/123456789/51512 https://doi.org/10.1016/j.indmarman.2015.04.009 https://www.aaai.org/ocs/index.php/ICWSM/ICWSM18/paper/view/17810/17089 http://www.bernardjjansen.com/uploads/2/4/1/8/24188166/jansen_personas_chi2017.pdf http://www.bernardjjansen.com/uploads/2/4/1/8/24188166/jansen_personas_chi2017.pdf https://aisel.aisnet.org/iceb2017/22/ https://link.springer.com/chapter/10.1007/978-3-319-39369-8_6 http://www.bernardjjansen.com/uploads/2/4/1/8/24188166/jansen_personas_hciss2016.pdf http://www.bernardjjansen.com/uploads/2/4/1/8/24188166/jansen_personas_hciss2016.pdf http://link.springer.com/10.1007/978-3-319-16549-3_27 http://aisel.aisnet.org/amcis2007/237 http://doi.acm.org/10.1145/2858036.2858151 Persona Studies 2018, vol. 4, no. 2 63 Matthews, T, Judge, T & Whittaker, S 2012, ‘How Do Designers and User Experience Professionals Actually Perceive and Use Personas?’, in Proceedings of the SIGCHI Conference on Human Factors in Computing Systems, CHI ’12, pp. 1219–1228, retrieved 29 October 2018 from http://doi.acm.org/10.1145/2207676.2208573. Mesgari, M, Okoli, C & de Guinea, AO 2018, ‘Creating Rich and Representative Personas by Discovering Affordances’, IEEE Transactions on Software Engineering, doi: 10.1109/TSE.2018.2826537 Miaskiewicz, T & Kozar, KA 2011, ‘Personas and user-centered design: How can personas benefit product design processes?’, Design Studies, vol. 32, no. 5, pp. 417–430. Miaskiewicz, T, Sumner, T & Kozar, KA 2008, ‘A latent semantic analysis methodology for the identification and creation of personas’, in Proceedings of the SIGCHI Conference on Human Factors in Computing Systems, pp. 1501–1510, retrieved from http://dl.acm.org/citation.cfm?id=1357290. Miller, GA 1956, ‘The magical number seven plus or minus two: some limits on our capacity for processing information’, Psychological Review, vol. 63, no. 2, pp. 81–97. Miralles-Pechuán, L, Ponce, H & Martínez-Villaseñor, L 2018, ‘A novel methodology for optimizing display advertising campaigns using genetic algorithms’, Electronic Commerce Research and Applications, vol. 27, pp. 39–51. Molenaar, L 2017, ‘Data-driven personas: Generating consumer insights with the use of clustering analysis from big data’, Thesis, retrieved 17 March 2018, from http://resolver.tudelft.nl/uuid:12d7f261-20b4-4656-93d7-fed2b437aefb. Mulder, S & Yaar, Z 2006, The User is Always Right: A Practical Guide to Creating and Using Personas for the Web, New Riders. Mulken, S van, André, E & Müller, J 1998, ‘The Persona Effect: How Substantial Is It?’, in People and Computers XIII, Springer, London, pp. 53–66, retrieved 28 February 2018, from https://link.springer.com/chapter/10.1007/978-1-4471-3605-7_4. Nacke, LE, Drachen, A & Göbel, S 2010, ‘Methods for evaluating gameplay experience in a serious gaming context’, International Journal of Computer Science in Sport, vol. 9, no. 2, pp. 1–12. Nielsen, L 2013, Personas - User Focused Design, Springer Science & Business Media, Springer- Verlag London. Nielsen, L, Jung, S-G, An, J, Salminen, J, Kwak, H & Jansen, BJ 2017, ‘Who Are Your Users?: Comparing Media Professionals’ Preconception of Users to Data-driven Personas’, in Proceedings of the 29th Australian Conference on Computer-Human Interaction, OZCHI ’17, pp. 602–606, retrieved 28 January 2018, from http://doi.acm.org/10.1145/3152771.3156178. Owusu, RA, Mutshinda, CM, Antai, I, Dadzie, KQ & Winston, EM 2016, ‘Which UGC features drive web purchase intent? A spike-and-slab Bayesian Variable Selection Approach’, Internet Research, vol. 26, no. 1, pp. 22–37. Provan, KG & Milward, HB 1995, ‘A preliminary theory of interorganizational network effectiveness: A comparative study of four community mental health systems’, Administrative Science Quarterly, vol. 40, no. 1, pp. 1–33. Pruitt, J & Adlin, T 2006, The Persona Lifecycle: Keeping People in Mind Throughout Product Design. Morgan Kaufmann, Boston. Pruitt, J & Grudin, J 2003, ‘Personas: Practice and Theory’, in Proceedings of the 2003 Conference on Designing for User Experiences, DUX ’03, pp. 1–15, retrieved from http://doi.acm.org/10.1145/997078.997089. Ronen, R, Yom-Tov, E & Lavee, G 2016, ‘Recommendations meet web browsing: enhancing collaborative filtering using internet browsing logs’, 2016 IEEE 32nd International Conference on Data Engineering (ICDE), pp. 1230–1238, doi: 10.1109/ICDE.2016.7498327. Rönkkö, K 2005, ‘An Empirical Study Demonstrating How Different Design Constraints, Project Organization and Contexts Limited the Utility of Personas’, in Proceedings of the Proceedings of the 38th Annual Hawaii International Conference on System Sciences - http://doi.acm.org/10.1145/2207676.2208573 https://doi.org/10.1109/TSE.2018.2826537 http://dl.acm.org/citation.cfm?id=1357290 http://resolver.tudelft.nl/uuid:12d7f261-20b4-4656-93d7-fed2b437aefb https://link.springer.com/chapter/10.1007/978-1-4471-3605-7_4 http://doi.acm.org/10.1145/3152771.3156178 http://doi.acm.org/10.1145/997078.997089 https://doi.org/10.1109/ICDE.2016.7498327 Salminen et al. 64 Volume 08, HICSS ’05, IEEE Computer Society, retrieved 29 October 2018 from http://dx.doi.org/10.1109/HICSS.2005.85. Rönkkö, K, Hellman, M, Kilander, B & Dittrich, Y 2004, ‘Personas is Not Applicable: Local Remedies Interpreted in a Wider Context’, in Proceedings of the Eighth Conference on Participatory Design: Artful Integration: Interweaving Media, Materials and Practices - Volume 1, PDC 04, pp. 112–120, retrieved 29 October 2018 from http://doi.acm.org/10.1145/1011870.1011884. Rundle-Thiele, S, Dietrich, T & Kubacki, K 2017, ‘Why We Need Segmentation When Designing Social Marketing Programs’, in Segmentation in Social Marketing, Springer, Singapore, pp. 197–214, retrieved 9 May 2018, from https://link.springer.com/chapter/10.1007/978-981-10-1835-0_13. Russell, S & Toklu, C 2011, ‘Acquiring and Applying Market Knowledge for Large Software Purchases: Products, Personas, and Programs’, in Knowledge Management for Process, Organizational and Marketing Innovation: Tools and Methods, IGI Global, pp. 195–215. Saggi, MK & Jain, S 2018, ‘A survey towards an integration of big data analytics to big insights for value-creation’, Information Processing & Management, vol. 54, no. 5, pp. 758–790. Salminen, J, Jansen, BJ, An, J, Kwak, H & Jung, S-G 2018, ‘Research Roadmap for Automatic Persona Generation: Principles and Open Questions’, Poster presented at The Fifth Machine Learning and Data Analytics (MLDAS 2018) Symposium, Doha, Qatar. Salminen, J, Jung, S-G, An, J, Kwak, H & Jansen, BJ 2018, ‘Findings of a User Study of Automatically Generated Personas’, in Extended Abstracts of the 2018 CHI Conference on Human Factors in Computing Systems, CHI EA ’18, p. LBW097:1–LBW097:6, retrieved 26 April 2018, from http://doi.acm.org/10.1145/3170427.3188470. Salminen, J, Milenković, M & Jansen, BJ 2017, ‘Problems of Data Science in Organizations: An Explorative Qualitative Analysis of Business Professionals’ Concerns’, in Proceedings of International Conference on Electronic Business (ICEB 2017), Dubai. Salminen, J, Nielsen, L, Jung, S-G, An, J, Kwak, H & Jansen, BJ 2018, ‘“Is More Better?”: Impact of Multiple Photos on Perception of Persona Profiles’, in Proceedings of ACM CHI Conference on Human Factors in Computing Systems (CHI2018), Montréal, Canada. Salminen, J, Şengün, S, Kwak, H, Jansen, BJ, An, J, Jung, S, Vieweg, S & Harrell, F 2017, ‘Generating Cultural Personas from Social Data: A Perspective of Middle Eastern Users’, in Proceedings of The Fourth International Symposium on Social Networks Analysis, Management and Security (SNAMS-2017), Prague, Czech Republic. — 2018, ‘From 2,772 segments to five personas: Summarizing a diverse online audience by generating culturally adapted personas’, First Monday, vol. 23, no. 6, retrieved 3 June 2018, from http://firstmonday.org/ojs/index.php/fm/article/view/8415. Scott, DM 2007, The New Rules of Marketing, John Wiley and Sons, Hoboken, New Jersey. Sinha, R 2003, ‘Persona Development for Information-rich Domains’, in Extended abstracts of the 2003 Conference on Human Factors in Computing Systems (CHI 2003), Ft. Lauderdale, Florida, USA. Storey, VC & Song, I-Y 2017, ‘Big data technologies and management: What conceptual modeling can do’, Data & Knowledge Engineering, vol. 108, March 2017, pp. 50–67. Thoma, V & Williams, B 2009, ‘Developing and Validating Personas in e-Commerce: A Heuristic Approach’, in Human-Computer Interaction – INTERACT 2009, Lecture Notes in Computer Science, Springer, Berlin, Heidelberg, pp. 524–527, retrieved 29 October 2018 from https://link.springer.com/chapter/10.1007/978-3-642-03658-3_56. Vahlo, J & Koponen, A 2018, ‘Player Personas and Game Choice’, in N Lee (ed), Encyclopedia of Computer Graphics and Games, Springer International Publishing, Cham, pp. 1–6, retrieved March 6, 2018, from http://link.springer.com/10.1007/978-3-319-08234- 9_149-1. Venkataramani, V, Amsden, Z, Bronson, N, Cabrera III, G, Chakka, P, Dimov, P, Ding, H, Ferris, J, Giardullo, A, Hoon, J & others 2012, ‘Tao: how Facebook serves the social graph’, in Proceedings of the 2012 ACM SIGMOD International Conference on Management of Data, http://dx.doi.org/10.1109/HICSS.2005.85 https://link.springer.com/chapter/10.1007/978-981-10-1835-0_13 http://doi.acm.org/10.1145/3170427.3188470 http://firstmonday.org/ojs/index.php/fm/article/view/8415 https://link.springer.com/chapter/10.1007/978-3-642-03658-3_56 http://link.springer.com/10.1007/978-3-319-08234-9_149-1 http://link.springer.com/10.1007/978-3-319-08234-9_149-1 Persona Studies 2018, vol. 4, no. 2 65 pp. 791–792, retrieved 29 October 2018 from http://dl.acm.org/citation.cfm?id=2213957. Vincent, CJ & Blandford, A 2014, ‘The challenges of delivering validated personas for medical equipment design’, Applied Ergonomics, vol. 45, no. 4, pp. 1097–1105. Volkova, S, Bachrach, Y & Durme, BV 2016, ‘Mining User Interests to Predict Perceived Psycho- Demographic Traits on Twitter’, in 2016 IEEE Second International Conference on Big Data Computing Service and Applications (BigDataService), pp. 36–43, retrieved 29 October 2018 from https://ieeexplore.ieee.org/document/7474353. Wang, J & Yuan, S 2015, ‘Real-Time Bidding: A New Frontier of Computational Advertising Research’, in Proceedings of the Eighth ACM International Conference on Web Search and Data Mining, WSDM ’15, pp. 415–416, retrieved 28 February 2016, from http://doi.acm.org/10.1145/2684822.2697041. Zerbino, P, Aloini, D, Dulmin, R & Mininno, V 2018, ‘Big Data-enabled Customer Relationship Management: A holistic approach’, Information Processing & Management, vol. 54, no. 5, pp. 818–846. Zhang, X, Brown, H-F & Shankar, A 2016, ‘Data-driven Personas: Constructing Archetypal Users with Clickstreams and User Telemetry’, in Proceedings of the 2016 CHI Conference on Human Factors in Computing Systems, CHI ’16, , pp. 5350–5359, doi: 10.1145/2858036.2858523. http://dl.acm.org/citation.cfm?id=2213957 https://ieeexplore.ieee.org/document/7474353 http://doi.acm.org/10.1145/2684822.2697041 https://doi.org/10.1145/2858036.2858523 Joni Salminen, Bernard J. Jansen, Jisun An, Haewoon Kwak and Soon-gyo Jung Abstract Key Words Introduction Methodological approach Why are personas useful? Criticism of personas New criticism Evaluating data-driven personas in the light of criticism: Distinguishing between traditional and digital data-driven personas Limitations of DDDPs Discussion Conclusion Works Cited