1 Introduction

Tom Freston is credited with saying “Innovation is taking two things that exist and putting them together in a new way”. For a long time in history, it has been the prevailing assumption that artistic, creative tasks such as writing poems, creating software, designing fashion, and composing songs could only be performed by humans. This assumption has changed drastically with recent advances in artificial intelligence (AI) that can generate new content in ways that cannot be distinguished anymore from human craftsmanship.

The term generative AI refers to computational techniques that are capable of generating seemingly new, meaningful content such as text, images, or audio from training data. The widespread diffusion of this technology with examples such as Dall-E 2, GPT-4, and Copilot is currently revolutionizing the way we work and communicate with each other. Generative AI systems can not only be used for artistic purposes to create new text mimicking writers or new images mimicking illustrators, but they can and will assist humans as intelligent question-answering systems. Here, applications include information technology (IT) help desks where generative AI supports transitional knowledge work tasks and mundane needs such as cooking recipes and medical advice. Industry reports suggest that generative AI could raise global gross domestic product (GDP) by 7% and replace 300 million jobs of knowledge workers (Goldman Sachs 2023). Undoubtedly, this has drastic implications not only for the Business & Information Systems Engineering (BISE) community, where we will face revolutionary opportunities, but also challenges and risks that we need to tackle and manage to steer the technology and its use in a responsible and sustainable direction.

In this Catchword article, we provide a conceptualization of generative AI as an entity in socio-technical systems and provide examples of models, systems, and applications. Based on that, we introduce limitations of current generative AI and provide an agenda for BISE research. Previous papers discuss generative AI around specific methods such as language models (e.g., Teubner et al. 2023; Dwivedi et al. 2023; Schöbel et al. 2023) or specific applications such as marketing (e.g., Peres et al. 2023), innovation management (Burger et al. 2023), scholarly research (e.g., Susarla et al. 2023; Davison et al. 2023), and education (e.g., Kasneci et al. 2023; Gimpel et al. 2023). Different from these works, we focus on generative AI in the context of information systems, and, to this end, we discuss several opportunities and challenges that are unique to the BISE community and make suggestions for impactful directions for BISE research.

2 Conceptualization

2.1 Mathematical Principles of Generative AI

Generative AI is primarily based on generative modeling, which has distinctive mathematical differences from discriminative modeling (Ng and Jordan 2001) often used in data-driven decision support. In general, discriminative modeling tries to separate data points X into different classes Y by learning decision boundaries between them (e.g., in classification tasks with \(Y \in \{ 0, 1 \}\)). In contrast to that, generative modeling aims to infer some actual data distribution. Examples can be the joint probability distribution P(XY) of both the inputs and the outputs or P(Y), but where Y is typically from some high-dimensional space. By doing so, a generative model offers the ability to produce new synthetic samples (e.g., generate new observation-target-pairs (XY) or new observations X given a target value Y) (Bishop 2006).

Building upon the above, a generative AI model refers to generative modeling that is instantiated with a machine learning architecture (e.g., a deep neural network) and, therefore, can create new data samples based on learned patterns.Footnote 1 Further, a generative AI system encompasses the entire infrastructure, including the model, data processing, and user interface components. The model serves as the core component of the system, which facilitates interaction and application within a broader context. Lastly, generative AI applications refer to the practical use cases and implementations of these systems, such as search engine optimization (SEO) content generation or code generation that solve real-world problems and drive innovation across various domains. Figure 1 shows a systematization of generative AI across selected data modalities (e.g., text, image, and audio) and the model-, system-, and application-level perspectives, which we detail in the following section.

Fig. 1
figure 1

A model-, system-, and application-level view on generative AI

Note that the modalities in Fig. 1 are neither complete nor entirely distinctive and can be detailed further. In addition, many unique use cases such as, for example, modeling functional properties of proteins (Unsal et al. 2022) can be represented in another modality such as text.

2.2 A Model-, System-, and Application-Level View of Generative AI

2.2.1 Model-Level View

A generative AI model is a type of machine learning architecture that uses AI algorithms to create novel data instances, drawing upon the patterns and relationships observed in the training data. A generative AI model is of critically central yet incomplete nature, as it requires further fine-tuning to specific tasks through systems and applications.

Deep neural networks are particularly well suited for the purpose of data generation, especially as deep neural networks can be designed using different architectures to model different data types (Janiesch et al. 2021; Kraus et al. 2020), for example, sequential data such as human language or spatial data such as images. Table 1 presents an overview of the underlying concepts and model architectures that are common in the context of generative AI, such as diffusion probabilistic models for text-to-image generation or the transformer architecture and (large) language models (LLMs) for text generation. GPT (short for generative pre-trained transformer), for example, represents a popular family of LLMs, used for text generation, for instance, in the conversational agent ChatGPT.

Large generative AI models that can model output in and across specific domains or specific data types in a comprehensive and versatile manner are oftentimes also called foundation models (Bommasani et al. 2021). Due to their size, they exhibit two key properties: emergence, meaning the behavior is oftentimes implicitly induced rather than explicitly constructed (e.g., GPT models can create calendar entries in the .ical format even though such models were not explicitly trained to do so), and homogenization, where a wide range of systems and applications can now be powered by a single, consolidated model (e.g., Copilot can generate source code across a wide range of programming languages).

Figure 1 presents an overview of generative AI models along different, selected data modalities, which are pre-trained on massive amounts of data. Note that we structure the models in Fig. 1 by their output modality such as X-to-text or X-to-image. For example, GPT-4 as the most recent generative AI model underlying OpenAI’s popular conversational agent ChatGPT (OpenAI 2023a) accepts both image and text inputs to generate text outputs. Similarly, Midjourney accepts both modalities to generate images. To this end, generative AI models can also be grouped into unimodal and multimodal models. Unimodal models take instructions from the same input type as their output (e.g., text). On the other hand, multimodal models can take input from different sources and generate output in various forms. Multimodal models exist across a variety of data modalities, for example for text, image, and audio. Prominent examples include Stable Diffusion (Rombach et al. 2022) for text-to-image generation, MusicLM (Agostinelli et al. 2023) for text-to-music generation, Codex (Chen et al. 2021) and AlphaCode (Li et al. 2022) for text-to-code generation, and as mentioned above GPT-4 for image-to-text as well as text-to-text generation (OpenAI 2023a).

The underlying training procedures vary greatly across different generative AI models (see Fig. 2). For example, generative adversarial networks (GANs) are trained through two competing objectives (Goodfellow et al. 2014), where one is to create new synthetic samples while the other tries to detect synthetic samples from the actual training samples, so that the distribution of synthetic samples is eventually close to the distribution of the training samples. Differently, systems such as ChatGPT-based conversational models use reinforcement learning from human feedback (RLHF). RLHF as used by ChatGPT proceeds in three steps to first create demonstration data for prompts, then to have users rank the quality of different outputs for a prompt, and finally to learn a policy that generates desirable output via reinforcement learning so that the output would score well during ranking (Ziegler et al. 2019).

Table 1 Glossary of key concepts in generative AI
Fig. 2
figure 2

Examples of different training procedures for generative AI models. a Generative adversarial network (GAN). b Reinforcement learning from human feedback (RLHF) as used in conversational generative AI models

2.2.2 System-Level View

Any system consists of a number of elements that are interconnected and interact with each other. For generative AI systems, this comprises not only the aforementioned generative AI model but also the underlying infrastructure, user-facing components, and their modality as well as the corresponding data processing (e.g., for prompts). An example would be the integration of deep learning models, like Codex (Chen et al. 2021), into a more interactive and comprehensive system, like GitHub Copilot, which allows its users to code more efficiently. Similarly, Midjourney’s image generation system builds on an undisclosed X-to-image generation model that users can interact with to generate images using Discord bots. Thus, generative AI systems embed the functionality of the underlying mathematical model to provide an interface for user interaction. This step augments the model-specific capabilities, enhancing its practicability and usability across real-world use cases.

Core concerns when embedding deep learning models in generative AI systems generally are scalability (e.g., distributed computing resources), deployment (e.g., in various environments and for different devices), and usability (e.g., a user-friendly interface and intent recognition). As pre-trained open-source alternatives to closed-source, proprietary models continue to be released, making these models available to their users (be it companies or individuals) becomes increasingly important. For both open-source and closed-source models, unexpected deterioration of model performance over time highlights the need for continuous model monitoring (Chen et al. 2023). Although powerful text-generating models existed before the release of the ChatGPT system in November 2022, ChatGPT’s ease of use also for non-expert users was a core contributing factor to its explosive worldwide adoption.

Moreover, on the system level, multiple components of a generative AI system can be integrated or connected to other systems, external databases with domain-specific knowledge, or platforms. For example, common limitations in many generative AI models are that they were trained on historical data with specific cut-off date and thus do not store information beyond or that an information compression takes place because of which generative AI models may not remember everything that they saw during training (Chiang 2023). Both limitations can be mitigated by augmenting the model with functionality for real-time information retrieval, which can substantially enhance its accuracy and usefulness. Relatedly, in the context of text generation, online language modeling addresses the problem of outdated models by continuously training them on up-to-date data.Footnote 2 Thereby, such models can then be knowledgeable of recent events that their static counterparts would not be aware of due to their training cut-off dates.

2.2.3 Application-Level View

Generative AI applications are generative AI systems situated in organizations to deliver value by solving dedicated business problems and addressing stakeholder needs. They can be regarded as human-task-technology systems or information systems that use generative AI technology to augment human capacities to accomplish specific tasks. This level of generative AI encompasses countless real-world use cases: These range from SEO content generation (Reisenbichler et al. 2022), over synthetic movie generation (Metz 2023) and AI music generation (Garcia 2023), to natural language-based software development (Chen et al. 2021).

Generative AI applications will give rise to novel technology-enabled modes of work. The more users will familiarize themselves with these novel applications, the more they will trust or mistrust them as well as use or disuse them. Over time, applications will likely transition from mundane tasks such as writing standard letters and getting a dinner reservation to more sensitive tasks such as soliciting medical or legal advice. They will involve more consequential decisions, which may even involve moral judgment (Krügel et al. 2023). This ever-increasing scope and pervasiveness of generative AI applications give rise to an imminent need not only to provide prescriptions and principles for trustworthy and reliable designs, but also for scrutinizing the effects on the user to calibrate qualities such as trust appropriately. The (continued) use and adoption of such applications by end users and organizations entails a number of fundamental socio-technical considerations to descry innovation potential and affordances of generative AI artifacts.

2.3 A Socio-Technical View on Generative AI

As technology advances, the definition and extent of what constitutes AI are continuously refined, while the reference point of human intelligence stays comparatively constant (Berente et al. 2021). With generative AI, we are approaching a further point of refinement. In the past, the capability of AI was mostly understood to be analytic, suitable for decision-making tasks. Now, AI gains the capability to perform generative tasks, suitable for content creation. While the procedure of content creation to some respect can still be considered analytic as it is inherently probabilistic, its results can be creative or even artistic as generative AI combines elements in novel ways. Further, IT artifacts were considered passive as they were used directly by humans. With the advent of agentic IT artifacts (Baird and Maruping 2021) powered by LLMs (Park et al. 2023), this human agency primacy assumption needs to be revisited and impacts how we devise the relation between human and AI based on their potency. Eventually, this may require AI capability models to structure, explain, guide, and constrain the different abilities of AI systems and their uses as AI applications.

Focusing on the interaction between humans and AI, so far, for analytic AI, the concept of delegation has been discussed to establish a hierarchy for decision-making (Baird and Maruping 2021). With generative AI, a human uses prompts to engage with an AI system to create content, and the AI then interprets the human’s intentions and provides feedback to presuppose further prompts. At first glance, this seems to follow a delegation pattern as well. Yet, the subsequent process does not, as the output of the AI can be suggestive to the other and will inform their further involvement directly or subconsciously. Thus, the process of creation rather follows a co-creation pattern, that is, the practice of collaborating in different roles to align and offer diverse insights to guide a design process (Ramaswamy and Ozcan 2018). Using the lens of agentic AI artifacts, initiation is not limited to humans.

The abovementioned interactions also impact our current understanding of hybrid intelligence as the integration of humans and AI, leveraging the unique strengths of both. Hybrid intelligence argues to address the limitations of each intelligence type by combining human intuition, creativity, and empathy with the computational power, accuracy, and scalability of AI systems to achieve enhanced decision-making and problem-solving capabilities (Dellermann et al. 2019). With generative AI and the AI’s capability to co-create, the understanding of what constitutes this collective intelligence begins to shift. Hence, novel human-AI interaction models and patterns may become necessary to explain and guide the behavior of humans and AI systems to enable effective and efficient use in AI applications on the one hand and, on the other hand, to ensure envelopment of AI agency and reach (Asatiani et al. 2021).

On a theoretical level, this shift in human-computer or rather human-AI interaction fuels another important observation: The theory of mind is an established theoretical lens in psychology to describe the cognitive ability of individuals to understand and predict the mental states, emotions, and intentions of others (Carlson et al. 2013; Baron-Cohen 1997; Gray et al. 2007). This skill is crucial for social interactions, as it facilitates empathy and allows for effective communication. Moreover, conferring a mind to an AI system can substantially drive usage intensity (Hartmann et al. 2023a). The development of a theory of mind in humans is unconscious and evolves throughout an individual’s life. The more natural AI systems become in terms of their interface and output, the more a theory of mind for human-computer interactions becomes necessary. Research is already investigating how AI systems can become theory-of-mind-aware to better understand their human counterpart (Rabinowitz et al. 2018; Çelikok et al. 2019). However, current AI systems hardly offer any cues for interactions. Thus, humans are rather void of a theory to explain their understanding of intelligent behavior by AI systems, which becomes even more important in a co-creation environment that does not follow a task delegation pattern. A theory of the artificial mind that explains how individuals perceive and assume the states and rationale of AI systems to better collaborate with them may alleviate some of these concerns.

3 Limitations of Current Generative AI

In the following, we discuss four salient boundaries of generative AI that, we argue, are important limitations in real-world applications. The following limitations are of technical nature in that they refer to how current generative AI models make inferences, and, hence, the limitations arise at the model level. Because of this, it is likely that limitations will persist in the long run, with system- and application-level implications.

Incorrect outputs. Generative AI models may produce output with errors. This is owed to the underlying nature of machine learning models relying on probabilistic algorithms for making inferences. For example, generative AI models generate the most probable response to a prompt, not necessarily the correct response. As such, challenges arise as, by now, outputs are indistinguishable from authentic content and may present misinformation or deceive users (Spitale et al. 2023). In LLMs, this problem in emergent behavior is called hallucination (Ji et al. 2023), which refers to mistakes in the generated text that are semantically or syntactically plausible but are actually nonsensical or incorrect. In other words, the generative AI model produces content that is not based on any facts or evidence, but rather on its own assumptions or biases. Moreover, the output of generative AI, especially that of LLMs, is typically not easily verifiable.

The correctness of generative AI models is highly dependent on the quality of training data and the according learning process. Generative AI systems and applications can implement correctness checks to inhibit certain outputs. Yet, due to the black-box nature of state-of-the-art AI models (Rai 2020), the usage of such systems critically hinges on users’ trust in reliable outputs. The closed source of commercial off-the-shelf generative AI systems aggravates this fact and prohibits further tuning and re-training of the models. One solution for addressing the downstream implications of incorrect outputs is to use generative AI to produce explanations or references, which can then be verified by users. However, such explanations are again probabilistic and thus subject to errors; nevertheless, they may help users in their judgment and decision-making when to accept outputs of generative AI and when not.

Bias and fairness. Societal biases permeate everyday human-generated content (Eskreis-Winkler and Fishbach 2022). The unbiasedness of vanilla generative AI is very much dependent on the quality of training data and the alignment process. Training deep learning models on biased data can amplify human biases, replicate toxic language, or perpetuate stereotypes of gender, sexual orientation, political leaning, or religion (e.g., Caliskan et al. 2017; Hartmann et al. 2023b). Recent studies expose the harmful biases embedded in multimodal generative AI models such as CLIP (contrastive language-image pre-training; Wolfe et al. 2022) and the CLIP-filtered LAION dataset (Birhane et al. 2021), which are core components of generative AI models (e.g., Dall-E 2 or Stable Diffusion). Human biases can also creep into the models in other stages of the model engineering process. For instruction-based language models, the RLHF process is an additional source of bias (OpenAI 2023b). Careful coding guidelines and quality checks can help address these risks.

Addressing bias and thus fairness in AI receives increasing attention in the academic literature (Dolata et al. 2022; Schramowski et al. 2022; Ferrara 2023; De-Arteaga et al. 2022; Feuerriegel et al. 2020; von Zahn et al. 2022), but remains an open and ongoing research question. For example, the developers of Stable Diffusion flag “probing and understanding the limitations and biases of generative models” as an important research area (Rombach et al. 2022). Some scholars even attest to models certain moral self-correcting capabilities (Ganguli et al. 2023), which may attenuate concerns of embedded biases and result in more fairness. In addition, on the system and application level, mitigation mechanisms can be implemented to address biases embedded in the deep learning models and create more diverse outputs (e.g., updating the prompts “under the hood” as done by Dall-E 2 to increase the demographic diversity of the outputs). Yet, more research is needed to get closer to the notion of fair AI.

Copyright violation. Generative AI models, systems, and applications may cause a violation of copyright laws because they can produce outputs that resemble or even copy existing works without permission or compensation to the original creators (Smits and Borghuis 2022). Here, two potential infringement risks are common. On the one hand, generative AI may make illegal copies of a work, thus violating the reproduction right of creators. Among others, this may happen when a generative AI was trained on original content that is protected by copyright but where the generative AI produces copies. Hence, a typical implication is that the training data for building generative AI systems must be free of copyrights. Crucially, copyright violation may nevertheless still happen even when the generative AI has never seen a copyrighted work before, such as, for example, when it simply produces a trademarked logo similar to that of Adidas but without ever having seen that logo before. On the other hand, generative AI may prepare derivative works, thus violating the transformation right of creators. To this end, legal questions arise around the balance of originality and creativity in generative AI systems. Along these lines, legal questions also arise around who holds the intellectual property for works (including patents) produced by a generative AI.

Environmental concerns. Lastly, there are substantial environmental concerns from developing and using generative AI systems due to the fact that such systems are typically built around large-scale neural networks, and, therefore, their development and operation consume large amounts of electricity with immense negative carbon footprint (Schwartz et al. 2020). For example, the carbon emission for training a generative AI model such as GPT-3 was estimated to have produced the equivalent of 552 t \(\hbox {CO}_2\) and thus amounts to the annual \(\hbox {CO}_2\) emissions of several dozens of households (Khan 2021). Owing to this, there are ongoing efforts in AI research to make the development and deployment of AI algorithms more carbon-friendly, through more efficient training algorithms, through compressing the size of neural network architectures, and through optimized hardware (Schwartz et al. 2020).

4 Implications and Future Directions for the BISE Community

In this section, we draw a number of implications and future research directions which, on the one hand, are of direct relevance to the BISE community as an application-oriented, socio-technical research discipline and, on the other hand, offer numerous research opportunities, especially for BISE researchers due to their interdisciplinary background. We organize our considerations according to the individual departments of the BISE journal (see Table 2 for an overview of exemplary research questions).

Table 2 Examples of research questions for future BISE research on generative AI

4.1 Business Process Management

Generative AI will have a strong impact on the field of Business Process Management (BPM) as it can assist in automating routine tasks, improving customer and employee satisfaction, and revealing process innovation opportunities (Beverungen et al. 2021), especially in creative processes (Haase and Hanel 2023). Concrete implications and research directions can be connected to various phases of the BPM lifecycle model (Vidgof et al. 2023). For example, in the context of process discovery, generative AI models could be used to generate process descriptions, which can help businesses identify and understand the different stages of a process (Kecht et al. 2023). From the perspective of business process improvement, generative process models could be used for idea generation and to support innovative process (re-)design initiatives (van Dun et al. 2023). In this regard, there is great potential for generative AI to contribute to both exploitative as well as explorative BPM design strategies (Grisold et al. 2022). In addition, natural language processing tasks related to BPM such as process extraction from text could benefit from generative AI without further fine-tuning using prompt engineering (Busch et al. 2023). Likewise, other phases can benefit owing to generative AI’s ability to learn complex and non-linear relationships in dynamic business processes that can be used for implementation as well as in simulation and predictive process monitoring among other things.

In the short term, robotic process automation (van der Aalst et al. 2018; Herm et al. 2021) will benefit as formerly handcrafted processing rules can not only be replaced, but entirely new types of automation can be enabled by retrofitting and thus intelligentizing legacy software. In the long run, we also see a large potential to support the phase of business process execution in traditional BPM. Specifically, we anticipate the development of a new generation of process guidance systems. While traditional system designs are based on static and manually-crafted knowledge bases (Morana et al. 2019), more dynamic and adaptive systems are feasible on the basis of large enterprise-wide trained language models. Such systems could improve knowledge retrieval tasks from a wide variety of heterogeneous sources, including manuals, handbooks, e-mails, wikis, job descriptions, etc. This opens up new avenues of research into how unstructured and distributed organizational knowledge can be incorporated into intelligent process guidance systems.

4.2 Decision Analytics and Data Science

Despite the huge progress in recent years, several analytical and technical questions around the development of generative AI have yet to be solved. One open question relates to how generative AI can be effectively customized for domain-specific applications and thus improve performance through higher degrees of contextualization. For example, novel and scalable techniques are needed to customize conversational agents based on generative AI for applications in medicine or finance. This will be crucial in practice to solve specific BISE-related tasks where customization may bring additional performance gains. Novel techniques for customization must be designed in a way that ensures the safety of proprietary data and prevents the data from being disclosed. Moreover, new frameworks are needed for prompt engineering that are designed from a user-centered lens and thus promote interpretability and usability.

Another important research direction is to improve the reliability of generative AI systems. For example, algorithmic solutions are needed on how generative AI can detect and mitigate hallucination. In addition to algorithmic solutions, more effort is also needed to develop user-centered solutions, that is, how users can reduce the risk of falling for incorrect outcomes, for example, by developing better ways how outputs can be verified (e.g., by offering additional explanations or references).

Finally, questions arise about how generative AI can natively support decision analytics and data science projects by closing the gap between modeling experts and domain users (Zschech et al. 2020). For instance, it is commonly known that many AI models used in business analytics are difficult to understand by non-experts (cf. Senoner et al. 2022). As a remedy, generative AI could be used to generate descriptions that explain the logic of business analytics models and thus make the decision logic more intelligible. One promising direction could be, for example, to use generative AI for translating post hoc explanations derived from approaches like SHAP or LIME into more intuitive textual descriptions or generate user-friendly descriptions of models that are intrinsically interpretable (Slack et al. 2023; Zilker et al. 2023).

4.3 Digital Business Management and Digital Leadership

Generative AI has great potential to contribute to different types of value creation mechanisms, including knowledge creation, task augmentation, and autonomous agency. However, this also requires the necessary organizational capabilities and conditions, where further research is needed to examine these ingredients more closely for the context of generative AI to steer the technological possibilities in a successful direction (Shollo et al. 2022).

That is, generative AI will lead to the development of new business ideas, unseen product and service innovations, and ultimately to the emergence of completely new business models. At the same time, it will also have a strong impact on intra-organizational aspects, such as work patterns, organizational structures, leadership models, and management practices. In this regard, we see that AI-based assistant systems previously centered around desktop automation taking over more and more routine tasks such as event management, resource allocation, and social media account management to free up even more human capacity (Maedche et al. 2019). Further, in algorithmic management (Benlian et al. 2022; Cameron et al. 2023), it should be examined how existing theories and frameworks need to be contextualized or fundamentally extended in light of the increasingly powerful capabilities of generative AI.

However, there are not only implications at the management level. The future of work is very likely to change at all levels of an organization (Feuerriegel et al. 2022). Due to the multi-modality of generative AI models, it is conceivable that employees will work increasingly via smart, speech-based interfaces, whereby the formulation of prompts and the evaluation of their results could become a key activity. Against this background, it is worth investigating which new competencies are required to handle this emerging technology (cf. Debortoli et al. 2014) and which entirely new job profiles, such as prompt engineers, may evolve in the near future (Strobelt et al. 2023).

Generative AI is also expected to fundamentally reform the way organizations manage, maintain, and share knowledge. Referring to the sketched vision of a new process guidance system in Sect. 4.1, we anticipate a number of new opportunities for digital knowledge management, among others automated knowledge discovery based on large amounts of unstructured distributed data (e.g., identification of new product combinations), improved knowledge sharing by automating the process of creating, summarizing, and disseminating content (e.g., automated creation of wikis and FAQs in different languages), and personalized knowledge delivery to individual employees based on their specific needs and preferences (e.g., recommendations for specific training material).

4.4 Economics of Information Systems

Generative AI will have significant economic implications across various industries and markets. Generative AI can increase efficiency and productivity by automating many tasks that were previously performed by humans, such as content creation, customer service, code generation, etc. This can reduce costs and open up new opportunities for growth and innovation (Eloundou et al. 2023). For example, AI-based translation between different languages is responsible for significant economic gains (Brynjolfsson et al. 2019). The BISE community can contribute by providing quantification through rigorous causal evidence. Given the velocity of AI research, it may be necessary to take a more abstract problem view instead of a concrete tool view. For example, BISE research could run field experiments to compare programmers with and without AI support and thereby assess whether generative AI systems for coding can improve the speed and quality of code development. Similarly, researchers could test whether generative AI will make artists more creative as they can more easily create new content. A similar pattern was previously observed for AlphaGo, which has led humans to become better players in the board game Go (Shin et al. 2023).

Generative AI is likely to transform the industry as a whole. This may hold true in the case of platforms that make user-generated content available (e.g., shutterstock.com, pixabay.com, stackoverflow.com), which may be replaced by generative AI systems. Here, further research questions arise as to whether the use of generative AI can lead to a competitive advantage and how generative AI changes competition. For example, what are the economic implications if generative AI is developed as open-source vs. closed-source systems? In this regard, a salient success factor for the development of conversational agents based on generative AI (e.g., ChatGPT) are data from user interactions through dialogues and feedback on whether the dialog was helpful. Hence, the value of such interaction data is poorly understood and what it means if such data are only available to a few Big Tech companies.

The digital transformation from generative AI also poses challenges and opportunities for economic policy. It may affect future work patterns and, indirectly, worker capability via restructured learning mechanisms. It may also affect content sharing and distribution and, hence, have non-trivial implications on the exploitation and protection of intellectual properties. On top of that, a growing concentration of power over AI innovation in the hands of a few companies may result in a monopoly of AI capabilities and hamper future innovation, fair competition, scientific progress, and thus welfare and human development at large. All of these future impacts are important to understand and provide meaningful directions for shaping economic policy.

4.5 Enterprise Modeling and Enterprise Engineering

Enterprise models are important artifacts for capturing insights into the core components and structures of an organization, including business processes, resources, information flows, and IT systems (Vernadat 2020). A major drawback of traditional enterprise models is that they are static and may not provide the level of abstraction that is required by the end user. Likewise, their construction and maintenance are time-consuming and expensive and require manual effort and human expertise (Silva et al. 2021). With generative AI, we see a large potential that many of these limitations can be addressed by generative AI as assistive technology (Sandkuhl et al. 2018), for example by automatically creating and updating enterprise models at different levels of abstraction or generating multi-modal representations.

First empirical results suggest that generative AI is able to generate useful conceptual models based on textual problem descriptions. Fill et al. (2023) show that ER, BPMN, UML, and Heraklit models can not only be generated with very high to perfect accuracy from textual descriptions, but they also explored the interpretation of existing models and received good results. In the near future, we expect more research that deals with the development, evaluation, and application of more advanced approaches. Specifically, we expect that learned representations of enterprise models can be transformed into more application-specific formats and can either be enriched with further details or reduced to the essential content.

Against this background, the concept of “digital twins”, virtual representations of enterprise assets, may experience new accentuation and extensions (Dietz and Pernul 2020). Especially, in the public sector, where most organizational assets are non-tangible in the form of defined services, specified procedures, legal texts, manuals, and organizational charts, generative AI can play a crucial role in digitally mirroring and managing such assets along their lifecycles. Similar benefits could be explored with physical assets in Industry 4.0 environments (Lasi et al. 2014).

In enterprise engineering, the role of generative AI systems in existing as well as newly emerging IT landscapes to support the business goals and strategies of an organization gives rise to numerous opportunities (e.g., in office solutions, customer relationship management and business analytics applications, knowledge management systems, etc.). Generative AI systems have the potential to evolve into core enterprise applications that can either be hosted on-premise or rented in the cloud. Unsanctioned use bears the risk that third-party applications will be used for job-related tasks without explicit approval or even knowledge of the organization. This phenomenon is commonly known as shadow IT and theories and frameworks have been proposed to explain this phenomenon, as well as recommending actions and policies to mitigate associated risks (cf. Haag and Eckhardt 2017; Klotz et al. 2022). In the light of generative AI, however, such approaches have to be revisited for their applicability and effectiveness and, if necessary, need to be extended. Nevertheless, this situation also offers the potential to explore and design new approaches for more effective API management (e.g., including novel app store solutions, privacy and security mechanisms, service level definitions, pricing, and licensing models) so that generative AI solutions can be smoothly integrated into existing enterprise IT infrastructures without risking any unauthorized use and confidentiality breaches.

4.6 Human Computer Interaction and Social Computing

Salient behavioral questions related to the interactions between humans and generative AI systems are still unanswered. Examples are related to the perception, acceptance, adoption, and trust of systems using generative AI. A study found that news was believed less if generated by generative AI instead of humans (Longoni et al. 2022) and another found that there is a replicant effect (Jakesch et al. 2019). Such behavior is likely to be context-specific and will vary by other antecedents highlighting the need for a principled theoretical foundation to build successful generative AI systems. The BISE community is well positioned to develop rigorous design recommendations.

Further, generative AI is a key enabler for developing high-quality interfaces for information systems based on natural language that promote usability and accessibility. For example, such interfaces will not only make interactions more intuitive but will also facilitate people with disabilities. Generative AI is likely to increase the “degree of intelligence” of user assistance systems. However, the design of effective interactions must also be considered when increasing the degree of intelligence (Maedche et al. 2016). Similarly, generative AI will undoubtedly have an impact on (computer-mediated) communication and collaboration, such as within companies. For example, generative AI can create optimized content for social media, emails, and reports. It can also help to improve the onboarding of new employees by creating personalized and interactive training materials. It can also enhance collaboration within teams by providing creative and intelligence conservation agents that suggest, summarize, and synthesize information based on the context of the team (e.g., automated meeting notes).

Several applications and research opportunities are related to the use of generative AI in marketing and, especially, e-commerce. It is expected that generative AI can automate the creation of personalized marketing content, for instance, different sales slogans for introverts vs. extroverts (Matz et al. 2017) or other personality traits as personalized marketing content is more effective than a one-content-fits-all approach (Matz et al. 2023). Generative AI may automate various tasks in marketing and media where content generation is needed (e.g., writing news stories, summarizing web pages for mobile devices, creating thumbnail images for news stories, translating written news to audio for blind people and Braille-supported formats for deaf people) that may be studied in future research. Moreover, generative AI may be used in recommender systems to boost the effectiveness of information dissemination through personalization as content can be tailored better to the abilities of the recipient.

The education sector is another example that will need to reinvent in some parts following the availability of conversational agents (Kasneci et al. 2023; Gimpel et al. 2023). At first glance, generative AI seems to constitute an unauthorized aid that jeopardizes student grading so far relying on written examinations and term papers. However, over time, examinations will adapt, and generative AI will enable the development of comprehensive digital teaching assistants as well as the creation of supplemental teaching material such as teaching cases and recap questions. Further, the educator’s community will need to develop novel guidelines and governance frameworks that educate learners to rely appropriately on generative AI systems, how to verify model outputs, and to engineer prompts rather than the output itself.

In addition, generative AI, specifically LLMs, can not only be used to spot harmful content on social media (e.g., Maarouf et al. 2023), but it can also create realistic disinformation (e.g., fake news, propaganda) that is hard to detect by humans (Kreps et al. 2022; Jakesch et al. 2023). Notwithstanding, AI-generated disinformation has previously evolved as so-called deepfakes (Mirsky and Lee 2021), but recent advances in generative AI reduce the cost of creating such disinformation and allow for unprecedented personalization. For example, generative AI can automatically adapt the tone and narrative of misinformation to specific audiences that identify as extroverts or introverts, left- or right-wing partisans, or people with particular religious beliefs.

Lastly, generative AI can facilitate—or even replace—traditional crowdsourcing where annotations or other knowledge tasks are handled by a larger pool of crowd workers, for example in social media content annotation (Gilardi et al. 2023) or market research on willingness-to-pay for services and products (Brand et al. 2023). In general, we expect that generative AI will automate many other tasks being a zero-shot / few-shot learner. However, this may also unfold negative implications: Users may contribute less to question-answering forums such as stackoverflow.com, which thus may reduce human-based knowledge creation impairing the future performance of AI-based question-answering systems that rely upon human question-answering content for training. In a similar vein, the widespread availability of generative AI systems may also propel research around virtual assistants. Previously, research made use of “Wizard-of-Oz” experiments (Diederich et al. 2020), while future research may build upon generative AI systems instead.

Crucially, automated content generation using generative AI is a new phenomenon, but automation in general and how people are affected by automated systems has been studied by scholars for decades. Thus, existing theories on the interplay of humans with automated systems may be contextualized to generative AI systems.

4.7 Information Systems Engineering and Technology

Generative AI offers many engineering- and technology-oriented research opportunities for the Information Systems community as a design-oriented discipline. This includes developing and evaluating design principles for generative AI systems and applications to extend the limiting boundaries of this technology (cf. Section 3). As such, design principles can focus on how generative AI systems can be made explainable to enable interpretability, understanding, and trust; how they can be designed reliable to avoid discrimination effects or privacy issues; and how they can be built more energy efficient to promote environmental sustainability (cf. Schoormann et al. 2023b). While a lot of research is already being conducted in technology-oriented disciplines such as computer science, the BISE community can add its strength by looking at design aspects through a socio-technical lens, involving individuals, teams, organizations, and societal groups in design activities, and thereby driving the field forward with new insights from a human–machine perspective (Maedche et al. 2019).

Further, we see great potential that generative AI can be leveraged to improve current practices in design science research projects when constructing novel IT artifacts (see Hevner et al. 2019). Here, one of the biggest potentials could lie in the support of knowledge retrieval tasks. Currently, design knowledge in the form of design requirements, design principles, and design features is often only available in encapsulated written papers or implicitly embedded in instantiated artifacts. Generative AI has the potential to extract such design knowledge that is spread over a broad body of interdisciplinary research and make it available in a collective form for scholars and practitioners. This could also overcome the limitation that design knowledge is currently rarely reused, which hampers the fundamental idea of knowledge accumulation in design science research (Schoormann et al. 2021).

Besides engineering actual systems and applications, the BISE community should also investigate how generative AI can be used to support creativity-based tasks when initiating new design projects. In this regard, a promising direction could be to incorporate generative AI in design thinking and similar methodologies to combine human creativity with computational creativity (Hawlitschek 2023). This may support different phases and steps of innovation projects, such as idea generation, user needs elicitation, prototyping, design evaluation, and design automation, in which different types of generative AI models and systems could be used and combined with each other to form applications for creative industries (e.g., generated user stories with textual descriptions, visual mock-ups for user interfaces, and quick software prototypes for proofs-of-concept). If generative AI is used to co-create innovative outcomes, it may also enable better reflection of the different design activities to ensure the necessary learning (Schoormann et al. 2023a).

5 Conclusion

Generative AI is a branch of AI that can create new content such as texts, images, or audio that increasingly often cannot be distinguished anymore from human craftsmanship. For this reason, generative AI has the potential to transform domains and industries that rely on creativity, innovation, and knowledge processing. In particular, it enables new applications that were previously impossible or impractical for automation, such as realistic virtual assistants, personalized education and service, and digital art. As such, generative AI has substantial implications for BISE practitioners and scholars as an interdisciplinary research community. In our Catchword article, we offered a conceptualization of the principles of generative AI along a model-, system-, and application-level view as well as a social-technical view and described limitations of current generative AI. Ultimately, we provided an impactful research agenda for the BISE community and thereby highlight the manifold affordances that generative AI offers through the lens of the BISE discipline.