When did Confluent, Inc. (CFLT) hold its June 13, 2023 earnings call?

Confluent, Inc. held its June 13, 2023 earnings call on June 13, 2023.

How can I access Confluent, Inc. earnings call transcripts via API?

Confluent, Inc. (CFLT) earnings call transcripts and historical archives are available through the EarningsCalls.dev API. The API returns full transcripts, speaker segments and AI-generated summaries for Confluent, Inc. and 9,000+ listed companies across 70+ countries, typically within ~30 minutes of the call's publication.

Confluent, Inc. (CFLT) June 13, 2023 Earnings Call Transcript & Summary

Shane Xie

executive

#1

All right. Look at this room. All right. Welcome to Investor Day 2023. I'm Shane Xie from IR. Thanks for spending the time with us today. We have a lot to cover today. So let's just start with my favorite slide, that today -- during today's presentation, we will make forward-looking statements regarding our business, financials and future prospects, which are subject to risks and uncertainties as described in our safe harbor and most recent Form 10-Q, we undertake no obligation to update these statements after today's presentation except as required by law. We will present non-GAAP financial measures, which should not be considered as a substitute for GAAP financials. A reconciliation between these GAAP and non-GAAP financials is included in the appendix in today's presentation. Now with that out of the way, please join me in welcoming the next speaker to the stage, Confluent Co-Founder and Chief Executive Officer, Jay Kreps.

Edward Kreps

executive

#2

Thank you, everyone. I really appreciate everyone coming and spending a little time with us today. And it's my pleasure to dive into a number of different areas of the business. We're going to kick it off with me talking a little bit about data streaming. What is it? I know some people have been with us for a while and know all about this area in great depth, some are totally new to the story. And so I'll talk a little bit about that. And then we'll dive into our product vision in more depth. We'll talk about our go-to-market model. We'll get to hear from some of our partners, some of our customers and we'll get to talk about some aspects of the business in more depth than we have. So there's exciting stuff throughout the day. And so I'll start with just a snapshot of the business. So in the last 2 years since we went public, the business has grown tremendously. We're now at a $700 million run rate, 38% year-over-year growth, 130% net retention rate on a path to breakeven by Q4. And a really important part of this, especially in the last 2 years has been our cloud product. As we went public, this was kind of a fledgling business but had been a huge focus and investment for us in the underlying technology. We felt very confident about the trajectory that, that was on. And that has really proven itself. That's now on a $300 million run rate, 140% NRR and really become a huge part of our business. And in many ways, the default when we think about how we build, how we release features, how we deliver value to customers and very much a part of the infrastructure for data for a lot of our customers and how data flows between environments. And we've done all of this really in a short period of time. Confluent is still a young company. We were created in -- end of 2014. So this has been a fast rise for us. And what's allowed that is really a pretty significant paradigm shift that we're part of. When we think about data and the infrastructure for data, that's an area that really grew up around storage. How do we take a pile of bytes, how do we store it in 1 place, that was the rise of file systems and databases. It's all about how do I look up the right little bit of data at the right time. And that turned into an enormous category and market, but it doesn't solve the whole problem. And the reason it doesn't is because data isn't just about storing the bytes for 1 application. Increasingly, it's about how all the parts of a company put together because data isn't stored in one place anymore. It's not just about static usage at a point in time. Increasingly, data is spread in an organization across hundreds or even thousands of databases, applications, SaaS systems, data warehouses, data platforms, and the task for companies is to bring all this together into one set of customer experiences, one operational backbone for the company. And that was a problem that was never really considered in traditional data infrastructure. Nobody thought about how data flowed. They only thought about how it was stored. And for a long time, this was something that was solved really with kind of duct tape. Things you would build or a hodgepodge of different partial solutions and nobody really thought there was a general purpose way of connecting the parts of a company of integrating data. And that was really until the rise of data streaming. And this took a whole bunch of problems that people saw as separate. How data moved, how it was processed, how is it flowed, and came up with a very general purpose platform for this. And this idea of real-time data, the idea that you could have data flowing continuously, it made that mainstream. So the fundamental idea here is instead of treating data as something that's stored in a pile, you treat it as a stream. You can imagine the stream of what's selling in a retailer, all the products that are being sold all around the world. You can imagine the stream of shipments and the logistics of any operation. You can imagine the stream of trades that are occurring or customer experiences that are being delivered. And this is a very natural way to think about a business. The business is a very real-time entity. Things are happening out in the world all the time. And the business is about how you react and respond to that. And so having the computational capability to take these streams of data, feed them out to all the systems that need them, react and respond and compute against them in real time. That's a very fundamental, foundational capability. And so it's actually kind of hard to imagine that it was missing. This whole time, but it was in part because it was hard to do. And the technology to do this wasn't around it. So people were faking parts of the problem in different data movement tools. They were faking it in different application layers and foundations. And what happened in this area was we kind of went back to basics, rethought how you did this, the idea of how you would do core processing was brought into real time, the idea of how you would think about modeling data was brought into real-time, this co-occurred with the rise of distributed systems and the ability to do this across a company in the large. And that enables a really powerful role for this technology. This kind of big intertangled mess that companies have of this system kind of hands, knitted into that system, which is kind of duck taped to this other system can be turned into something that is much more robust, that's much cleaner, it's much cheaper to build against and maintain and enables new capabilities. You can take this big mess and turn it into something where the parts are connected in a more structured way. We like to think of this has been kind of like the central nervous system in a person, right, where the different cells are not just connected via a mishmash, there actually is some structured way that the impulses of what you see, what you feel come through and all the parts of the body can react and respond in real time to that. And a similar thing is needed in a business, right? Every application can hand wire itself to everything else. It can't reinvent the kind of fundamental infrastructure for working with that kind of data. You need to have something that says, "Hey, a sale occurred." And the hundreds of applications that react to that in some way, you need to be able to tap into that and respond and process that as it occurs. And that ultimately is what Confluent provides. That's what data streaming is about is that substrate of the flow and real-time processing of data. And this isn't just theoretical. Early on, this was seen as a kind of niche like, oh, this real-time stuff, that's great. Maybe some of your data movement needs will be real-time or some of your processing will be real time. But increasingly, the technology proved itself out and it became, in many ways, the default, right? And it became something that could be as reliable as batch processing, as high throughput as batch processing and as well-known as batch processing. And as that's happened, you've seen the adoption go up. And so this graph shows the unique users for Apache Kafka and that has been this rapid rise up and to the right over the years that we've worked on this. And this has become one of the most popular open source projects in the world. So this transition into real-time data it's not just theory or something that's kind of in the very early days, it's something that's happening out in the world at large in production, in some of the most mission-critical use cases in the world. And you can see that in the adoption of this technology, if you look at the Fortune 500 companies using Kafka, it's incredibly prevalent, right? This is the ones that we know of as with any open source thing. You don't know everybody who uses it. And so it kind of rounds up to everyone, right? This is being used everywhere. And we've started the commercialization of that. This is a set of Confluent customers. And you would see 9 out of the 10, top 10 travel and retail companies; 10 out of the 10, top 10 banks; 8 out of the top 10 insurance companies and so on. And there's a lot here for us to go and pick up. We're really kind of just starting this journey of picking up all the use cases and customers. And we're excited about that opportunity in front of us. And so there's a ton of open source use cases left to go get. And one of the exciting things about our cloud offering has been our ability to go convert those. Our ability to go pick up those customers and bring them on to our platform. And this has been powered by an incredibly diverse set of use cases. So there's a set of common use cases you would see in every industry, building out real-time pipelines for data, the integration between on-premise and cloud environments, building a real-time 360 customer view, fighting fraud and ensuring security, acting as a backbone for micro services. Those are the kind of common IT focused use cases. But some of the most interesting things are the ones that are unique to individual industries, managing inventory, helping with claim automation and insurance, personalizing recommendations in entertainment, this list goes on and on. And you can read about these broadly on the Internet because as with any open source technology, there's lots of descriptions. You can read about how companies are putting this into practice in their business. And the breath here, I think, is incredibly important because the key analogy for data in motion is really to this world of data at rest, right? It's really to databases. And one of the key thesis we've had since the formation of the company is this is a broad data platform. It's something that's going to form the backbone for data flow in all companies. And part of the proof points for that is you want to see not just 1 big use case or 2 big use cases, which would probably mean it would be better suited to a verticalized platform than a kind of broad platform technology. What you want to see is this breadth, where every company has not just 1 or 2, but hundreds of use cases around the technology. And there's been a set of trends in the world that have driven this adoption, right? There's a kind of natural idea that things would move to real time. Nobody ever said that they wanted their data slower or they wanted things to be more out of date, but there's been trends in our industry that have helped push this word. I'm going to touch on a couple of these. I'll start with just the general expansion of data, software, the cloud. This is a story that doesn't need a ton of support, I think everybody knows, right? There are more applications. Those applications are more data rich. The spend on cloud environments has increased with that -- has come a whole new set of systems that have data. And that leads to this big mess. Data is spread over more places than it was before, in part because it's easier and cheaper to adopt more of this stuff, in part because companies are just spending more. But that combines with the second trend, which is the expectations that customers have on digital customer experiences have gone up. This has become increasingly a default way of doing business. And customers want the digital interaction to be a first-class citizen. They wanted to know everything that they've done with the organization. They want it to be up to date. They want it to be personalized and contextual and all of those things put pressure on the infrastructure layers. It's really hard to serve that. If what you're doing in the back end is a bunch of old mainframes and unintegrated relational databases and antique messaging systems, all spread across dozens of applications. That doesn't come together into something that meets any of those requirements. And so all of this puts pressure on the modernization of data flow. And in order to make this possible, you have to be able to connect this diverse set of operational data systems, SaaS applications, analytical systems, custom applications, and you have to be able to do it broadly across all the environments in real time. And increasingly, companies are split across geographically diverse environments, they're split across multiple different clouds and on-premise installations. And one of the key aspects of our technology is the ability to bridge all this. The ability to show up in each of these locations in each of these environments that a customer has and be able to bring together all the data in real time across all of this. And I'll touch now on the third one, which everyone has been waiting for me to get to, which is the rise of artificial intelligence. And in many ways, Kafka was created at LinkedIn to help drive the richer use of data in some of the machine learning and data-driven applications there. And that's continued. But this is something that is really evolving and changing with the rise of generative AI. This is something that has some differences in its architecture, it has some differences in how it's used, has some differences in the implications from traditional machine learning. I'm going to touch a little bit on this. And when I talk to investors about this, I think broadly, people kind of understand the thesis of how we would fit in. They say, okay, it's kind of very natural that AI is going to drive more usage of compute and more usage of data. And so I kind of get that you guys are going to help move the data around, and so that makes sense. But I want to get a little bit more into the details of how this fits with what's emerging as the modern architectures, and why and how this is different from what already had happened with more traditional machine learning modeling capabilities where you would see Kafka and Confluent kind of broadly used today. And so the key difference for generative AI and especially these large language models is that the models are reusable and they're not generally built by the company that's deploying it. This is something that kind of comes pretrained. And it's trained off of public data sets, which are kind of broadly available. And this training process is very expensive. So in traditional machine learning, these models are built in-house by a team of data scientists, off of your data. And with large language models, that's not the case, right? OpenAI is running some big process, which builds this. What that means is that the key question is no longer the model building, it's really about how do I combine this reusable model with my data, with my information about my customers what I have, right? And that's not going to be really mostly in the training process. So it's actually done at prompt time, in real time. And so there is a bit of a shift. So whereas with traditional machine learning, a lot of the focus was on model building. It was about, hey, how do I get data into my data lake, and how do I train ad hoc models off that problem by problem? With generative AI, this now moves to inference time. The focus is now how can I take the data I have about my customer and put that into action? How can it know the specifics of what you've done about my offerings, where things are at, how can I index that and get it ready to feed into a prompt at the right time for the right customer. And so the roles that we have played in this with traditional machine learning was largely feeding data into the data lake and then creating some of the real-time features. What's happening is that off-line batch training in the data lake, it was largely going off to OpenAI. But the feeding in of real-time data is increasing because it's easier and easier to add more context there. And so you can think of an example as a kind of, if I want to add a real-time chat bot that gives information about my business, I need to bring together the information about you, their current customer asking, about my services and kind of broad policies, and I need to combine that with this kind of general purpose model of the world. And in order to fuse that, I have to do it by actually accessing all the systems that I have that have that data. And so this is increasingly the problem that our customers are looking at. How do I actually access this big mess of systems that I've got? And how do I make sure that what I would be presenting in any output is up to date with your interaction with me, with the state of the world? And the answer to that can't be that you just kind of ad hoc hook into every system you have. You need to have some kind of structured indexing. And this is where Confluent comes in, is that ability to actually bring your data together, feed it into the right systems to be able to index this for AI and able to look it up. And I'll go through some examples of how this works out in practice in some of our customers where you can see this happening live and in production. And I think that this is actually an incredibly important role to play. And one of the reasons I think that, that's true, especially if I was thinking from the perspective of an investor, is that there's a lot of uncertainty around the final state for generative AI. Will these models be consumed via open source? Or will they be consumed as a service? I think we're not really sure, right? If they're consumed as a service, who will be the provider of that service that gets most of the business? I think the answer is we're not sure, right? When they are consumed as a service, where will the value accrue? Will it be primarily in existing SaaS platforms, which maybe add new capabilities? Or will it be in disruptive start-ups, which build purely around AI from the ground up? I think the answer is we're not sure. What are the kind of new specialized AI technologies that are going to be needed to help adopt to this? I think the answer is we're not sure, right? So there's a lot of these areas where we actually just don't know what the end-state architecture is going to be in 2 years, let alone 5 years or the time frame that this kind of stuff plays out on. There's a few things that we do know, though. We do know that it doesn't really matter how you consume the model. It's going to use a lot of compute, right? So that's like one really sure bet. The other thing we know is that companies are going to have to really get a hold of their data and be able to feed this into these models to be able to take any advantage of this and put it into practice. So when you think about the beneficiaries, I think this data flow and compute are kind of very obvious ones that are kind of right there in the supply chain for making this work. And I think one of the nice things about our area is if you ask people, okay, how are you going to build the data architecture that gives you this kind of real-time access to data that allows you to take advantage of this? I think Kafka and Confluent is going to be a very solid answer to that question that most people would give you back. And that is why I think this shows up so commonly in the traditional machine learning architectures around using this and this is why I think it also is showing up now in these generative AI applications that are coming to production now. So those are kind of the 3 driving trends of new things. The other trend that I think is on everybody's mind, is just economic uncertainty, and this is what we hear from customers. This is one we've talked a lot about in our earnings call as one of the pressures on the business. But it's worth noting that this is also something which has a silver lining, where a lot of our customers are going from thinking about, hey, big internal teams that run open source data platforms to just consuming something as a service. And we've seen increased kind of build versus buy analysis and real interest in the TCO message that we have around our cloud offering. And we'll get into a little bit of what that enables, why that's compelling to companies that are trying to be more efficient, and why also like access to data and being able to drive better automated decision-making is part of how companies actually drive savings and realize efficiencies out of the software stack that they've already acquired. So that's a little bit about the underlying paradigm shift and the kind of trends that have helped drive this out into the world. And that kind of leads us to Confluent, and the content that we're going to show for the rest of today. Our mission is to help accomplish this transition. We want to set data in motion. We want to make this a major platform in every company in the world. That's what we're here to do. And we've done this in a number of companies in really incredible use cases. And so I'll talk about a few of these examples. So I'll start with Expedia. So Expedia has been on this journey with us for a number of years. They have a very rich and broad use of streaming technology that cuts across a lot of their interactions and helps integrate all the information about the customer and travel, et cetera. They think in a stream-first way. And naturally, as they were thinking about how to integrate AI, and I think they were a little bit ahead of the curve on this one. I think [ Samala ] who was maybe on the board. They were early in the adoption of this stuff. The natural use case for them was interaction with customers. So despite being an online booking service, they said that 58% of their customers would contact them live. And so I talked with Anush from the Expedia team at Current last fall, and we talked through a little bit about this AI chatbot that they built, and it's actually really cool. So what they wanted to do was improve that experience. If their customers have to call them. And first of all, a customer has to wait. So it's not that great. Secondly, the experience of talking to the agent is not wonderful. So how can they unlock that? And in terms of that is pretty hard in UI, right? Because what the customer wants is pretty diverse and finding it in all the ways you click through is not that easy. So the chat interface is actually very natural. They had existing ways of chatting with agents, so this plugged in. And it was incredibly successful. So to do this, they had to actually bring to bear the information about your travel, right? What is the flights that you have booked? Which of those have actually happened? Are they delayed? Where is your stuff? All the things that people would actually ask about that were relevant to them. That had to be available to the agent that was doing the AI chat, right? And so the way that they do this is by combining a general purpose model with that specific information about the customer, and about broad questions and putting that together in real time to produce contextual, relevant answers. So if you ask, "Hey, I missed my flight, can I find another one?" They have to know what your flight is and what the other options are and be able to carry out that search. And so this was incredibly successful for them. At the time that we talked at current, it had already handled 29 million virtual conversations. So this is one of the early AI use cases to really make it in production. And it saved over 8 million hours of live agent time. And so this is a really big deal that unlocks a whole another way of interacting with them. Another customer is Michelin, the tire company. So you would think what on earth would making tires have to do with real-time streaming data. And it turns out quite a lot. This is a business that has an incredibly complex supply chain, and an incredibly complex path out to market. And so that whole supply chain, inventory, e-commerce and distribution mechanism is really being rebuilt around a live streaming view. So they know where things are, where things are coming from, all the quality control end to end. They can now tap into that and use this across different parts of the business. They started with open-source Kafka and then moved to Confluent Cloud really to get something that they could depend on as a mission-critical system and realize the TCO benefits of that. And this went from just serving that kind of inventory use case rolled out to then serve the Michelin guide. And so I actually wasn't aware that the Michelin ratings for restaurants was the same as the tire company, turns out it is. And the publishing cycle for that went from something that took 2 days. And this is actually really important for them. They're trying to go from something that's a published book to something that's an online app where you actually find out about restaurants when you're looking for a restaurant. It's really important for them to have all the kind of data-rich features that Yelp or other online service would have and integrate that. And so this now cuts across many lines of their business. And finally, Instacart. This is a very natural real-time technology-focused business. The initial use case at Instacart was around personalization across kind of mobile and web and giving a single view of their customer. But this rolled out across many parts of the business. And is being used to capture changes from their databases, is being used to actually model a real-time view of inventory. You can imagine their core business is knowing what's in stock where across all of these different stores, having a real-time view of those products as they evolve and being able to provide that in a way that's contextual and send shoppers to the right place for the things that you actually have and makes smart suggestions when they don't, et cetera. And they've done actually a really great log on the underlying architecture for some of the machine learning use cases that power this. This is actually an interesting one if you're interested in how we fit in both to the kind of traditional machine learning as well as like newer world, they had a very kind of batch-oriented machine learning system that they were using to do a lot of the inventory modeling, a lot of the predictive suggestions. And they talked about how they moved that to Kafka and Flink and other open source technology that we're adding to our stack to enable the move into real time. And this is an incredibly foundational thing for them. In order to do this well, all the predictions and suggestions they have, have to be up to date with like what's in the store. Otherwise, it just gets the wrong answer. And so this was something they've done over the last year and really wrote up a very detailed technical blog about it. So in the rest of the section today, other folks on the management team are going to go through a bunch of aspects of the business. So we'll get into a little more depth on our product and some of the differentiation it has. Why is it that customers are coming to this cloud service rather than just using the open source? Well, we focus on 3 pillars of differentiation: making something which is truly cloud native. Making a complete data streaming platform that brings together all the parts that a company would need to harness streaming data and then making this available everywhere in all the environments that they would operate on. And I'll touch on just a little bit of each of these. So I'll start with cloud native. So what does that mean? To a lot of people, it's kind of a buzzword, like something is cloud native, is there any substance to it? It turns out there is, right? When you look at the difference between something like a Snowflake, and then something like Teradata or Redshift. There really is a very different way of consuming something that's built as a cloud-native service. And this is something that matters a lot to customers. And it's not just an adjective that's sprinkled on top. It actually goes right to the heart of how the service is built. So what serves the underlying Kafka infrastructure in Confluent is something called Kora. And this is an incredibly sophisticated system that was built to run across the 3 clouds and provide tens of thousands of Kafka clusters as a fully managed service in a way that's totally elastic. And that actually provides these benefits to our customers. So they can use just what they need. They cannot think about how you pre-provision the right amount of storage or how you scale it up or down, really consume this as a kind of utility. And this provides better elasticity. It provides better resiliency. So we offer much stronger guarantees than anybody else who offers any kind of service in this space. It provides infinite amounts of storage. And then perhaps most importantly, it's backed by a much cheaper cost of operations. So we have, we think, probably about a 1,000x advantage in the human cost of operating this than a company that kind of tried to do it themselves with the open source. We have a very significant advantage in the underlying use of cloud infrastructure. A lot of that comes out of the multi-tenancy, but also just how the service is built from the ground up. And this adds up to a really strong advantage in TCO. Something that's up to about 60%, depending on the size of the team and the use case that we're targeting. And this is really important. This means it's just coming in for just the core Kafka bits. We can offer something to customers that is just not just more feature full, not just a better product but also cheaper. So they get something that's a better deal and does more. And this is a really powerful message, particularly in the environment we're in right now, where, as I said, all the same pressures on the customer experience, on integration, on doing the new digital things are there. But at the same time, there's no pressure to do it more efficiently, right, to do it while spending less, to do it with fewer people. And this is something that we enable. So that's a little bit about cloud native. We'll get into more depth in that area and what it enables. Next is complete. So what we offer is not just Kafka, and what customers need is not just the core stream of data. They need a set of rich capabilities on top of that. They need the ability to connect into all the different systems they have and capture the real-time feed of data. They need the ability to govern that flow of data across their organization. They need the ability to process that data in real time as it occurs, and they need the ability to share it, both within their organization and between organizations. And these are capabilities that we bring around this core streaming abstraction that kind of complete the streaming platform. We'll get into more details on what our platform has here, and in particular, on the importance of stream processing. This is an area that we think is going to be a big expander of our ability to capture some of the value that we create. Today, a lot of the application development that customers would do is kind of out in their account. It's not really monetized by us. They might produce streams of data, we would monetize the stream, but the application is theirs. As the stream processes and capabilities come into our service, that application in itself will move to a what we're selling to the customer and what they're paying us for. It will drive up the consumption that they have with us. And we're doing this with a technology called Flink. This is a very popular open source technology used for stream processing. And we think that this is really important. When you think by analogy to data at rest, if you think about the stack for stored data, well, the foundation of that stack would be the file system. You have one of your laptop it's storing data. But usually, the layer for integration with an application is something which combines not just storage but also data processing, and that's the database. And so when we think about where was a lot of the value captured in the data at rest stack, it was in that database use case, it combined processing and storage. When we move to data in motion, maybe the analogy to storage is that stream, and that's Kafka. But when you think about the integrated processing layer, that's Flink, and that's a huge opportunity for expansion for us. Date in motion is just getting to that layer where now it becomes incredibly easy to build applications that tap into this, they use it. This, we think, can really exponentially increase the usage of streams. And you would see that in the rise of the open source Flink project, which is newer than Kafka. It hasn't been around as long, but it's kind of following the same up and to the right trajectory in terms of adoption. And this is an area where we brought in a lot of the core developers of Flink through an acquisition, and we're adding that to our platform at Kafka Summit. Just recently, we announced the first release of this out to customers in early access form, and we'll be rolling that out through public access and GA through the rest of this year. And then finally, the third pillar here is offering this everywhere. And this is something which is in many ways, unique to streaming, that it has to be not just where 1 application is but where all your applications are because it has to plug into all the different systems that you have. And this ability to span all the parts of the company and turn it into 1 fabric for the flow of data. This is something that's uniquely important in our area. And this has been important for us when we think about the competition with cloud providers, when we think about other on-premise technologies, it's not enough to serve customers in 1 cloud or serve them just in their data centers. You have to be able to bring it all together and actually connect it together into 1 product. And that is a critical thing that we offer. So those 3 differentiating pillars being cloud native, complete and everywhere, those are things we've built around now for a number of years. If you looked in the documents around the IPO, we would have the same 3 pillars we're building around the same 3 things. They matter to customers. We've built significant IP around each of them. And that's one of the things that's driven our success as a product beyond the open source technology. And then we've really built, I think, a unique customer journey and go-to-market around this. And Erica is going to go into more depth about what we've done. But this comes out of the fact that the usage of this technology is a little bit different from other data technologies. Database is the back end for one application, but it's kind of an island into in and of itself. But it streams fundamentally, they go between things. They connect different applications. They connect to different parts of the company. And that means that there's a very natural network effect that drives adoption. And this starts within a company where maybe the first application comes for the features, right? If I'm a retailer and I'm building an inventory system, I need to bring in the stream of sales and to bring the stream of product information, but the next use case probably comes for those streams. It comes because this is probably the only place you can get a real-time feed of what's selling or what products you have on hand. And that application brings its own streams of data. And those streams attract new applications and so on. And that spins up the usage of this technology within an organization from that first use case to something that spreads. And this happens industry-wide as well. When we think about why has Kafka remained so prevalent over the years as this is spread? Why have all the competitors kind of stayed in the single-digit market share? Why has nothing else been able to catch up? In part, that's because we've worked hard and stayed ahead. But in part, it's because there's a strong gravitational pull. If you are a system and you're building integration into the world of streaming, you're going to pick the thing that connects to everything else. The thing that has the broadest ecosystem, that has all the connectors that everybody knows how to use. And increasingly, this brings in a whole set of partners that help drive us, right? The cloud service providers, other technology, ISVs, the SIs that are taking and building this out into use cases within customers, these are becoming an important force for us, and we'll talk a little bit more about this in the go-to-market area. And then structuring these efforts around the natural progression within a company. Our customers don't start by installing a new central nervous system. They start with one application. And so how do we land for that first application as easily and as low friction away as possible? How can we leverage kind of product-led growth and early sign up and experimentation with as little friction just with developers? How can we help that grow within the organization and spread go up the chain of command and turn this into something that's a broad data platform that's used across the organization that is -- becomes a mandated part of their data architecture that grows into that central nervous system role. Being able to orchestrate this and build our teams, our product capabilities, our consumption model, all around this customer journey in a way that we're providing value to the customer in the right way, at the right time for where they're at, that's an incredibly important part of how we take this to market. And that means combining the product-led part of the journey, the open source adoption, the cloud sign-ups, the experimentation of existing customers with new features, the expansion to that on their consumption plan with an enterprise sales effort is able to take this up the chain of command that can influence data strategy broadly as we come to that. This is a critical part of how this succeeds in companies broadly. And then finally, we'll touch a little bit on the business we've built around this, and we'll have some new material in a number of these areas. We'll talk about our execution so far. We'll talk about what has helped keep that strong and consistent. We'll talk about what gives us insight into where we're heading. And some of the durability of growth. We'll get into a little bit more depth on the TAM and how to look at that in different -- with different lenses from kind of a product lens in other areas. We'll talk about what underlies some of the customer growth and give a little bit more detail about how to think about our RPO and what that means. And then a little bit of information about how we're balancing growth and profitability. We'll give a little bit more depth on the growth and profitability model that we gave at our IPO and some refinements around that. So all of this will come in the sections that are coming up. And with that, I'm going to turn it over to the first session, which is Shaun and Chad, our Chief Product Officer and CTO, who are going to talk a little bit about our product strategy and how we're innovating for the future, and how we're really taking data streaming forward. So Shaun.

Shaun Clowes

executive

#3

Thank you, Jay. It's fantastic to be here. Professionally, I've been in technology for a very long time. I was most recently at Salesforce owned MuleSoft, which makes software that helps enterprises join their systems together. And over the past 10 or 20 years, I've had the opportunity to chat with many, many business leaders across many business domains in many geographies. And a common refrain across basically all of those conversations is just how difficult enterprises find it to join their data across their various different enterprise systems to deliver new customer experiences or employee experiences and how difficult it is they find it to embed their data and their analytics in all of their business functions to make better business decisions. It is actually much harder than it looks, and it's holding organizations back from being able to take advantage of their data. And that's the giant spaghetti mess that Jay just talked about. But I'd like to take a moment and talk a little bit about why this spaghetti mess exists. What is driving it? Why is it pernicious? And how can we start to solve some of these problems? In general, you can think about data in the enterprise as existing broadly across 2 major estates, the operational estate and the analytical estate. In the operational estate, the data is stored as part of running the business. And in the analytical state, it's stored and reported on to try and understand how the business is performing and make business decisions. Now these different estates have very different users, different use cases, and they grew up separately so they fundamentally have different approaches, different architectures and entirely different technology stacks. Now let me dig in just a little bit into the mass. In the operational estate, what you're looking at is the business applications that drive all the value because they are the things that actually enable you to do your business products and services. So think about something like a CRM for customer relationship management or an ERP for supply chain and logistics. Now those applications obviously have need to store their data, so they have a database that sits alongside them in operational data store. But to be clear, the data as it sits on the disk is one thing, but a lot of the value is actually driven from the business processes and user interfaces that manipulate that data and use it and sit on top as part of the business application. Now if you went back 15 years ago, we used to have very large, very monolithic business applications that could do entire large lines of business. So if you went and purchase some stuff in a store, you might find that the cashier was using a terminal that was connected to a mainframe. And the mainframe application could do everything related to retail. It might have a product catalog, it might do payments, it might do inventory and might do almost everything related to retail. But in the modern well, it just isn't like that anymore. That is just too hard. And so instead, we've separated out a lot of those different business capabilities into different business systems. Now that's okay. But obviously, any customer experience or employee experience is going to touch many of those different systems. They're going to have to work together to deliver anything you recognize as a business capability. So how do you solve that? Well, the truth is that in the enterprise, you write a lot of code and you join these systems one by one, painstakingly together to deliver these capabilities. Now that's expensive and tedious but it's absolutely critical because every organization is trying to deliver new products, new services, new business competitiveness, et cetera. So what's going on in your operational estate? In your analytical estate, your business leaders are trying to understand how the business is performing, and be able to make decisions. They're asking questions like, how is marketing going? Are we driving sales? Is sales going well? What is sales performance like? How is the overall business posture looking? And so they're asking these questions and in order to answer their questions, typically, what happens is there are batch jobs that suck the data from all of those operational data stores into one enormous data warehouse and try and stitch it back together. Contact-free stitch it back together to produce a view of what is going on inside the enterprise so that they can make business decisions. Now what's actually happening, though, is that the environment continues to get ever, ever more complex. In the operational estate, you can see that there are an explosion of new different business applications that are being deployed. Think of SaaS applications, microservices, new homegrown applications. Every time you're adding a new business capability or a new mobile app or a new voice assistant skill, you're adding new systems into the operational estate. And those operational estates -- operational applications are bringing even more operational data stores and different database technologies to the operational estate. This is getting incredibly more complex and you still have to wire everything together to deliver decent customer experiences. And that same complexity that you saw in the operational estate is getting manifested back over in the analytical estate as well. You're still trying to bring together all of that information from all of these different systems and try and make sense of it. And it's become easier than it has ever been to stand up massive kind of data warehouses, but you're still fundamentally migrating data, you're usually doing it in batch because it was built to serve the needs of reports that we looked at once a week, once a month or once a quarter, you're doing a much, you're trying to produce a simulacrum of the world, something about how the world actually looked back in the day. But monthly isn't good enough anymore. You're looking at weekly, daily, hourly, for example. And when the data is incorrect or it's not up to date or doesn't exactly mimic the real world, it's hard to use it to make any significant business decisions. This is a very, very complex landscape that exists in the enterprise everywhere. So you just help me describe some of those problems. And I also described how important they are in the enterprise because every organization needs to continue to innovate. And so what actually happened was the industry built entire technology stacks inside each of these estates to try and make the problem just a little bit easier. In the operational estate, you're looking at things like message-oriented middleware, application integration and governance and change data capture. And in the analytical estate, you're looking at tools like data pipelines, data processing and data governance tools. Now these fragmented tool sets, they kind of make it a little bit easier, but fundamentally, it's still a one-by-one game, solving it piece by piece by piece. And what's worse is that you solve every problem at least twice once in the operational state and once in the analytical estate. It's unbelievably tedious and painstaking to try and make it work, and worse, what typically happens is when you move data from the operational estate into the analytical estate, you haven't brought all of its context. It's lossy. It's not a perfectly great representation of what truly has happened. And even when you're ready to take action back in the operational estate, you can't easily do that. It's kind of a one-way door from one estate into the other estate. It's really, really messy. And yet the world just marches on and is not waiting for this mess to get solved. Consider your average customer experiences these days. when I'm shopping on a mobile app for a specific item, I want to know, is the item available? When will it be delivered if I order it right now? I want you to give me personalized offers, personalized coupons, personalized recommendations. It all has to just work. Those are my customer expectations. And when I actually go ahead and I ordered the item, a whole bunch of things have to happen, again, all in real time. You have to process my payment, sure that's obvious. But you also have to run our fraud checks. You have to reserve inventory item in the warehouse. You have to reserve space for the item to be shipped on the next truck roll that's going to my zip code and going to my house. And even beyond that, it goes all the way down into your supply chain. If this item is going to run out soon, I need you to order more stock from the supplier so that the next customer is not disappointed, and I don't lose a sale because I don't have any more of that item to ready to sell to you. A whole bunch of predictions and actions and reactions happen when any of these interactions happen. What you really need is you need all of your operational systems to work together as one. And you need all of your analytical systems to have the exact same ground truth view of the world, all at once. Now when you try and brute force this problem by connecting systems one by one by one, that is what is creating this giant spaghetti mess. And that is how we got here. But putting it really simply, these 2 estates, the operational estate and the analytical estate just cannot exist separately anymore. It is just impossible to make this really scale. And so that's what Confluent is solving from the ground up in a whole new way by delivering a central nervous system. What we're doing is we're making it easy for your operational systems to work together transparently, trivially and to enable that exact same view to be shared with the analytical systems so that everybody has the exact same view of the world. With Confluent high-quality, consistent up-to-date data is shared naturally all the way through all of your estates everywhere inside your enterprise. Now this actually has massive benefits for everybody for the CTO, this is a whole new architecture that enables them to build new applications that deliver business services, business capabilities and drive business automation faster than they ever could before. Think about getting more from your development efforts than you ever could. And it also benefits your COOs and your CROs because they get unparalleled business visibility into everything that's happening in every one of the business systems, all in real time, beautiful, incredible business level visibility. So we provide a better way to put data to work. We package data ops products, and we're changing the conversation from what is my data? Where is it? How do I access it? How do I query it? Is this even up to date? Is this even accurate to? What is my data? And how can I put my data to work? That's the whole point. And in doing so, we're rethinking and reimagining how data is spread across the organizations and how teams, applications and systems work together to deliver business outcomes. We liberate data from silos. We make it all interoperable. And by doing that, we unlock endless different use cases across all of the businesses where we're delivering a central nervous system. So you may be wondering, how does this relate to the various different technologies I just talked to you through? We have been doing this for years. Years and years and years. And it has not been solving this problem. If we keep doing more of it, it is unlikely it is going to continue to solve this problem. Confluent is reimagining how this problem is solved from the ground up with a central nervous system. Now in the operational plane, we make integrations between applications, making applications work together to deliver new products, new services, business automations, much easier than it has ever been before. And in doing so, we can replace a lot of the need for message-oriented middleware, application integration and change data capture. And over in the analytical estate, we make it easy to gain visibility into everything that's happening in your business, all of your most important data, all in real time. And so by doing that, we replace a lot of the need for these old batch ETO, ELT, data pipelines, data processing pipelines that exist in the enterprise. Now I want to be really clear about one thing, we are not another technology joining this hodgepodge of technologies. We are fundamentally solving this problem from the ground up, and we are unifying these 2 estates. We are in neither the operational estate or the analytical estate, we are in both. And that's what enables us to apparel all of these different use cases I have just been talking to you about. So it's that, which is what's driving the $60 billion addressable market for Confluent. We are breaking down data barriers in the enterprise. There's no need for more fragmented, duplicative technology and processing layers across different use cases. There's no need for siloed processing systems across the analytical and operational estate. We are purpose built to set data in motion and to solve the problems I just talked you through. And looking over the horizon, we're going to take pieces of the pie as we deliver on the central nerve system and we're going to unlock significant growth opportunities. Now these are really important problems, and these are problems I care very deeply about bridging the divide between operational and analytical, decentralizing data strategy and democratizing innovation are passion areas for me as an individual, and they make me very proud to be able to join Confluent. I have spent many years of my life working on this problem in my prior roles. And it's very clear to me that doing more of what we have already done, doing more of these old approaches to this problem can never solve problem and scale. It is impossible. And I'd encourage you to consider that when you look at this $60 billion TAM that doesn't even include various things. It doesn't include the opportunity ahead of us as we monetize more of the open-source Kafka ecosystem and bring them into our streaming platform where they can get a lower TCO and a better cloud native offering and replace their infrastructure and personnel spend. And you also need to consider the incredible rise of more and more data in the enterprise, more and more real-time data in the enterprise, driven by customer expectations and AI, which I know all of us are looking at right now. So that's kind of what we're doing. Now let me drive in just a little bit into how we are doing it. And obviously, it all starts with streams. With streams, data is not static. It's not passive. It's in motion. It's constantly moving, constantly evolving, constantly being processed and shared everywhere across your organization to power your business. Then we make it easy to connect to the static and historical data and bringing data from wherever it sits anywhere across the organization as streams into Confluent. We take those streams, and we process them, enrich them, evolve them, improve them, and we end up with streams that actually are carrying the most important data at every one of the enterprises we're working with. The lifeblood of the data as it flows through those enterprises. And we take that data and we govern it. We make it safe. We make it reusable. We make it known, discoverable and so that other people can safely consume it and use it. And then we enable that data to be shared as data products elsewhere inside the organization through a catalog where everybody can find, manipulate and understand all of the data that's going on inside the enterprise. And then that data, those real-time data assets can flow like water into the operational applications I've spent so long talking to you about and into the analytical applications that I've spent so long talking to you about. The same source of truth into any type of application, build it all from the same ground source, ground truth -- of source of truth. So our product investments, you heard Jay talk about them. They primarily are across those 5 dimensions I just talked about, about streaming, connecting, governing, processing and sharing. And I'd like to dig in a little bit more into our product innovations in each of those different areas. And for that, I'd like to welcome up my friend and colleague, Chief Technology Officer at Confluent, Chad Verbowski.

Chad Verbowski

executive

#4

Thank you, Shaun. I'm excited to be at Confluent where I can bring together my experiences from creating a bunch of cloud services like Azure SQL data warehouse and Google BigQuery, along with what I learned from my time in Microsoft Research in Bing. And it's taught me that there are 5 key foundational areas that we need to innovate in to be able to build a data streaming platform for the enterprise. So today, I'm going to walk us through some of those unique things that Confluent is doing to bring each of those 5 areas together. The first area, of course, is streaming, and it's powered by our cloud native, complete and everywhere Kafka technology. Second, you need to easily connect the streaming core to all of your existing SaaS, database, data warehouse and any other system in your enterprise that's producing or consuming data to enable real-time interactions across your entire data ecosystem. And third, you want to enable everyone in your enterprise to be able to work with and process data in real time. You're going to be mixing, filtering, transforming these raw inputs and reusing those outputs and combining them to create even more valuable insights. And fourth, this rich data set is then shared, reused and perhaps even further processed by other departments or even partners to your company. And finally, for all of this to be safe and compliant we ensure that the data, all of these operations and the interactions are securely governed end to end. And Confluent Cloud covers all of these, the complete stream, connect, processing, sharing and governing journey to make it easy for organizations everywhere to unlock all of their data. Let's talk first about streaming, where we've reimagined data streaming to be secure, resilient, scalable, cost-effective and of course, it has to be everywhere that your data exists. Now as Jay mentioned, Apache Kafka has become the foundational technology for data streaming. 75 of the Fortune 500 companies and nearly all of the major players in every segment are using Kafka. And this is a huge win for Confluent because it means that most enterprises are already excited about using streaming, and therefore, it's very easy for them to adopt and consume our products. Now anything great for your business is, of course, going to grow. It's going to expand with new use cases, more teams are going to adopt it. It's going to be deployed in even more environments. And then these products are going to get mission-critical. And as the usage of this grows, so do the expectations of these systems, you're going to need to scale them and you're going to need to provide them with some major operational guarantees, and that's going to take more engineering resources to run. Now operating open source Kafka is hard. You've got to deal with sizing the infrastructure, load spikes, expanding your clusters. You have to manage the cost. You have to provide it with high availability and disaster recovery. And of course, you've got to secure it. You've got to manage the upgrades, not just to the Kafka system, but to that underlying infrastructure as well. And this is why it's such a huge opportunity for a fully managed cloud native Kafka service that handles all of these complexities. And this enables companies to innovate faster, lowers their TCO and it eliminates all of these operations, so companies can focus on their core business innovation. Now being truly cloud-native is not as simple as just taking your open source software and running it on a bunch of machines or cloud instances. To be truly cloud native, what you need to do is provide something that is very cost effective. It has to be efficient. It has to be secure. And it needs to serve literally thousands of customers on that same set of infrastructure, not just one. So to be a multi-tenant scalable solution with these isolation and optimizations that you need, you need to work very closely with every cloud provider. Not every cloud is created the same. They have their own ways of building networking, processing, storage, security. And if you're going to optimize your system to have maximum performance and cost optimizations, you need to tune what you're building to every one of those environments. And this proprietary innovation that we have at Confluent enables us to have differentiated cost efficiencies and reliability and differentiated ease of use with capabilities like elasticity for automatically sizing what resources are needing for your instances. Pay-as-you-go pricing and, of course, eliminating this operational burden, Confluent Cloud is the easy button for using Kafka. So this is why we build Kora, it's a new engine, 100% compatible with Apache Kafka, and it's a completely reengineered system from the ground up. We've deployed it across 85 regions in the world, and it runs on all 3 major cloud providers. It replicates data across availability zone, so it can provide strong availability guarantees and again, eliminates that operational burden. We've done that work with the cloud providers, with close relationships between our engineering teams to really maximize that elasticity, resilience and scalability with over 5 million engineering hours put into this core system. And in 2023 alone, with Kora, we've already delivered a 4x improvement over open source Apache Kafka performance, and we've also delivered improvements to our data and workload balancing along with massive improvements in our network performance and cost optimization when we're driving very big data loads. Kora powers more than 10,000 clusters on thousands of customers running their critical workloads, saving them time and money and of course, infrastructure and operational management costs. And on a number of dimensions, Confluent Cloud is at least 10x better than any self-managed open source system or even some of those semi managed systems where people are just taking open source Kafka and running it in the cloud. So with our fully managed cloud services, our customers no longer need to worry about any pre-provisioning infrastructure. They can elastically expand from 0 all the way up to gigabytes per second to meet the demands of their business with just a few clicks and without any manual intervention. And with automated load balancing and continuous monitoring, we provide guaranteed reliability of [ 4 9s ] and our customers can be confident that with their mission-critical use cases are being powered by highly reliable and resilient platform. And we've eliminated limits on storage. So customers contain all of their current data and all of their historic data as well for as long as they want. Eliminating the burden of balancing the need for longer data retention with rising infrastructure cost. And this means that customers can use Confluent as the source of truth for all of their data across their entire ecosystem. And these innovations also do double duty here. They provide us with a dramatic advantage in delivering our cloud service. For instance, we have less than 5 Kafka engineers for running these tens of thousands of production systems. This gives us a cost structure for operations that we believe is over 1,000x better than our competition. Now with our deeply differentiated stack, we remove the complexity and the operational burden of this do-it-yourself Kafka, and we're driving compelling savings and faster ROI for our customers. An independent total economic impact research by Forrester highlights how Confluent has helped customers save millions of dollars in TCO with a payback period of fewer than 6 months and an ROI of more than 250%. Now on our mission to build a complete data streaming platform, we need to connect all of the systems that you have in your enterprise to make it easy to ingest and egress data continuously between Confluent and any system or application, whether that system is on-premise or in the cloud. And while capturing dynamic data is foundational to our data streaming strategy, we also provide the seamless experience for connecting to any non-streaming source. We minimize the effort to integrate and ultimately unlock that data so that you can enhance your existing real-time use cases. And our Connect strategy really has 3 core capabilities. Number one, it reduces the TCO of integration with our prebuilt and existing managed connector offerings. Number two, it enables custom connectors so that anybody can build unique connections to any system they have in their enterprise. And number three, we want to ensure that every customer can use their tool of choice. So we have native integrations with any of the major native applications that they are using. Now let's dive into each of these 3 areas. So in the streaming world, we have the best ecosystem of zero-code connectors built out across the various data estates. Operational systems, analytical systems and SaaS platform. So you don't have to know Kafka to get data into and out of our platform. We have over 120 prebuilt connectors and over 70 fully managed connectors for our customers to use. And an ecosystem of integrations that our community continues to build to connect their data sources to and through Confluent. And this means our customers can quickly and easily connect their systems with Confluent cloud using a few clicks or lines of code. We take care of all the heavy lifting so our customers can focus on unlocking the value from the real-time data estate. Now this can save them typically 1 to 2 engineering years of work or investments they might have in proprietary tools that can be prohibitively expensive. And today, our customers are running thousands of connectors across our Confluent Cloud and Confluent platform products. So when customers begin to use these connectors, this, of course, drives more data into our Kafka platform, but that in turn leads to more processing. And this processing is powering a flywheel to increase consumption of Confluent resources, and this highlights the networking effect of real-time data. This flywheel spins even faster as customers push higher volumes of data to power their generative AI workloads. So one thing I'm quite excited about is what we call custom connectors. It's a new capability we launched last month at Kafka Summit in London, and custom connectors gives our customers added flexibility to simplify integrations with any possible sources that they have. So let's say a customer or some advance user has a specific requirement to integrate their particular application or custom source of data. Now with a few lines of code, they can create this. They upload it to our Confluent Cloud, and we take care of everything to do with running it and making sure that it's operationally sound. They don't have to worry about the infrastructure details of managing this connector because we take care of it for them, saving them time and the cost of data integration. Now in addition to enabling our customers to bring their own connectors, we also want to make it easy for them to use their tools of choice. So we never want them to leave the UI of the tool they're working in so that they can seamlessly connect with any of the data that we provide in real time. So we're working very closely with our technology partner ecosystems to build Connect with Confluent so they can build these native integrations into their own platforms. And we're launching this next month, and we have many partners in our ecosystem already lined up for the launch. Now continuing on our mission to build this complete data streaming platform, we need to be able to easily derive value from these real-time data systems. We do this by providing powerful processing capabilities to enable filtering, joining, analyzing and reusing these results to generate actionable insights in real time. Now I want to take a moment to signify the importance of stream processing. Any data or event has by itself some intrinsic value and traditional messaging systems or even simple point-to-point API connectors, they can capitalize on, but they're missing a huge opportunity. And that is that data is more valuable when it is combined with other data. So if you combine one event with data from another, you can get much deeper insights and way more meaningful actions. For example, let's say you're making a purchase on your favorite retail site. If that retailer is able to join that purchase information with your customer information to know the location you are in, then they can provide much more meaningful shipping options. And then if you take that a step further, if they can further analyze that information about what's being shipped to your location, they can, of course, bundle many of these together and ship them to you in a single package. And so this is an example of just where you can take one real-time event, join it with some of these offline systems and produce even more value. And this ability to share, reuse and enhance processing results in the same data reuse over and over again. And this is the power of stream processing, and it's one of the key value propositions of our data streaming platform. Now while Kafka Streams is dominated in building stand-alone applications, there has been a whole family of stand-alone frameworks that have emerged to manage the life cycle of application development. And because there are ecosystems around streaming, this is an opportunity for us to pull in additional things into our platform. In the ecosystem of stream processing frameworks, there has been one technology that has come to the forefront in this space, and it has become an emerging de facto standard. And that, of course, is Flink. Now Flink, as Jay pointed out, is one of the most popular open source projects ever. We can see that it's growing at a similar rate to what Kafka was about 4 years ago. In fact, we're seeing a high level of overlap between companies that were early adopters of Kafka now becoming adopters of Flink. And many of these innovative companies are used these technologies together to support mission-critical workloads in their enterprise. And this means that just as Kafka is the de facto standard for streaming, Flink is the de facto choice for stream processing. For example, the AI recommendation system powering TikTok called monolith is built on Kafka and Flink so they can respond in real-time to changes in user preferences. There's a lot of synergy between Kafka, Confluent and Flink. But Flink doesn't have a cloud native offering to make it easy for customers to adopt. So customers are spending a lot of their time and engineering resources on operating, maintaining, monitoring and scaling Flink infrastructure. So it naturally makes sense for Confluent to deeply integrate Flink into the best-of-class cloud-native Kafka offering that we have. It allows us to reuse our expertise in building this cloud native offering, and it also allows us to reuse some of those native cloud native primitives to provide a Flink native service as well. Now Immerok is a company founded by Apache Flink community members. And 6 months ago, Confluent announced our acquisition of Immerok so that we could have the preeminent Flink leaders at Confluence as well. And this really complements our existing stream processing offerings, which extends it with rich capabilities for pro code and low-code developers or even data analysts. Now all product development and data engineering can be built from this one data estate. We will provide a fully managed cloud native Flink in all 3 of the major cloud providers in every location that we have our Kafka service. And our customers won't have to worry about infrastructure details or pre-provisioning of any infrastructure, they will only pay for what they use. And today, Kafka Streams is a library that runs in the client infrastructure, and it isn't monetized by us today except for the data it produces. But in the future, we will bring that processing into Flink. So we can monetize the processing as well as the data that's being created. And we're integrating all of our stream processing capabilities to provide an end-to-end data streaming platform. That makes it operationally easy to access all features needed to build streaming applications. And often the most valuable data to work with is also the most sensitive. And this is why it is foundational for our data streaming platform to guarantee that all the data and processing is end-to-end governed. We ensure the observability, the compliance and the confidentiality of data that's always on the move. Now the explosion of data and its commoditization has given rise to new challenges for enterprises like how to foster collaboration between producers and consumers with the right checks and balances in place. How do we free up and unlock data to innovate and push businesses forward without losing control over that data? And how do we make sense of data and ecosystems as they grow and become more complex? And organizations balance this data access between innovating faster, onboarding these use cases with security that they need to protect and meet enterprise regulatory and compliance needs. So solving this is critical. In fact, a lot of companies have had to build this themselves because there really was no commercial offering that's provided governance for data in motion. So many organizations already use Kafka and trusted for their most important data. They unlock this critical data for reuse in any applications across business boundaries, and it's really critical for accelerating their agility in the enterprise. But it's only possible if organizations can do this safely. And Confluence Stream governance suite empowers distributed teams of engineers to have this balance, this balance between moving fast and staying secure. And we provide 3 things as part of this offering. The first one is our stream catalog, which enables people to increase their collaboration and productivity with self-service data discovery that lets teams classify and organize the data and find the streams as they need them. The second part is stream quality, which enables customers to deliver trusted, high-quality data streams to drive the business with data integrity. Third is data lineage, which enables customers to understand the complex data and uncover insights and value with their interactive end-to-end maps of data streams. Stream governance is key to fostering collaboration and knowledge sharing necessary to become a data-centric business, while remaining compliant with ever-evolving regulatory needs. So one area that Confluent really dominates in and is enabling operational workloads. Mission-critical workloads that run the core businesses and our customers. And it's handling things like social security numbers or credit card numbers. And we make sure that our customers are confident that only the right people with the right access can work with this data. Our innovations protect customer data confidentiality. They help maintain contracts. They enable early detection of any problems. They can assess the quality. And they provide the fidelity and shape of the data, making it easy to discover inconsistencies and anomalies quickly. This empowers developers to use all of the data that's at their fingertips more confidently and quickly. And this creates a network effect. A well-governed set of data has more reason to use the Confluent data streaming platform as an interchange for all of the data in your enterprise. Now the final piece of our data streaming platform is to enable data sharing. Sharing enabled self-service and up to the date, trustworthy data and analytics effortlessly for everyone in an enterprise or even to your partners. Now developers spend a lot of time discovering trustworthy data to use for their applications. And we make it easy to do this discovery, consumption and sharing so that developers can spend less time looking for their data instead of just consuming the right data at the right time in the right format. And this means they're better equipped to use data to power their real-time applications without worrying where data is or how to access it. Customers can easily integrate data into real-time systems and ensure that it's secure and safe for sharing. First, we're building a developer portal to enable easy discoverability and access to ready to use data. And using this portal, a developer can search for a data stream, click a button to request access to it, and this request gets sent to a central operator where it can be approved and they'll ultimately be granted access to use it. And in addition, we are the only company to support any combination of on-premise hybrid cloud and multi-cloud deployments for your data streaming architecture. Our customers can connect all of their systems, all of their data and applications across their distributed architecture and unlock valuable data in these siloed environments and stream that data in real time, wherever it needs to go. No native hosted cloud offering can deliver this. And if they ever choose to migrate from 1 provider to another, we can help our customers maintain optionality and eliminate lock-in with seamless data mobility across clouds. Lastly, we make it easy to share data in your organization. And we also deliver that same simple real-time experience between organizations because we use common -- we have -- the use case is actually quite common. There's actually no organization that exists in isolation. For example, inventory companies or delivery companies or even financial trading companies, they're constantly exchanging real-time data internally and with their external partners. And they're often using very primitive methods. They're sharing with things like file uploads or e-mail attachments or one-off APIs. But that means that the data is stale. And every new source requires yet another IT project to get things wired up. So when data flows across organizations in real time, both parties have the same view of what's happening and what is possible. Stream sharing provides the easiest, most secure sharing across organizations in real time with just a few clicks. And now I'd like to invite back Shaun.

Shaun Clowes

executive

#5

[Audio Gap] With some of the aspects of our technology, which will unlock the data stream era and data streaming across the organization everywhere. But you all would know that it's not just about technology, it's also about how that technology is brought to bear in every enterprise. And there's a lot to do to truly unlock that potential. Now one of the biggest limiting factors of the way data is used in most enterprises is actually the people. It's not just the people that have access, it's the fact that most of the most important data that stored in streams are available in the enterprise is often only accessible to developers. Now I love developers. I was a developer. I am a developer. But I recognize that there are actually only 20 million or so developers in the world. 20 million developers and there are many times that data consumers in the enterprise. And so if the data is just available for use and leveraging by those developers, it can't ever reach its potential. So that's why we've implemented technology like Stream Designer. Now Stream Designer, which you can see here on the right is a completely no-code technology that enables a nondeveloper, data consumer to access all the streams that exist, understand what is in them, use them, filter them, evolve them, improve them, process them and to produce your own new streams that go out the other side, and those streams can then be consumed by that person, but by other people inside the enterprise, not just other data consumers in the analytical estate, but even by developers to build new applications to deliver new business services as well. It's a true unification of the way data is used in the enterprise and democratizes data. Now of course, if we're going to unlock the data for use by everybody everywhere. We need to do a couple of things. Firstly, the data needs to be governed and heard Chad talk extensively about governance and making sure that the data is reliable. But we also need to be able to deliver a cloud platform that meets the world's highest standards for security, governance, compliance and reliability. And we have been working on that for a very long time. We have the industry-leading controls, monitoring, compliance certifications, and we're continuously reaching higher to offer the world's highest standards to our customers. This has actually been a journey that we've been on for a long time. Apache Kafka was invented about 12 years ago by Jay and the other founders of Confluent as part of LinkedIn. And Confluent has been around for 9 years. And over that time, streaming has gone mainstream. You don't have to explain to anybody what Kafka is anymore. You don't explain to people what streams are. Streams are effectively ubiquitous. In all of the largest businesses in all of the most advanced startups all over the world, streaming is a thing, and they're all leveraging it to deliver brand-new incredible experiences. So our journey continues. And we have an opportunity to take a lot of that open-source Kafka streaming market and to bring it into our cloud data platform where we can offer a lower total cost of ownership and as Jay mentioned, yet offer more features, more capabilities. It's a better product that cost less, that's our opportunity to convert more of the Kafka ecosystem into our cloud. But it's not just that. Even that is just the journey beginning. You've heard us talk a lot about streaming and putting data in motion, about it being a way to convert data from being static into flowing through your organization, but to truly achieve that, we have to deliver a complete data streaming platform, not just streams, a data streaming platform. We have to be able to stream, connect, govern, process and share, and that's the data streaming platform that we're building. And it's that, that unlocks the $60 billion plus market opportunity that I talked to you about earlier today. So with that, I believe we're going to break into a quick Q&A session. So welcome back up, Jay and Chad.

Shane Xie

executive

#6

That's right. We're going to be doing a 15-minute Q&A section, focusing on products. So we've got a mic here. And Fahim has a mic on the other side. We'll take a minute here, set things up on stage.

Brad Zelnick

analyst

#7

Over here, Brad Zelnick, Deutsche Bank. Thanks so much for today and for the presentations. Just a simple question. Can you dive in a little bit on the investments required to bring Flink up to the standard of where you are with your core products today? And just how do you see the evolution like when we look back in 5 years, what will be the relative scale of Flink relative to the rest of the business?

Edward Kreps

executive

#8

Yes, do you want to take that, Chad?

Chad Verbowski

executive

#9

Yes. So one of the things I'm most proud of in my career actually, I've actually never seen this anywhere else is we were able to decide we were doing Flink, we bought Immerok, and we announced our first product in less than 6 months. And I've been just super proud of the ability for us to take a company of 20, 30 people, merge that with a company of equal size and get those teams to work together. We really were able to take something that was designed to be run more or less on machines, non-cloud-native, and we actually have a multi-cloud offering. I think it speaks to the innovations that we've made in Kora that we were able to take this new set of people, new set of technology and really deliver that within 5 months to make it profitable. So I'm very excited about what we can do. I think Flink brings one more piece to Confluent Cloud that we didn't have before, which is hosting third-party code. So a lot of our work is really around making sure that we can do that super securely and super scalably so that we can basically run and power any of the streaming applications that are out there.

Edward Kreps

executive

#10

And I would just add the 3 pillars of differentiation I talked about being truly cloud native, like really building from the ground up to be a cloud service, being a complete platform and being everywhere. When we thought about it, that really applies very directly to Flink as well, right? There's an ability to really build a processing layer as a service. I mean we've seen this with the cloud data warehouses, as Chad would know much better than me. And the ability to integrate this with Kafka, with our governance, really make it one product that all fits together and then offer it to customers multi-cloud across all the environments they run in. Very much the same kind of 3 pillars of value. And I think that's helpful for us just both in terms of guiding our investments and then also just kind of simplifying the message to customers. You don't want to have different modules with different value propositions. You want to have just a simple description of why you're better, why you're the thing that they want. And so when we think about where are we investing, I think those 3 are a pretty good guide for us. Obviously, a lot of details of, hey, what does it mean to be cloud native. What is the integration into the rest of the platform, et cetera, but those are kind of the right buckets.

Sanjit Singh

analyst

#11

Sanjit Singh, Morgan Stanley. Jay, I wanted to revisit the topic around AI and this sort of next wave of Wave 2 AI applications that are going to get built? And where you sort of see the logical flow. Because as I sort of imagine it, we imagine a customer or stakeholder user asking a question in natural language, and we have to have to surface that response that could span both structured data as well as unstructured data. And I guess the question is just like, what is Confluent's role in providing those answers. And what percentage of applications do you think are going to need to be real time going forward? And is there any difference between the real-time requirement for this next-generation app versus the stuff that we've built in the past couple of years?

Edward Kreps

executive

#12

Yes. Yes. So it's a great question. So there is certainly a difference from the kind of more traditional machine learning architectures and the newer AI architectures. And I would say the difference tends to be less about the model building, which, to some extent, is kind of done for you or at least mostly done before you by the kind of foundation model builder. And so yes, our role is very much the connectors that tap into the different applications, bring together data, kind of munge it into the right format, help it be indexed for surfacing in these applications. Some of those details, I think, are still being filled in around us, right? So will the model that you're integrating with be like an open source model that you kind of fine-tune yourself? Or is it going to be something that you consume as a service? I don't know. And to some extent, it doesn't change that architectural pattern or our role. Will you be indexing this storing data in more traditional systems or into some kind of vector database or will there be other aspects of integration into LLMs that have yet to emerge. I actually don't know the distribution of that, but it doesn't really matter to the end state architecture, like the problem is the data that needed these models is locked up in this massive old systems. It was not meant to serve any kind of real-time query requirements, especially broadcast out across a bunch of systems. And that's the kind of key problem that has to be solved to really unlock those use cases. So that's kind of a very obvious application. I think there's other integrations beyond that, right? So Chad mentioned kind of the usage -- TikTok, which is not just bringing together data in real time, but actually training the model for you based on responses as they come, it's much more sophisticated. I think that is kind of real-time learning probably is an end state for a lot of these things, but it's not the starting state for most companies. Most companies are just trying to figure out how to combine some version of their data with some version of intelligence and get that into a usable product right now. So I think in these early iterations, we're going to see a lot of that kind of integration problem for data, and we'll see a lot of applying AI to an incoming stream to augment it or improve it or classify data in different ways. I think those are the use cases we're hearing from customers and that I think are likely this first wave. Beyond that, I think the world opens up a lot more. We're still talking about language models, but there's a lot more that can be done on different data sets and applications in many other domains. And so that's where it gets harder and harder to predict where this could go. I mean, I think it will be interesting, and I think there will be a role for us in a lot of those areas. But that's where it gets beyond kind of what people are doing today with what the technology does today.

Michael Turits

analyst

#13

Michael Turits with KeyBanc. So I'm sorry, kind of a rip on Sanjit's question, but I was -- you said that, obviously, more compute, more data is -- it is a feature of having to do real-time gen AI. But also when you were talking about the fact that you need to interact at prompt time and an inference time?

Edward Kreps

executive

#14

Yes.

Michael Turits

analyst

#15

So I'm wondering what kind of feedback you're getting already, whether it's from commercial software application developers are trying to develop applications and want that kind of data integration at prompt time or whether it's enterprises that are seeking to build their own applications. So what are you hearing back from customers so far in terms of their needs for real-time in order to hit that prompt time?

Edward Kreps

executive

#16

Yes. Yes. The motivation for real time, I think it's pretty clear when you consider the concrete use cases, right? And so that would be the case whenever the data that would be the right answer to your question, needs to be kind of up to date and accurate. The inclination people have is, I think, well, what will happen is we will take these models and we'll train them on the enterprise data and that will be the way that we answer questions. And I think that, that's probably not the most obvious direction for this to go. It could be, right? That was kind of the tick-tock example, Chad gave. The challenge is basically a few fold, right? So one, that enterprise data is private. Two, it has to be up to the moment, not like a year old or even a month old or even a week old, right? Like when you're asking Expedia about your flight, you don't want to know the state of your flight like a week ago, you want to know like right now. And in fact, when people interact with the company, it's about something right now. And so the -- so I think the majority of use cases will have some aspects that needs to be in sync with what you've done with the company recently, what you just bought your interaction on the web, your interaction in person. I think a lot of these kind of copilot type or agent type use cases will have some element of that needing to be in sync with the world. And when that's not the case, customers are just confused. They don't understand why they're getting something that's I'm saying wrong, hallucinated not right. And so I think if you think about how this is working in practice, it's actually not that difficult to understand right, your kind of materializing a lot of data about the customer and what they're doing in their interactions and other things, and you're effectively augmenting the prompts that you're sending out to the language model and saying, hey, it's not just where is my flight, right? It's where is this customer flight, given all this information about the flights they were supposed to take and the status of those flights and alternative routes and that enables the kind of complete and rich and correct answer off of the up-to-date status of the world.

Jason Ader

analyst

#17

Jason Ader, William Blair, first, just a clarification on the $9 billion in the TAM in database market. Can you talk about what you're addressing in the database market? And then secondly, can you talk about the differences between Flink and spark ecosystems. And I'm assuming that's where you're going to run off a little bit against Databricks. Maybe talk about that as well?

Edward Kreps

executive

#18

Yes. So when we think about where we're kind of taking spend from, as you said, a portion of that is out of the database market. And that happens in a couple of different ways. And so one usage for databases is, hey, you're trying to do some data processing. You're effectively loading it into a database, running some processing and shipping out the result. Often it's some kind of batch job that would happen at the end of the day or periodically throughout the day. That's the kind of thing that's just inevitably going to real time that's moving to streaming, right? For the things where there's a lot of pressure to make it faster, it will happen sooner. For things where there's less pressure, it will happen over time as it becomes cheaper and more convenient do it that way. But over time, I think this kind of batch processing is really kind of an odd artifact of mainframe computing that just doesn't really make sense with how the world works or what the needs of applications are. And so it's literally just been held back by aspects of technology that made that a more convenient that are entirely going away, right? So there's no inherent limitation of streaming that means it has to be more expensive or they won't get the right results. So there's no inherent limitation on that says, "Oh, there's going to be this category of things that will always be batch forever". It's really just a question of convenience, driving down cost, building the ecosystem to be able to make that the most obvious route. And so I think we'll see that conversion happen over time. And I think that there are a blend of other database use cases once you take that kind of obvious batch processing stuff that move beyond just kind of the processing and into some of the materialization and storage of data, I think we'll see some of those move as well. To your second point on what's the difference between Flink and Spark. Yes, there's some overlap. Spark was kind of a batch computing framework that added some streaming capabilities. Flink was more directly targeted as a pure streaming framework that added batch processing capabilities. So there's some analogy between the two. When you look at the user base, I would say that where Flink has really grown and thrived has been these kind of application run the business use cases. And where Spark has really grown and thrived has been the kind of data scientists and data science workloads. And obviously, there is some boundary between those two countries. And there can be some border dispute there. But by and large, we don't see the data warehouses or data lakes or lake houses as competitors, and we certainly don't kind of battle deal by deal with them in any way. Right now, there's a very kind of rich and exciting competition in that analytics world. But the use cases targeting streaming tend to come to us and often data flows out of that streaming escape into all these things, into the data warehouse, into the lake house, into the new generative AI systems, there's a bunch in that area that we're helping to feed. And so by and large, these are partnerships, and we've won Partner of the Year awards from Databricks, from Google around the analytics area for that reason because it is primarily cooperative rather than competitive.

Shane Xie

executive

#19

All right. Thanks, everyone. This is the end of our first product focus FAQ, you have more time leading the program with the rest of the management team as well. So right now, we'll take a short break, and we'll come back in 5 minutes. [Break]

Shane Xie

executive

#20

All right. Welcome back, everyone. And right now, I would like to introduce our next speaker to the stage, Erica Schultz, our President of Field Operations.

Erica Schultz

executive

#21

Good Afternoon, It's great to be here. As Shane said, for those of you that I haven't met, I'm Erica Schultz. I'm the President of Field Operations. I'm going to talk to you about the power of our go-to-market model. Our go-to-market model has evolved a lot in my nearly 4 years in the company, and we're excited to get into it. So first, let me start with the massive market opportunity that we're going after that we're trying to capture. You've heard a lot about this today, but let me share a couple of more comments. So as you heard Jay talk about and Shaun and Chad, traditional data architecture is based around this fundamental assumption of data at rest. And so this leads to this giant mess of silos and point-to-point connections and batch processes. And as we know, batch processing has been around for decades. It's become a habit. It's very ingrained in a lot of organizations out there. And so oftentimes, we find that even though no one really wants to use batch, it's there, and that's the default. It's still everywhere. So to meet the demands, of real-time applications and business processes that many of our -- many companies have, many of our customers have, more than 150,000 organizations have turned to Kafka for data streaming. Confluent, as we know, counts about 4,700 customers and counting, 4,690. And so not only do we have a lot more Kafka usage to go in the market to -- within many organizations. But even in the organizations that we do count as customers and even many of those using Kafka, the usage is still in very early innings. So we see the use of data streaming and the use of Confluent in very early innings in the market and even within many of our current customers today, most of our current customers today in terms of the potential for data streaming's use across the enterprise. So we are the proven leader for data in motion. We are fortunate to count the customers from many different industries, a diverse set of industries as our customers. This was on full display at our Kafka Summit London event, which we held last month. In London, we had about 1,500 attendees, another 2,300 online. Just fantastic representation from so many different industries, primarily from Europe, given the location of the event but a really fantastic opportunity to connect with customers. And I will tell you a lot of the product innovations that you heard about today and in particular, Flink, where some of the most interesting to this customer group a lot of conversations and sessions on Flink and other topics. But one of the customers that in the U.K., Sainsbury's, the U.K.'s second largest grocery retailer is a great example of kind of the adoption of Confluent. They started with Kafka. And what they really needed to do was they needed to access data that was stored in legacy systems and on-premise systems. And as they were developing more cloud-based customer-facing applications, they needed to access data that was locked away in those legacy systems and their developers didn't want to touch it. And so they adopted Confluent and they've been able to really break down these data silos, give their developers access to this really valuable data and be able to deliver inventory and supply chain management applications that are really game changing for them. So both some customer-facing applications and also some new business process applications. So it's a great story. I love the quote here from Sainsbury's "In retail, everything is becoming data-driven. It's no longer just a facet of IT. It's quickly become essential to the entire business. Kafka has become a company-wide nervous system for us and Confluent has really taken it to the next level." So this brings me to how we segment our TAM across 3 primary segments: the Fortune 500, the rest of the large enterprise market and then what we call commercial and mid-market, which includes many digital native companies, startups and even emerging markets for us around the globe. So I want to spend a few minutes talking about how we segment the broader market. So first off, let's talk about large enterprises, and we're definitely in very early innings with our opportunity in large enterprises. We think about the Fortune 500, and then we think about enterprises around the globe is defined by last 12 months revenue over $1 billion. When we met last time at Current, we talked a little bit about this bottoms-up modeling for TAM, so I won't go through it in detail again here. But we definitely see that the potential ARR in the Fortune 500 is 7 to 8 figures per customer, in significant ARR across the enterprise market as well. So we resource those markets with a slightly more higher touch coverage model. And there's -- I'm going to talk about a customer example in a minute because what we see is once we get into a company with that first use case, and they realize the power of data streaming and a data streaming platform and democratizing data, we get into that flywheel effect of bringing more and more applications and data sources onto the platform. So a little more insight into our penetration in the enterprise segment. Let me start bottoms up on this slide. So today, we count 1,075 customers over $100,000 ARR. And I want to point out that more than 50% of this customer base is using Confluent Cloud that's been a material source of growth for us with our large customers as well. And our [ $100,000-plus ] customers have grown 34% year-over-year. If we look at the next tier, we have 135 customers who spend more than $1 million a year with us. That cohort has grown 53% year-over-year. And within that cohort, we have 9 customers who spend $5 million or more and 10 -- 3 customers who spend $10 million or more. In those top tiers, those numbers have really grown in the last couple of years. A couple of years ago, we had 3 customers who spent more than $5 million a year with us and none who spent more than $10 million. So I think this is a great demonstration of the potential in these very large enterprises and some smaller companies who are just making really big bets on data streaming architectures, and I'll talk about a couple of those. So that gives you a little bit of a panorama of our current footprint within large enterprises. I want to talk through a specific customer example of how Confluent and data streaming can grow over time. This is one of the leading global online job sites. And this graphic here, you've seen it before. This is kind of our typical customer journey. So as with many of our customers, this company started with open-source Kafka and this was back in 2017 in their initial steps in their journey to an event-driven architecture. And as Kafka and streaming grew, they realized that managing Kafka on their own was untenable. So we started with a small deal in 2020 for a very specific use case. It was a competitive deal. We won it due to our cloud-native benefits and the completeness of our platform. And then by the next year, that initial commitment quickly grew to $1 million as the customer expanded to more lines of business, and they grew to more than 600 users and 1,500 applications once they got the data streaming platform up and running. All of this was on Confluent Cloud. Today, we have become critical to really being a unifying real-time data layer across the organization, and they're running many use cases with Confluent, use cases like employer campaigns, sponsored jobs, targeted searches that help candidate matching for employers as well as optimize the job seeker experience. So what's interesting as well, we know that all companies, especially those that are very heavily invested in the cloud. This is an environment where there's a lot of pressure to optimize cloud spend in this environment, their commitment to Confluent, their spend with Confluent, their consumption continues to grow significantly. We're up to a trajectory of more than 10 million today, which is larger than their spend on MongoDB, Snowflake or Databricks. So as you can see, what started as a very small land has grown quite significantly, more than 100 times over the last several years. And we've become a very strategic partner to them in the data space. So hopefully, this is a valuable illustration. And here's a quote from the customer, "Confluent here to stay. Confluent helped us become cloud-first by making it easy to migrate and scale our Kafka deployment globally across multiple clouds. All of our Kafka use cases now run seamlessly and efficiently on Confluent Cloud". So really proud to count these guys as a customer. And we're not done yet. We see future growth. Okay. So I'm going to shift from talking about the large enterprise segments to what we call commercial and mid-market. And as I mentioned earlier, this segment counts digital native companies, start-ups in a number of emerging markets for us. And this is a really exciting segment for us. We generally define the companies in this segment as having revenues less than $1 billion in the last 12 months, a little bit less than that, smaller than that in some of our international markets. And we really focus on the startups and digital native companies within that segment. Many of these companies, as we can imagine, are born in the cloud. And many of these companies are starting with Confluent, and with a data streaming architecture, which is really exciting. And we have really strong product market fit in this market because these companies being young, they're kind of starting with the mindset of adopting managed services, data streaming is core, and they don't want to put resources on self-managing, open source. And so the product market fit is really strong, really great value prop. So we're excited about this market, not only because of the volume of companies in this market, but they're really high propensity for us, even within relatively small companies. We've seen incredibly strong performance with our go-to-market engine in the commercial space. So a few stats specific to this commercial segment over the last 3 years, we've seen a 7x increase in customer count. We've seen a 13x increase in customers spending over $100,000 with us. I referenced on a prior slide with about 1,075 customers who spend more than $100,000 with us. More than 20% of those are in the commercial market. So these are companies -- relatively small companies, but material spend on Confluent. And this engine has helped us achieve global reach. So we're in now more than 100 companies -- countries, excuse me, in large part due to the power of this commercial engine. Our product-led pay-as-you-go model is very robust in this market. Over 90% of our customers start via a self-serve sign up in this market. And often start spending and consuming a few thousand dollars in their first few months. And as they get up and running and successful, their consumption accelerates, and we see from a few thousand dollars to over $100,000 in ARR in less than 5 quarters on average, which, again, as you can see, is happening at a pretty significant scale. So we're really excited about what's been accomplished in this market and the high propensity for us across this market as well as it's been an engine for us to kind of perfect this product-led bottoms-up motion that we can then bring into the enterprise to really complement with that enterprise sales motion. So I want to talk about a specific customer example from this market. Metronome is a customer of ours. This is an early-stage company, a hyper-growth startup for those of you not familiar with the company, but they offer a real-time usage-based billing platform. So a lot of demand in the market for their offering. And it's core to the business model for so many startups today who are depending on real-time usage-based billing. So the billing platform is built on Kafka to compute usage and invoices in real time for millions of their end users. And given their size, this is a great example of what I mentioned before, Metronome quickly realized this is not where they wanted to spend significant engineering resource, managing open source Kafka. It was costly, and it diverted valuable engineering talent away from their core mission to low-level infrastructure management. So with Confluent Cloud, they were able to reallocate at least 60% of that engineering time to delivering new product innovation instead, which is great. So in just a couple of years, their consumption has increased more than 10 times, growing with the demand for usage-based billing. They count OpenAI is one of their top customers. And their recent 6-figure expansion with Confluent, puts Confluent is the vendor with the second most spend after AWS in their environment. So they're making a pretty big bet with us. They're very early in their company journey. So we see a bright future for our partnership with them. And worth mentioning that, again, just to round out kind of our opportunity in the start-up market, we have a Confluent for start-ups program, where we have almost over 100 participants, which we're really excited about. So we see this as a really great opportunity for us. Okay. So let me spend a few minutes on our customer growth go-to-market model. This is how we describe our unique go-to-market model. You've heard me talk about this before. But let me just re-anchor us in what differentiates our customer growth go-to-market model here at Confluent. We are a product-led, consumption-oriented and purpose-built for the data in motion journey. And one of the beauties of this go-to-market model is that as we bring Flink to market, this is the ideal go-to-market model for Flink as well. It maps to the customer journey to the personas. And so we see a lot of synergy as we bring Flink and other offerings to the market as we move up the stack. So first, Product Led. Jay shared a similar slide earlier, and we've talked about this before, but this Product Led motion and the fact that Product Led and enterprise sales motions are complementary is really fundamental to our go-to-market motion. So our product-led motion revolves around that self-service, pay-as-you-go, frictionless adoption targeted at the practitioner personas. We might want to make it so easy for developers and architects to get their hands on our products and get familiar, tinker around and ultimately start in maybe dev environments with Confluent cloud. Then we layer on an orchestrated set of what we might think of as traditional enterprise sales motion. So getting to the economic buyer, working with InfoSec to get through security reviews, doing some joint planning to make sure we have aligned definitions of success with that first use case. And a clear vision for the power of data streaming across the enterprise, maybe we do some TCO modeling. So that's where we bring and of course, we work on a contract. That's where we bring in some of those traditional enterprise sales motions and roles in the sales organization. But meanwhile, our technical stakeholders in the account can get right into the product and that experience can be a self-service as is appropriate. Of course, we'll want to partner with them to do really robust technical evaluation, tech [indiscernible]. So that's a little bit how these 2 pieces work together. The second pillar of our customer growth go-to-market is consumption-oriented. And this is really powerful, our cloud consumption model. We -- given the mission criticality of the use cases that we power across both revenue-generating applications as well as data-driven business operations, we've seen our product and go-to-market model drive really exciting network effects in our accounts. We've talked about this and this consumption model where we have usage-based billing for our customers is just a really powerful and frictionless way for them to consume more use cases and scale their usage with us. It also comes into play, it enables us to move up the stack seamlessly with Flink and other offerings. So this has been incredibly powerful for us. And I think that as we see new use cases come on where maybe the usage -- the future usage isn't known yet by the customer, a lot of gen AI type use cases and that sort of thing. This is a really beautiful model because customers can kind of get started without having to size an environment and plan for volumes. We can kind of get into it a little bit together, before we look at those elements. So our consumption model has been really powerful for us. The third pillar of our go-to-market model is purpose built for the data in motion journey. And we've talked about this slide, but I just want to highlight that our data in motion journey, we think of it generally in kind of 5 stages. Where early on, there's early interest in developers getting engaged with the product, maybe there's an early production application and then it moves into maybe that application moves into production, or we move the data streaming architecture to other more mission-critical applications across Disparate LOBs. And then ideally, the network effects start to be envisioned and customers start to in this stage 4, really think about a business-wide integrated streaming strategy. This is where we partner with customers, as you can see here on the slide, across more topics like people, process, technology, and we can help them build a data streaming center of excellence. We might bring in partners to help companies with the transformation associated with really making streaming a core capability enterprise-wide with that ultimate destination being Confluent as a central nervous system. There's a great quote here from Bankers Health Group talking about making Confluent the backbone of the company and really helping them integrate critical apps and data systems together. So this is a company that's been partnering with Confluent for a few years and their deployment spans a number of modern cloud-native services, both in the cloud and also on-prem databases and batch ETL, but we've really become the backbone of that environment. So a great example. So the power of this customer growth go-to-market model really is manifested in the growth that we're able to see within our customers. As we talked about across those 4,690 customers that we have, we have customers in many different stages but the vast majority are in relatively early stages. And we can see the power and the potential of the growth when we look at the customers who've grown with us over the last several years, and we know that even these customers who are on the larger side, have plenty of growth to go. So 3 customers here that I'll talk about. The first is -- and all 3 of these started on our pay-as-you-go offering before moving to a commit contract. The first one is a Generative AI Research Technology Company, which uses Confluent Cloud to field -- to feed real-time data into their billing applications, so that they're leveraging our service to minimize the risk of lost revenue. Their consumption has increased 19x in the last 3 quarters, primarily due to the recent rise of Gen AI. So we're obviously still in really early innings with this customer. Next, we have an Online Delivery Service who uses Confluent for use cases like real-time inventory management and fraud detection. And here, we have a 12x growth rate driven by -- both by the increased demand for online delivery as well as the customer incorporating Confluent Cloud into more parts of the business, more use cases on their way to a central nervous system. And then finally, we have an Online Gaming Platform. This company uses us to power use cases like player analytics for millions of users as well as a community safety use case to detect and prevent in real time if players are using harmful language on the platform. And here, we've seen a 7x growth rate over the last couple of years. So just a couple of examples of how our customers are growing with us, is they're incorporating more and more use cases. All right. I want to spend a minute here on TCO. You've heard us talk a lot about our TCO advantage at various junctures. But this is really important and especially in this macro environment. So obviously, a really hot topic for companies. There -- most companies are not looking for incremental spend, but rather how can your solution help me achieve lower total cost of ownership than what I'm doing today. So we believe that Confluent offers a fundamentally better, faster and cheaper solution for our customers for data streaming compared to any alternative, the most common of which is self-managing open-source Kafka in-house. And our TCO advantage comes from savings across 2 primary sources of spend when self-managing Kafka. The first is the underlying infrastructure required. And the second is the engineers and operators operating talent responsible for configuring, deploying and managing Kafka. So we've built a deeply differentiated stack that drives compelling savings in both of these areas. So let me just give a couple of examples of how we save companies money -- save our customers' money across both of these cost categories. So first, on the infrastructure side, multi-tenancy. Confluent Cloud's multi-tenancy allows us to pool our thousands of customers on shared infrastructure to drive higher utilization with a serverless experience, and then we get to pass these savings on to our customers. I'll also highlight elasticity, our intelligent tiering of data between memory, local storage and object storage helps manage the cost of stored data and allows instant scalability, enabling higher utilization of the compute resources. So there's some very concrete items that help us drive lower TCO on the infrastructure side. And on the development and operations side, first, our infrastructure improvements actually do double duty here. The improvements I outlined drive vastly higher utilization. And so we manage an order of magnitude less infrastructure than our customers otherwise would. So that's a win. And then the big difference in our operations is that it's done by software, not people that our customers have to staff. So we're able to automatically, for example, detect and remediate the kinds of rare problems that become common at scale. And we have real-time monitoring and checks for every aspect of the integrity of the system. So these capabilities provide Confluent Cloud with a dramatic advantage in the cost of human management. So some very concrete areas where we help customers drive costs down. We work with customers to do customized TCO assessments sometimes in the sales cycle, sometimes once customers are into deployment, we'll work with them to go back and quantify the TCO. We have a business value team that helps us where we need a high-touch method to do this. We've done this with hundreds of customers. And across those hundreds of customers, we see an average savings of 45%. 75% of these customized assessments that we've done have demonstrated 25% or more than 25% in savings for a total across this pool of customers that we've partnered with of $300 million in total savings. So we're quantifying this very directly. And as you heard earlier, we've also leveraged a third-party study with Forrester to the total economic impact study with us to quantify very significant savings across the customers they spoke with as well. So this is very real. And I want to talk about one specific customer example here. This one happens to be a premier global nutrition company, where we were able to drive $7.1 million in quantified savings with them. This customer looked to Kafka and data streaming to modernize its data infrastructure from a myriad of legacy ETL tools and batch oriented technologies as they modernize to an event-driven architecture. We were chosen as the best fit for this modernization initiative, not only due to our platform capabilities, but also because of this quantified TCO. This was really important to them. So by switching to Confluent Cloud, this customer identified $2.1 million in savings and infrastructure, primarily related in this case to storage, which is included and fully managed by us. And then by leveraging our cloud native capabilities, customer identified another $2.1 million reduction in engineering costs because they were able to drive a 60% reduction in engineering time dedicated to this environment due to our automated operations. And then in this case, the customer was also paying a third party for Kafka expertise to augment their internal staff. So they were able to move away from that and free up some costs. And then finally, with Confluent Cloud, customer was able to get to production 14 months faster on use cases such as customer experience and supply chain optimization and this contributed to $2.2 million in accelerated operating income. So very quantifiable. This is a great story, and we're pleased to be able to do this work with many, many customers. I think that the power of this TCO offering and the fact that we've been able to quantify it with so many customers really enables us to capture far more of the open source Kafka market. Because oftentimes, folks will make analogies from on-prem open source models to the cloud. When in reality, the fully managed cloud offering offers a very different value prop as we've talked about. Traditional on-prem open source business models typically offer a premium product but it's better features for more money. And as a result, the companies that are monetizing that open source are only able to capture a smaller fraction of the market. In contrast with a fully managed cloud offering, we're not just replacing free software, but we're also replacing expensive infrastructure and people costs. And so we can help customers drive costs down. And as the mindset shift is happening in the market and more and more companies are seeing this opportunity, fully managed cloud services is an opportunity to drive costs down. This is a great win for us. And so we think it's going to enable us to capture more of the open source Kafka market, and we're definitely seeing this across our customer base. All right. Let me spend a couple of minutes on our partner ecosystem. And following my presentation, we'll actually have one of our partners from AWS join me on stage for a fireside chat. But let me provide the overall frame of Confluent partner ecosystem. So we view the partner ecosystem as absolutely essential for a data streaming platform's role as a central nervous system. And partners obviously play a critical role in our go-to-market strategy. They expand our reach, increase our efficiency, but it's really strategic for us as well. And in order to become a central nervous system in an organization, we need to be integrated into our customers' ecosystem. And so it means that we have to really think of ourselves as an ecosystem company and make it very easy for our partners to send and receive data from us. So we build the infrastructure, the integrations and the connectivity once. And then every joint customer can leverage Confluent to build their business outcomes about -- across both our products and our partners' products and services. We've talked about the network effects that are very present in our market. And these network effects really drive partner engagement. I would highlight that what's been exciting about our partner ecosystem is that the wins for our partners are so clear when partnering with Confluent in addition to the traditional benefits to us from partnering with folks. But CSPs, for example, benefit as they can capture valuable data that's locked up in legacy data sources. We can get that into cloud environments and unlock more cloud use cases for our joint customers. That's been a really big win for customers and for the CSPs. ISVs benefit because we take care of the ingestion for them, allowing them to focus on value-add services versus data acquisition. And then the SIs have a big opportunity because they can help customers really get to that business outcome and where there's a business transformation layer to take care of. So there's a lot of a lot of interest in the SI market. And this becomes a continual flywheel and an ever-expanding network of the ecosystem and in mutual customers who are benefiting from our highly differentiated ecosystem and go-to-market. There's really 3 types of partners. I think these are fairly self-evident, but we generally bucket our partners across CSPs, ISVs and SIs. And we think about kind of the key pillars of how we partner with each of them a little bit differently. CSPs, first and foremost, it really is about joint innovation and integrations with their CSP native services. And we spend a lot of time with them kind of doing road map sharing and planning and making sure that we have seamless integrations for our joint customers across various services. The CSPs are also important for lack of a better term, kind of distributors and provide us reach to market. Their marketplaces are very heavily utilized by a lot of our joint customers. So we do a lot of transactions through the CSP marketplaces, which helps customers burn down some of their commits to the CSPs. And of course, the CSP sellers are highly incentivized to partner with ISVs for transactions through the marketplace. And as I mentioned, a lot of joint innovation. On the ISV front, integrations with ISV services are key. The network effects that drive mutual consumption. We see a lot of just very clear kind of accelerated consumption on both sides when we partner and make integration easy for our customers. And so based on that, we're going to market with some joint sales players with a number of our top ISVs. And then SIs, as I mentioned, are really helping our customers drive bigger transformation, get to business outcomes. And SIs are interested in building services businesses based on these modern technologies, and we're interested in leveraging SIs to help our customers drive utilization and consumption faster. So there's really a win-win. So all of these pieces add up to increased leverage in our go-to-market model, and that's really what it's all about. I want to just summarize the levers of growth that we see for Confluent. Each of these areas are opportunities for us to continue to drive best-in-class top line growth. We see a large and growing TAM for Confluent, which we've talked about from a couple of different perspectives. The secular shift to cloud is definitely a tailwind for us. We see that in our cloud results. It's part of every conversation with customers, whether they're digital native customers where the entire conversation is about cloud, or whether it's more traditional enterprises, we are core to their shift to cloud and part of that journey. We will continue to penetrate and expand in high propensity segments from digital native to financial services to a public sector, especially as we deliver FedRAMP certification in the future. But we're very focused on our highest propensity segments and how can we -- Tier 1 markets internationally and how can we penetrate and expand in those. Growing and harnessing our partner ecosystem. We believe we're in early innings there. There's a lot of opportunity for growth and leverage in the different partner types that I talked about. Continued international expansion. Today, about 40% of our revenue is from outside the U.S. We are very focused on the highest propensity opportunities first. Very focused on Tier 1 markets for Confluent and putting a disproportionate amount of resources there, while leveraging efficient go-to-market engines to be a little more opportunistic in more markets around the world so that we have a very deliberate and intentional and efficient growth strategy. And then finally, we see large cross-sell and upsell opportunities within our account base. So many use cases, Jay talked about the proliferation of business use cases in every industry. And with every new capability that we bring to market like Flink, and every new demand driver in the market, like generative AI, we -- so many -- those count on data streaming and really help us identify more opportunities and grow the opportunity for Confluent. So these are the levers of growth that we see. And I'll wrap on this that, of course, top line growth is critical, but so is leverage in the model. So as we're building out our growth strategies, at the same time, we are very committed to driving increased efficiency overall for the company, with starting with our go-to-market model. We're very pleased with this trajectory. These results and this trajectory of sales and marketing spend as a percent of revenue for the company. We're on a really solid path. And coupled with continuing to put up best-in-class NRR, 130% plus for 8 consecutive quarters, and getting efficiency from continued cloud adoption and more than 50% of our cloud customers having started on PAYG with an increasing percentage quarter after quarter. So that's a little bit about our go-to-market model and how we're balancing growth and efficiency. And with that, I would like to transition to a fireside chat. I'd like to welcome to the stage Matt Janssen from AWS. So coming up and join me, Matt, and we'll talk a little bit about our partnership. Thank you.

Matt Yanchyshyn

attendee

#22

Thanks for having me.

Erica Schultz

executive

#23

Excellent. Welcome. Thank you for joining us. It's great to have you.

Matt Yanchyshyn

attendee

#24

It's great to be here.

Erica Schultz

executive

#25

I'm going to introduce to you here. I'm going to read your bio, and then we're going to get into some questions. So hopefully, I've got this right. You can correct me if I miss anything. But -- for those of you that don't know, Matt, Matt is the General Manager for AWS Marketplace and partner engineering. It's a new role for him at AWS, but he's been with AWS for, I think, 11 or 12 years. So quite a while, has managed a number of different service lines. He's been in the digital media and technology industry for more than 25 years. And of course, if he's been at AWS for about 12, he's been there since relatively early days. You've led a number of different AWS service teams and technical sales organizations, including AWS Control Tower, service catalog, well architected and now all engineering for AWS partner organization, which is pretty vast. Before joining AWS, you spent a decade at the associated press where you help build field offices all over the world and led modernization efforts for the AP's digital media distribution systems. You hold multiple AWS patents. You're an Amazon bar raiser with more than 650 interviews under your belt. And you've had the privilege of leading great teams. You have launched dozens of new AWS programs and features and products. So thank you for joining us.

Matt Yanchyshyn

attendee

#26

Thanks for having me.

Erica Schultz

executive

#27

Yes. Great. Okay. So a few questions for us. So the first one, this is obviously a hot topic, generative AI, and specifically, the importance of augmenting large language models with streaming data to make enterprise applications even more useful. Would love to get your take on that and how you see Gen AI evolving -- real-time Gen AI evolving in the near term and what that might mean for enterprises?

Matt Yanchyshyn

attendee

#28

Great. I mean first of all, it's evolving quickly, way faster than I think anyone could have predicted a year ago. It's no secret we like to say data is gravity and the more data you have, the better your models are. But I think specific to AWS strategy and what we're able to do with Confluent. Our strategy is pretty simple. We have a strong partner strategy when it comes to generative AI. So when you look at SageMaker Bedrock, which is our upcoming gen AI service, it 3 out of the 4 foundation of models that we're launching with are actually third-party models. So Stability AI, AI21, Anthropic with their very cool cloud foundational model. And I think what sets AWS apart when it comes to Gen AI and again, how it ties back to what you're doing is we're all about, yes, foundational models are amazing. But they're even better and they take on better relevancy when you train them and customize them with your own data. And importantly, if you're going to customize with your own data, you don't want that data leaking. So privacy and augmenting our core foundational models with data that you bring in yourself is really sums up our strategy. And then customer choice. Not all models are made the same, models are really, really expensive to train. But you can't just rely on 1 or 2, you need a selection. We have our own Titan model. But if you want to use -- I was just using stable diffusion to generate some amazing images for my kids, my kid just turned 11, so I made him a picture this morning. And that picture, what you can generate or the text or the documentation gets better with the private data that you incorporate into the model. So the more data you have, the more secure it is, the better you're generative AI results are going to be.

Erica Schultz

executive

#29

Yes. That's great. So you're a long-time AWS veteran, and you've recently stepped into this new role, looking after all of partner engineering. We've been fortunate that among the demands of being new in your role, you've invested in Confluent early on, which has been great. And I know the joint innovation is one of the most important pillars of our partnership. So you've been spending some time with Shaun and Chad, which we appreciate. Would love to just get your perspective on how -- why we're a priority for you and kind of how you view the partnership?

Matt Yanchyshyn

attendee

#30

Yes. I mean -- well, I don't say this lightly. I love you guys. I mean, when I was just looking at the slides, I was acting as a customer or as an investor in the audience. And one thing that stood out, and actually the last slide you had was the 50% -- your go-to-market plan and the 50% of the pay-as-you-go opportunity is converting into. And that really resonates with actually like my and our current strategy for Marketplace. So it's no secret that we're an enterprise procurement vehicle for software and data and professional services, but the overwhelming majority of those enterprise-focused private offers start with self-service and start with that discovery phase. And when you think of Confluent, you're really at the vanguard of that self-service sort of re-revolution. I say re-revolution because when cloud marketplace is launched a long time ago, about 10 years ago, almost as long as I've been at AWS, it was a self-service vehicle at the beginning. But very quickly, we pivoted into private offers, enterprise motions. And now we're kind of returning. We're realizing that, hey, if we can drive up really high conversion rates and really bring in organic joint opportunities, that can feed the private off the flywheel, like you just said, and we can all be successful. So your strategy is actually highly aligned with our current marketplace strategy. Not to mention that your top partners like Databricks and Mongo, I mean those are also my top partners. And the fact that Confluent in particular, works really well in these tri-party multiparty solutions for customers, really aligns well with solution-based selling as well, which is all we're doing these days. So strategically, we're just 2 highly aligned companies and you're willing to lean in with us. You're co-investing in a lot of these new innovations and features we're coming out with. And not every partner is willing to kind of put themselves out there and you are, innovate and take risk together and that's awesome.

Erica Schultz

executive

#31

Yes. Are there any specific areas of joint innovation between the 2 companies that you're most excited about?

Matt Yanchyshyn

attendee

#32

Yes. I mean I mentioned sort of self-service specifically -- we're really interested in sort of reimagining the sort of end-to-end journey of the different personas who are now buying through cloud marketplaces, it's really evolved. It's amazing we started with the developer, but now there's procurement offices, even increasingly lawyers and governance teams who are using marketplace. And like really co-innovating on features that target these different personas. Again, investing in some of our self-service features together. But also the post procurement journey, I won't get into too much details in such a huge room, but some of the work we're actually currently doing on what we can do after that purchase motion in terms of configuring, getting things to work well together after the procurement motion. And that's one area that you're really interested in, how can we get people using the software faster and certainly an area where we want to help. So that is probably the area most excited for when it comes to co-innovation.

Erica Schultz

executive

#33

That's great. We're very aligned there. I know we're working on some exciting stuff. I think it would be worth spending a minute on -- because not everyone is familiar with how this works. Why the AWS field organization is interested in partnering with Confluent or ISVs in general? So maybe talk a little bit about the incentives that are out there for the AWS field organization to partner with ISVs and then why Confluent is interesting to AWS sellers and solution architects?

Matt Yanchyshyn

attendee

#34

Yes. I mean you just touched on a couple. I mean, there's the obvious stuff like commit drawdown, enterprise discount plans, et cetera. But I think in AWS certainly in the last couple of years, our field is really woken up to the fact that when you sell with partners, those deals close faster, they're bigger deals, and it's kind of a win-win. So I always say, don't believe me, you just talked about a Forrester total economic study we commissioned our own. And it showed that -- it was from Q1 2022, and it showed that deals that are transacted through Marketplace, AWS Marketplace specifically are 40% faster and 80% bigger. And so when you say that, you couple it with -- and by the way, that was 80% bigger, 40% faster deals can help you draw down the customer commit and hit your quota, their eyes light up. So the field -- once they started to see this third party, this analyst information, they wanted to double down with the top partners, the ones that are actually driving this. And so when I look at -- I think we had, what, 300% year-over-year growth in shared customer wins last year. And if you look at the conversion, I don't think I'm allowed to share that stat. But the high conversion rates that you're seeing in the marketplace. And again, that Pay As You Go to a private offer and sort of long term, they're going to go with the partners that are seeing success so they can double down on what Forresters telling them. So that's why they're excited about Confluent.

Erica Schultz

executive

#35

Yes. That's great. I think it is worth pointing out that even in these areas where you have a breadth of native services and also your sellers are very highly incentivized to sell partner solutions, which is great.

Matt Yanchyshyn

attendee

#36

I mean they don't differentiate. They want to do what's best for the customer. And I know that sounds almost cliche for commit, but it's true. AWS, like they drilled into our heads every day, do what is best for the customer. And data came these days. It's particularly came with generative AI and what's coming out these days. you bring more data into the platform, you work well with top partners. You're closing a lot of deals and converting opportunities. So it's a win-win. It's easy for them.

Erica Schultz

executive

#37

Yes. That's great. And we have so many great stories together. Yes, which is fantastic. What -- so we're about -- we signed a strategic collaboration agreement with AWS about 1.5 years ago. We've been partners for a number of years before that, but we really kind of upped our game together 1.5 years ago. Can you talk a little bit about the specific benefits of that collaboration agreement and maybe some of the specific programs and results that strengthened our partnership?

Matt Yanchyshyn

attendee

#38

Yes. So first of all, strategically, I worked for [ Aruba Borneo ], who leads our global partner organization, and she's big on SCA. She's big on strategic calculation agreements, but not with anyone. She wants to sort of have a select group of, again, successful, highly motivated partners to do these with. In terms of the benefits, we're increasingly tying our top incentives, for example, like the ability to draw down customer spend. And some of our biggest incentives, like we have a very successful migration program. We call our MAP program, some of the top incentives were tying directly to these strategic agreements. So when you enter into a long-term strategy agreement with AWS, we're not just going to double down on co-sell, we're also going to insert you and do things like our ISV Accelerate program or the bigger deal part of our Migration Accelerator Program. So it kind of opens the door to a next level co-sell where we do it programmatically, much more globally. And also from an engineering perspective, I'm looking for partners who are committed. Honestly, if I'm going to take the time with my engineering team, just sit down and co-build features. I want to do that with partners who are all-in, honestly. And so by -- it's one thing to say, yes, we're built on AWS. It's another to say, "Yes, we're built on AWS. We're all in AWS. And by the way, we signed this multiyear collaborative agreement. I'm going to take the time to sort of invest my product leaders, my engineers with those partners.

Erica Schultz

executive

#39

Yes. That's great. It's been great for us because we've seen -- we feel like we know we've gotten access to a lot of those top programs that are good for customers, good for your sellers. And so it just gets us kind of be more -- in front of more opportunities, which is fantastic. Yes, it's been very positive. What -- any other comments about what you're hearing from customers? I know you spend a lot of time with customers just about the uptake of data streaming.

Matt Yanchyshyn

attendee

#40

Yes. I mean it's no secret that -- it's no secret that it's hard. I think just speaking of someone myself, like I have to -- like you said, we have a large portfolio of services and even just the service metrics, let alone the business metrics and have a constellation of dependencies and I have to track everything is working. And so I myself understand and appreciate the need to have, first of all, high-speed, reliable but very flexible data collection mechanisms that they need to scale in ways that they did 10 years ago. Like I'm collecting hundreds of millions, if not billions of pieces of information every year just to run my own services. And customers -- I'm built on AWS services like any customer. And so customers know this, too, and they want solutions that can scale that's not a differentiating part of the business. Like I don't get paid, I don't make money off of my analytics back end so they got to sell a marketplace. And customers have really sort of wisened up to that. But -- also with intelligence, just for us, for example, we're looking at how we can improve partner discovery through generative AI, through our own product. And that discovery only gets good with data. And if that data is super messy, can't be easily combined -- is impossible to find, you won't meet progress. So I think you're starting to see that tipping point where it's not about, hey, is my latency good or these operational metrics is, hey, how can I take these metrics that we're just part of my operational rigor and turn them into business value. And to do that, you need to have a ton of data and now the compute and the technology we have with things like generative AI actually allow you to turn that into a meaningful data. So we've really kind of flipped a bit in the last couple of years, so it becomes very important to use solutions like yours.

Erica Schultz

executive

#41

Yes. Great. Well, with that, I just want to thank you again, Matt, for the partnership. Importantly, for joining us here today. So thank you.

Matt Yanchyshyn

attendee

#42

Thank you.

Erica Schultz

executive

#43

Great. Thanks. Thank you. All right. So I'm going to wrap there. Thank you for the time, and I'd like to invite our CFO, Steffan Tomlinson, to take the stage next. Thank you.

Steffan Tomlinson

executive

#44

[Technical Difficulty] Tomlinson, I'm the CFO. And you've heard today a lot about the secular drivers that are really driving data in motion, data streaming, stream processing. And I'm really excited today to walk you through how we're leaning into this opportunity, to really be driving efficient growth. And the starting point of this journey is our operational and financial execution. In a very choppy environment over the last year, 1.5 years, we've been building a track record of delivering on our commitments even in the face of these choppy waters that are out there. And we've been able to drive very durable growth. The product market fit that we have, coupled with our go-to-market execution, and this move from batch processing to streaming has really been driving high rates of NRR, compounding and durable growth. And we've been delivering all this growth while really managing and balancing profitability. We've been very much focused on working within a framework. We had a target model that we outlined at the time of our IPO. That's guided our journey on this. And we've been able to deliver very high rates of growth greater than 30% while delivering lots of operating margin improvement. In fact, in the last 3 quarters, we've driven over 10 points of annual operating margin improvement for each of the last 3 quarters. So I want to take a few minutes to look behind the details of these and this starts with the strong execution. The momentum since our IPO, we've went public about 2 years ago. Our total revenue run rate has now reached approximately $700 million, and that's doubled. Our cloud revenue run rate is approximately $300 million, and that's grown more than 5x and we've delivered 18 points of operating margin improvement during this time period, and it's also driven by best-in-class NRR and gross retention metrics. When you look at our progress on both the top line and our path to profitability, we've achieved roughly $700 million in revenue, and we expect to be profitable in Q4 of this year. And the company is 9 years old. And that puts us in the top quartile of performance relative to peer group. So on multiple dimensions is differentiated from the alternative offerings. The TCO and ROI characteristics that you've heard about earlier today are compelling to our customers. We're able to serve our customers' needs wherever they are on-prem, in the cloud, in multi-cloud environments. This creates a big competitive moat for us relative to most solutions that are out there. And this differentiation has driven a 70% CAGR. And in the early years, most people know that the growth was primarily driven by Confluent Platform, our on-prem offering. But over the last several years, the growth has really been driven by our Confluent Cloud offering, and that became a catalyst for future growth. Cloud now accounts for 42% of revenues and we expect cloud to account for about 48% to 50% of revenues in Q4 of this year. All of these characteristics have led to strong customer commitments. RPO at the end of '22 was $741 million, and that grew at a 7% CAGR. And all of this RPO performance has really helped us with visibility and predictability. One of the key measures that we track is cRPO, and cRPO coverage to revenue is tracking at or above 60%. And the CRPO CAGR over this time period is 69%, and again, these dynamics have created visibility and predictability and has really helped us deliver on our quarters ever since we've been a public company. Now while our hybrid model of selling both Confluent Platform and Confluent Cloud is important, I'll tell you Cloud is leading the way. It's powering the top line, customers are developing new applications with a cloud-based premise-first, and they're moving existing workloads to the Cloud. Now we are a cloud-first company today and we are 10x better than a do-it-yourself Cosco approach. And our cloud product now has best-in-class NRR. More than half of our customers that have over $100,000 in ARR are using Confluent Cloud either on a stand-alone basis or in conjunction with Confluent Platform. Another driver of this growth and strong, consistent execution has been our diversified customer base. We have 4,690 customers, the customer count has grown 10x since 2018, and we're in over 100 countries around the world. And we serve nearly every industry that's out there. So it's a highly diversified customer base and industry base. Now let's turn to the durability of our growth in the 3 core drivers that underpin it. First, to start with TAM. There's a large and growing $60 billion total addressable market today. Our differentiated customer growth go-to-market model, which is purpose-built for data and motion with best-in-class NRR has led to this compounding durable growth. In our Cloud consumption model is powering growth as our customers are running mission-critical workloads across all of their environments whether it's on-prem, in the cloud or multi-cloud. So I want to take a moment to double-click into each. The $60 billion TAM, it's a new category that we've created earlier today. Shaun and Jay went through the TAM construction. I would really like to focus on a couple of things. The first is with the addition of Flink, this enables us to capture our TAM even faster than we had the opportunity before. So that's an accelerant to us. Now the explosion in data volumes that we've seen and the explosion in applications that's happening across the entire enterprise landscape and what will be coming with generative AI are catalysts for future TAM expansion. And over the next 3 years, we anticipate our TAM to be increasing at a 19% CAGR to about $100 billion. The net of it is, this is a big market, and there's a lot of opportunity for growth in front of us. Now the TAM is also supported by the significant opportunity that we have for converting open source, do-it-yourself, Kafka customers to Confluent Cloud. There are over 150,000 do-it-yourself Kafka users that are out there today. And we have 4,690 customers, that Cloud, our Confluent Cloud offering is far less expensive than the alternative offering. If you think about what we deliver, there is less personnel that's needed to run our Confluent cloud offering, than a do-it-yourself Kafka implementation. There's less bolt-on software. There's infrastructure cost and the overall TCO and ROI is very compelling. In a nutshell, it's faster, better and cheaper. And the technology has brought adoption across all of the industries that we play in. You saw these stats earlier today, over 75% of the Fortune 500 run Kafka and we have about 36% of those customers today. So there's a big opportunity for us to grow in this segment. Now our product market fit and go-to-market model is a powerful combination that's led to strong growth in [Audio Gap] And we have 1,075 customers that have spent over $100,000 in ARR and they account for more than 85% of revenue. And that's been consistent ever since we've been public. This cohort has grown 6x since 2018. Now moving up the stack, we have $135 million plus customers. That has grown 12x since 2018. And we have a growing number of $5 million and $10 million ARR customers. So when we evaluate our customer base and we look at the opportunity to grow and expand in many of these customers, we're still in the early innings of our ability to satisfy their needs for real-time data in motion. So we have a lot of room for [indiscernible] to capture more wallet share. The second core driver of our differentiated customer growth go-to-market model, which [ powers ] our land, expand and consumption engine is these NRR in gross retention metrics. For 8 quarters in a row, we've delivered net retention rates above 130% and gross retention rates above 90%. In particular, our Cloud business is a standout performer. We're above 140%, and the highest NRR that we see is for the newer cohorts. And then you've heard this earlier today, we have very strong network effects within the organizations that we serve. This network effect has really translated into as more applications and data stores get connected into our platform, it begets more applications in data stores need to be connected. And so we've seen that manifest itself in a 35-plus-x ARR expansion after 4 years. Now we have a powerful cloud consumption model. Erica presented this cloud consumption model a little bit earlier, but I wanted to share a few additional points. The reason why it's unique and powerful, it's because it's a revenue-generating model due to the network effects I just described. We're running mission-critical workloads and as a result, these workloads often come pre-optimized and over the last 12 months, in a tougher business environment, our consumption model has actually held up very well. We've been showing compounding growth in our consumption model. And it's due to the characteristics I just talked about, our consumption model, I would just say, is held up better than maybe some other consumption models that are out there, and that's due to the nature of loads and where we sit in the technology stack and the mission criticality of the workloads that we're running. All of this has led to compounding growth. Our top 20 customers for cloud span 11 unique industries. Approximately half of them started on Pay As You Go and about 40% of them are hybrid customers. In this cohort of customers, it's taken 6 quarters for them to get to $1 million of ARR, very, very quick. In the average ARR across this cohort base is $3 million, and that's up from $0.6 million just a couple of years ago. And a total ARR multiple is 17x. Again, this demonstrates the power of our customer growth go-to-market model and the land, expand and consumption characteristics that we have. Now increasingly, as the enterprise continues to move workloads to the cloud, our installed base of Confluent Platform customers are adopting Confluent Cloud, this perhaps is maybe one of the areas that isn't as well understood by some folks who are looking at Confluent from an investor standpoint. So looking at the cohort of these hybrid customers who run both Confluent Platform and Confluent Cloud and spend over $100,000 in ARR, the customer has -- the customer account has grown 70x. And there are a few attributes to highlight in this cohort. The majority of these hybrid customers started with Confluent Platform. There's an inherent cross-sell and upsell opportunity that spans both Confluent Platform and Confluent Cloud customers. In this cohort, in particular, has a high ASP in a higher cloud expansion rate than the average customer that's out there. Doing another double click on our cloud business and to put a finer point on things around the power of our consumption model. Here are two examples of -- from our key customers that started their journey with us and what the profile of their expansion is. So the first is a large international bank that started with about $100,000 in ARR and grew 34x to $4.6 million. The next is a multinational manufacturer that started on cloud as Pay As You Go with a $50,000 and expanded over 192x to $4.3 million. So the progress that we're making in these customers puts us on a path to be the central nervous system as they deploy Confluent more broadly across their environments. Now I'd like to take a few minutes to look at how we have managed growth and profitability. We've been committed to driving efficient growth and we shared a framework in a target model at the time of our IPO that we're going to be updating today. But first, I'm going to start with our path to date. Starting with the top line. Even in this difficult macro environment in FY '22 and FY '23, we've been able to navigate the environment pretty well, and our growth has been resilient. We continue to expect that we're going to hit $760 million to $765 million in this fiscal year. And while they're about 3 weeks left to go in the quarter, like ever quarter we're going to have work to do in these last 3 weeks to land the quarter. The consumption patterns that we called out on our earnings call are holding up in line with our expectations, and we're currently tracking to our Q2 guide. And so the growth drivers, we previously covered this section. So I'm not going to cover each of them, but I do want to just re-emphasize that there are multiple growth drivers to this business that we can cover during our cocktail hour as well. Turning to the unit economics of the business. We've made great progress on gross margin. And this has been a really like high point for the company. And we've done this despite the revenue mix shifting to cloud over the last 3 years. Now we're actually updating our gross margin target for FY '23. Originally, it was 70% to 72%, and we now expect gross margin to be in the range of 72% to 73% for the year. Now the drivers behind this -- we've continued to have very healthy gross margins for Confluent Platform. But the work we've done around our Confluent Cloud offering has been very robust. We've achieved scale in the cloud hosting costs. The new product and feature releases that we've been coming out with have increased the value that we're delivering to customers. There's been a lot of work and effort going into engineering and infrastructure engineering to get more optimizations. And we're also seeing an increasing mix of multi-tenancy for cloud, and that it comes at a much higher gross margin. Now we also understand this work is never done. And this is something that we're laser-focused on, and we're going to continue to work to drive efficiencies. Now turning to OpEx and operating margin, we've been disciplined in terms of where we're making investments. We've been controlling the rate and pace of hiring to ensure we're driving leverage and while also delivering high revenue growth. The investments in product and sales capacity have been key drivers. And the Pay As You Go onboarding of customers has been a key way to drive low-cost onboarding of customers. We've also seen great network effects that have driven expansion, like I talked about earlier, been focusing on improving sales productivity. So these factors have contributed to a very healthy pace of operating margin improvement as we continue to target achieving breakeven in Q4 of '23. Now looking at our path to date and how we've managed growth and profitability, our investment decisions that were powered by this historical model that we had at the time of IPO, we're also influencing how we allocated capital, and it's been guided by this growth and profitability framework. The key attributes to call out in the old model, it was predicated on the growth rate being greater than 30%. And growing cloud mix of revenue and a high net retention rate and gross retention rate, we're always focused on improving the unit economics and proactively managing expenses. And we actually accelerated our path to profitability by a full year, given everything that had been going on both in the market and then how we looked at the power of our own financial model, so we pulled profitability forward. Now I want to take a minute to talk about the future and what the path forward looks like. As we look at managing growth and profitability over the midterm, we're looking to resource the company for a 30% -- approximately 30% annual revenue growth rate. We're continuing to proactively manage expenses in the rate and pace of hiring. We continue to expect to achieve operating margin breakeven in Q4 '23. And once we've become profitable in FY '24, we intend to remain profitable for -- on a full year basis, with about 0% to 1% operating margin for the full year. That's going to be another year where we're going to be delivering outsized operating margin expansion of 13 to 14 points on a year-over-year basis. And over the midterm, we're looking at expanding operating margin to roughly 5% to 10% while delivering top line growth. Some of the key assumptions that underpin this -- that macro does not deteriorate materially, and that Flink and FedRAMP continue to perform according to plan. Now the shape of our business is changing over time. And it's changing to -- that we're basically becoming predominantly more cloud. We expect approximately 80% Cloud revenue mix long-term. And with this change, Confluent's benefits of the compounding growth that we've talked about with a high NRR and gross retention rate. Factoring these dynamics into the financial model, we're updating the growth and profitability framework that we had at the time of IPO. So in this new model, we're increasing our midterm and long-term target ranges for gross margin, we're modestly improving the long-term target margin for sales and marketing as a percentage of revenue, and we're increasing both the midterm and long-term operating margin targets and now expect to deliver over 25% operating margin in the long term. It's important to note that the top line growth will continue to be a determining factor around rate and pace of operating margin expansion over time because we are continually calibrating the opportunity that's in front of us and the amount of operating margin that we're going to be delivering and we want to make sure we're taking as much wallet share as possible while continuing to build a profitable business. There's one more item I'd like to cover, and this is how we're managing dilution. So starting with stock-based compensation expense. Most of you know it, it's a lagging indicator due to ratable recognition of expense. And a lot of this that we've been carrying for the last year or so has been impacted by the pre-IPO options and a significant amount of hiring that we did over the last couple of years. In the second half of '22, the pre-IPO option expense will end and then we should start to see some meaningful improvement for SBC as a percentage of revenue. And over the midterm, we're targeting SBC to be approximately 25%, and then we're going to be in that down to the mid-teens. Now what we're doing to control future SBC is focusing on managing dilution today. And we're committed to driving net dilution down in FY '23, targeting 3% to 4%, and then in the midterm, we're getting 3%, and in the long term, we're driving that down to less than 2%. So that wraps up my presentation. Now I'd like to introduce Rohan Sivaram, SVP of Finance to cover financial related modeling topics. Rohan?

Rohan Sivaram

executive

#45

Thank you, Steffan. And good afternoon, everyone. I'm Rohan Sivaram, Senior Vice President of Finance here at Confluent. The purpose of today's modeling session, in my mind, is to help you get a better understanding of our business model and some of our key financial metrics. And as I was thinking through the materials for this session, we've incorporated a lot of the feedback that you've shared with us. So thank you for that. With that, let's just jump right in and talk about the power of our hybrid business model. This is more of a setup before I jump in. So as you've heard from most of the presenters today, Confluent needs to be wherever our data and applications reside. Be it in the cloud, we need to be there with Confluent Cloud or be it on-prem. We need to be there with Confluent Platform. Starting with Confluent Cloud, in the last reported quarter, Confluent Cloud was 50% of the business. And as you know, it is our fully managed cloud native offering that we have. Confluent Platform was 50% of our business in our last reported quarter, and that's our self-managed software offering. What is not here in the slide is our hybrid set of customers, we have a growing cohort of hybrid set of customers that Steffan touched on earlier. And if you look at the 100,000 plus cohort, we have over 200 hybrid set of customers. Services is another offering that we sell along with our product sales. And it is primarily sold to accelerate our consumption and obviously help our customers adopt our product better. What I'm going to do probably over the next 10-odd minutes, is to walk you through a couple of customer journeys, one for Confluent Platform and one for Confluent Cloud. And as I'm doing it, I'm going to do three things. First, we'll do a little bit of a setup, talk about some definitions. Second, I'll walk you through an illustrative example. And last but not the least, hopefully, we'll leave with some key takeaways for each of these. So let's get started with Confluent Cloud. Companies and prospects can start their journey with cloud by signing up for free on our website with a few hundred dollars in credits. Subsequently, they can put in their credit cards and become what we call as a Pay As You Go customer, and Pay As You Go customers have no contractual commitments. Alternatively, the Pay As You Go customers or net new customers can commit to a contract with us, whereby they're committing to a level of consumption over a period of time. And these commitments typically happen in either annual or multiyear form, and they show up in our financials as RPO or remaining performance obligations, or cRPO, which is current remaining performance obligations. One thing when a customer consumes more than they commit, we recognize that incremental revenue as overage revenue and when a customer is under consuming, the delta is recognized as expiration revenue. Finally, time to ramp is basically the time our customers take to ramp up to their committed contractual run rate. With that, let's jump into an example and talk about data and motion. Data In Motion, Inc. is a company that's currently using open source Kafka. And one of the teams within the company wants to experiment with a managed service, and they decide to choose Confluent Cloud. So they come to our website, quickly signed up and through our in-product tutorials, best practices and some of the common recipes that we have available, they quickly become a Pay As You Go customer. What's interesting about this product-led motion is that it drives organic adoption. It is frictionless and more importantly, it comes with very low sales touch. Let's look at some of the financials. So in month 1, Data In Motion Incorporated spend $1,000 with us and we recognize that $1,000 of consumption as revenue. From an RPO or cRPO perspective, there is no RPO, cRPO at this point because the customer has not committed to anything. And finally, when you think about the total customer count, Data In Motion Incorporated is now contributing to our total customer count. Moving on the product, the Data In Motion is starting to use our product, and they are finding value in it. From experimental use case, they slowly start moving into production. And Confluent offers a lot of benefits. So they decide to sign a $120,000 committed 1-year contract with us to take advantage of the volume discounts we offer and to take advantage of the benefits that Confluent provides with respect to support and our expertise. At this point, at the inception of the contract, we see an RPO and cRPO balance of $120,000, and in the first month of the committed contract in this particular example, they consume $3,000, which is quickly recognized as consumption revenue for us. And as we exit the first month, we are left with $117,000 of RPO. You'll see there's an inverse relationship between revenue and RPO as we move forward. Now we have a few months into the committed contract and the consumption keeps happening, and we reach the 6-month time period. At this point, the company is spending approximately $10,000 with us. And they've actually hit our net consumption run rate that they have committed to us, which is $10,000, and what's also interesting to note that at this point, they've also crossed the $100,000 threshold. So when we report our $100,000-plus ARR customers, this is when they are part of our reporting. One interesting, I'd say, observation here is our time to ramp of 6 months is very unique in infrastructure and it's actually a key differentiator of our consumption business model. From with respect to rev rec and RPO, we continue to recognize revenue based on consumption and our RPO balance continues to get depleted. It's also important to note that Data In Motion does not stop growing at this point, which takes us to month 12 of the contract. And as you can see from the chart, they're consuming at $15,000 in month 12 of the contract, which is a run rate of $180,000, higher than what they had initially committed for $120,000. So a couple of observations here. Like from a setup perspective, two things which is really working well for Confluent. First, we're very well set up for net retention and gross expansion at this point given the run rate of $180,000. And second, I also feel that we are very well set up for future margin expansion because typically a year or 2 of a customer is much better from a unit economics perspective than year 1. So at this point, in month 15, we don't have any RPO and cRPO, but the customer is consuming and they're getting value out of our product. So they decide to sign a renewal with us and as they sign a renewal, the size of the renewal deal will typically be at or above the $180,000. Although it's an illustrative example, this shows a typical journey of a customer -- cloud customer that we have. Now bringing it all together, as most of the speakers said, be it from [ PayGo ] to commit to renewal expansion, there are a few things that connect to all of these. First, it's a deeply differentiated cloud-native model, which helps our customers get started, expand and scale. Second, our customer growth go-to-market model partners with the customer at every stage of their journey and with the right product, expertise and go-to-market strategy. And last but not least, our customer success team partners very closely to drive best-in-class time to ramp. With this as a backdrop, let's talk about a few key takeaways for Confluent Cloud. First, as Erica touched on it, Pay As You Go is a very important piece of what we do and 50% of our cloud customers start on Pay As You Go. Our time to ramp is approximately 6 months, which is best-in-class for infrastructure software. We've spoken about this concept in our earnings calls over the last few quarters, Cloud as a percentage of new ACV has been at or -- has been above 50% for 5 consecutive quarters. As a result, our cloud as a percentage of total RPO is approximately 50% in our last reported quarter. From a revenue mix perspective, we continue to expect to exit Q4 of 2023 with a mix of 48% to 50% for Cloud and from a revenue seasonality perspective, we expect the lowest sequential add to be in Q1 and more pronounced increase in second half of the year. And finally, when you look at the Cloud NRR, that's, of course, going to be the highest NRR for the company and greater than 140%. Let me switch gears and talk a little bit about Confluent Platform. Confluent Platform is sold as a term-based subscription that's typically built annually in advance. The Confluent Platform typically has an upfront revenue recognition that happens for the license piece which is 20% of total contract value. The remaining part of the contract with the other 80% is recognized ratably over the period of the contract. And finally, for RPO and cRPO, Confluent platform, RPO, cRPO is very similar to Confluent Cloud where we recognize based on our customers' commitments. Let's take the example of Data In Motion again. This time, they are in an industry where the data resides heavily on-prem, and they come to Confluent and sign a 2-year commitment for $200,000. So what happens at inception? At the inception of the contract, we recognized $40,000 as license revenue that's 20% of $200,000. For the remaining we typically recognize that over the period of the contract, which in this case is 8 quarters, $160,000, that's about $20,000 per quarter we'll be recognizing. Let's fast forward to end of year 1. We continue to recognize the maintenance revenue or PCS revenue, as we call and we exit year 1 with an RPO and cRPO balance of $80,000. As you'll see here, the rev rec lines here are very interesting because on Confluent is like a curve where it depends on consumption, where this is a lot more predictable, and it's every quarter, you get the same amount. As you exit year 2, you're basically depleted all your RPO and cRPO and you've recognized the maintenance revenue over the period of the contract. And at this particular time, the customer is basically in a position where they're going to sign a renewal with us with some amount of expansion and NRR is going to drive that. Finally, I think a few takeaways for Confluent Platform from a modeling perspective. Like I said, 20% of Confluent Platform revenue is recognized upfront and the remaining 80% is recognized ratably over the period of the contract. Confluent Platform drives more revenue seasonality than Confluent Cloud due to the upfront revenue recognition. Confluent Platform leads to less RPO than an equivalent Confluent Cloud deal. That's again due to the upfront revenue recognition. And finally, for some of the key metrics, like $100,000-plus ARR customers, $1 million-plus ARR customers or NRR for Confluent Platform is based on consumption -- it's based on commit, I beg your pardon, and not consumption. Let me take us home with some key company-wide modeling points. These slides will be posted on our IR website shortly. So I don't plan to go through all of them in detail, but I'll cover a few of the seasonality-related metrics. Let's start with revenue seasonality. When you think about revenue seasonality, we typically expect second half to be more weighted than our first half, with Q1 being the seasonally weakest quarter and Q4 being the seasonally strongest quarter. When you look at non-GAAP operating margins, we expect Q1 to be the lowest and that's driven by the lower revenue seasonality you see in Q1 and large onetime expenses that we see like company kickoff and sales kickoff that typically happens in the first quarter. And talking about seasonality, when you look at free cash flow, we expect Q1 to have the lowest seasonality or lowest free cash flow, and that's driven by our company bonus payout as well as our ESPP settlement. Hope this helps you get a better understanding of our hybrid business model, and you'll have this information handy on our IR website. With that, I'd like to thank all of you and invite the management team over for some Q&A.

Shane Xie

executive

#46

All right. We're setting things up. Well, we have the full management team here. So if you have questions, raise your hand.

Michael Turrin

analyst

#47

Michael Turrin, Wells Fargo. Thanks very much for taking time and for all the content we've provided. It's an investor session, so I'll start with the target model question because Steffan just laid it out for us. So margins showing clear progression, particularly relative to where we stood at the IPO. The growth assumptions tweaked to touch from above 30% to now the squiggle line 30% -- can you just frame out -- is that tied to sales capacity being at a different point versus what you were previously expecting? Is it a change in the macro? And actually, One of the questions I feel most often is how to think through Confluent's growth and a rebound recovery scenario. So it's not the world we're in currently, but can you just help us think through some of the inputs on that side as well?

Steffan Tomlinson

executive

#48

Well, on the top line growth, we're resourcing the company to be growing at approximately 30%. And a lot of that is a reflection of the macro. We've been very much focused on preserving and increasing sales productivity and capacity, I should say. And that's something that we're going to be like looking at continuing to do but do it judiciously. And then as far as if the economy rebounds down the road, I'm sure that would benefit lots of companies, including Confluent. The last thing I'll say is, as I think about the structural drivers of growth, we have a wonderful go-to-market organization but the product teams that are working not only on the Confluent Cloud product, but the upcoming Flink product that will be released that should help maintain our growth rate. And in some scenarios that we've run, it could be an accelerant. And then we haven't really baked in anything material on the role of generative AI in the short term. We think that, that's going to play out in the enterprise over a longer period. And that -- but that's something that we think about because we're really excited around what our customers going to be doing with generative AI, and we should be a beneficiary of that. But I think this is going to play out over time as opposed to something more immediate.

James Wood

analyst

#49

Great. Derrick Wood at TD Cowen over here, and you just led into -- what I was going to ask about in terms of new product efforts. And so you mentioned, Steffan, that FedRAMP and Flink, you've got certain expectations around that. If you guys could walk through kind of what the milestones are to expect. Erica, how you're thinking about the go-to-market focus once they come out, especially around Flink, but also what you're doing with FedRAMP. And then on the generative AI side, what you guys are potentially doing to lean into that opportunity right now?

Edward Kreps

executive

#50

Yes. There's a lot there. So maybe we can divide it up into chunks. I'll start with generative AI one. We laid out a little bit of the kind of architectural pattern we're seeing with customers. I think there's additional investments we can make that will help make some of that easier. But the big opportunity is definitely the flow of data to power these models and the actual application of this 2 streams of data that customers have. And so that's what we want to make easier and easier. The -- yes, we have, I think, a very big aspirations for Flink. It's kind of logically understandable that monetizing the compute and streaming applications will be an obvious vector of growth on top of the streaming data that we already have and kind of naturally combines with what's there. The key milestone that everyone should look to and expect is the GA of our Flink SQL service this year. We've communicated that, that was coming. We've seen good progress against that. It's in early access with customers now, but that's kind of the key thing that will unlock monetization next year. And we do expect that we will ramp like any cloud service. It's not 0 to 1,000 on day 1. That will roll out to customers who will bake it into their applications over time. And we'll be adding capabilities in some of the other Flink KPIs so we will open up more monetization off of that over the coming years. So that's going to be one that we think can be quite significant. We think that the opportunity around stream processing is as big as the opportunity around kind of core Kafka itself and it will play out over a number of years as we grow into that process. I think there were several other side parts that, if you wat to remind me...

James Wood

analyst

#51

Yes, FedRAMP focus.

Edward Kreps

executive

#52

Yes, yes. That's one for us as well. It's a key certification for the federal government. So when we look at our public sector business, that's contributing quite substantially today, but it's contributing almost entirely with Confluent Platform. And a lot of the public sector U.S. with the federal government is cloud and requires that certification. That's something we're actively working on. And as that unlocks, we should see acceleration in that part of the business.

Robbie Owens

analyst

#53

Rob Owens from Piper Sandler. Do you want to square that comment with something Steffan said relative to Flink because you talked about it as an accelerant, but not incremental you held your $60 billion TAM, which is enormous given the scale of the company. But why isn't it incremental at this point? And how should we think about that opportunity? And then Steffan, just to nitpick a little bit. You did raise gross margin by 1 point but operating margin stays the same for the year. So where does our extra point go?

Edward Kreps

executive

#54

I'll take the first part there. So the way we're modeling that in TAM is primarily in -- it shows up in multiple places with stream processing, right? It shows up in some of the cannibalization of older integration tools. But I think the most significant is when you look at that segment of database usage, that kind of 9% of database usage, you could look at that number in different ways, right? You could actually -- if you look at the scope of batch processing that's actually a little conservative. But the most important variable is not, hey, how much of the database market, which is quite bigger, you're taking, it's actually the pace that you take it with. And so the reason we're not calling it TAM expanding as we already have that database segment in there. We already had some preexisting stream processing capabilities. This is really about getting that 9% faster, increasing our probability really winning that segment of it and really bringing that into the monetization of our Cloud. So you can think about it either way, but we kind of had that in scope already, and this is just a better way to get it more quickly. And do you want to take the second point?

Steffan Tomlinson

executive

#55

Yes. On the gross margin side of the house, we have a track record of at least delivering the gross -- the operating margin guidance that we've delivered or beating it. And I would say that from an operating margin standpoint, the kind of like the cushions got a little bit better with the improvement in gross margins. And we've been targeting roughly breakeven for Q4, and that's something that we continue to do and the improvement in gross margin for the back half of the year is something that gives us even more confidence around the breakeven for Q4 '23.

Gregg Moskowitz

analyst

#56

It's Gregg Moskowitz from Mizuho. Thanks for the presentation today. A question for Jay or Erica on TCO and ROI. So you've done hundreds of TCO assessments for your customers, but that's a pretty small fraction of your installed base. And the data that you presented where it was so powerful and I'm wondering how you can ramp up these assessments to drive even more transparency, even more business value?

Edward Kreps

executive

#57

Hey, do you want to take that Erica?

Erica Schultz

executive

#58

I'm happy to take it. Yes, I think it's an opportunity. It's something that we're focused on, especially in this market is going out in a more scalable way to be able to do those types of assessments, so that's actually a key strategy of ours right now. But I think it's important, as we've talked about, both in the sales cycle to prove kind of the why do anything as well as in retrospect, help customers quantify the savings that they've realized yes, we want to do that across more of our base in a more programmatic and scalable way.

Michael Cikos

analyst

#59

Mike Cikos here from Needham. The question for you is like, I understand the pull of real time, right? If we had been batching on a monthly basis, you go down to daily, even micro batching. It just seems like a workaround towards getting more and more towards real time. At the same time, you guys have put in place things to reduce those barriers to adoption like custom connectors. My question is, as you see these customers going on this journey, are some of them altogether skipping going to, let's say, Kafka first and just jumping right to a commercialized offering like Confluent or no. Does the customer journey still require that initial pain point on managing that infrastructure yourself in that Kafka environment?

Edward Kreps

executive

#60

Yes. That's a great question. So one of the things we believe really early on is that the natural journey in cloud -- starts with cloud. It doesn't start with downloading some piece of software, running it, realizing it's hard in switching into indeed, a lot of our customers directly just sign up for our cloud product and start using that as their very first thing. And increasingly, the scope of that product is well beyond just Kafka. And so, we view that as a good as kind of the natural way of starting with the cloud offering. However, as I think Erica laid out and we laid out a number of these presentations, there's a ton of open source Kafka. So that Kafka conversion motion, we also want to be really, really good at. And even for those who are starting with our cloud product in the back of their minds as kind of the alternative would be -- well, we could run it ourselves, right? And so, that kind of TCO analysis, some of the other aspects of that still matter in terms of the differentiation. But yes, it's absolutely the case that a significant portion of our customers are just kind of starting in their first project is on Confluent Cloud rather than a kind of open source adoption and conversion over.

Howard Ma

analyst

#61

Howard Ma with Guggenheim Securities. Could you comment on adoption -- or expansion use cases between Confluent Cloud only. So those who start on Confluent Cloud only versus your -- hybrid. And for those hybrid customers, do they eventually -- do you expect them to eventually all convert to cloud and if they do that, what does that adoption look like? And if you can -- I guess one last part is if you can comment that within the framework of the 5 foundational elements of -- like it's going to be more on the process side on Flink or is it more governed and -- or data sharing as well?

Edward Kreps

executive

#62

Yes. Maybe I can start with kind of what's the end state and then the rest of us can jump in on some of those other aspects. So yes, we -- in the very end state, perhaps everything will be in the public cloud, like all applications but it is very end state, when you look at how enterprise technology moves, it's not that quickly. Many of our customers still have significant mainframe installations, right? And so there's many generations that they've built out. And even, if you look at that graph of the enterprise spend I showed earlier, the cloud was largely in addition to an existing on-premise footprint spend, which has kind of flattened off. But all the growth came in the cloud. And so, why is that relevant for this technology, it has to be where your applications are? And so, the critical question is, when our customers are going to not have anything on-premise. The answer we're hearing from customers is like not anytime soon for kind of large mainstream enterprises. We're open to it. We love our cloud business is extremely successful. If there are -- some of these are growing very aggressively and will be out of their data centers, and that's a real plan for them, and we'll serve them entirely with Confluent Cloud. Some of them will be on-premise for a long time. Regardless that outpost on-premise is very important to actually feed some of these newer investments in the cloud product. And that's a barrier. It's a mood for us covering all of those. And there's a lot of kind of difficulties that are not obvious in how data flows between these environments and making that something that's safe and secure and govern and done well. And it's actually a driver of growth for Confluent as well. So, to your end stage, the short answer is it will follow the overall migration of enterprise to the cloud, which I would expect will continue to be at pace, but we'll still have some on-premise footprint for a long time. And I don't know, if folks want to jump in on some of the other points there.

Unknown Executive

executive

#63

I think just briefly look at our hybrid model that you should think of as an enduring strength. I think that there will always be data in various different outputs across the organization, including in various different places on-prem or other hard-to-reach places. But it's not just about where the data is, it's about where it's flowing to. So, you need to be able to get it from where it is and make it flow to where it needs to go to and all enterprises are complex and disbursed by business unit, by function, by geography, by office. And so, when you look at some of the capabilities that we're offering, we are going to where the data is and the hybrid deployment model you just talked to, but we're also offering these global capabilities on top governance, processing, sharing, connecting. And so, organizations are looking to put their data to work wherever it is, and putting it to work means governing and sharing and pushing it across all the rest of your enterprise. So I think, there will always be those outposts at our global capabilities and the five dimensions you talked about what really adds to that ability of the data to go anywhere it needs to be -- in any way, it needs to be used.

Tyler Radke

analyst

#64

Tyler Radke from Citi. Good to see everyone. So two questions. One, I was with a lot of the slides, it was pretty clear that still a huge opportunity, right? You only have a low single-digit fraction of all the Kafka users out there. I was wondering, if you could give us an update on the success of the paywall change that you made. I think, it was about a year ago. Some of your peers like MongoDB talked about a record number of customer adds this past quarter. How are you expecting that to trend for your business based on what you can see in the top of funnel data? And then second question, just on generative AI. There's been a lot of excitement in the investor community around vector databases in vector search. And I think at your current conference in October, you had some customers talking about how Kafka is used to interact with vector data. So, I was just commenting -- I was curiously if you could comment on the vector opportunity and whether you believe there'll be new products from Confluent to address that opportunity?

Edward Kreps

executive

#65

Yes. Yes, those are both great questions. So on the to the topic of the total customer count, what's the interaction with the paywall? Yes, at that very first step of how do you spend your first few cents with Confluent. Our goal is to make it as easy as possible for customers to get going. That's gone pretty well, I think as we've removed that. That means that a lot of our open source advocacy can now directly start with our cloud product that means that kind of early development can go in many cases totally on monetize. That does have an impact on the total count, but we think probably leads to more 100,000-plus customers over time. We're really focus on getting those sticky customers that are going to persist rather than trying to monetize that early experimentation for the obvious reason in that chart and kind of ramp up that Erica has showed that there's lots of people kicking the tires, they can kick the tires with patching Kafka, but we would rather they kick the tires with our cloud product. The monetization opportunity really comes out, and it kind of starts to get into production. So, we like that's gone pretty well and has really opened up the kind of direct integration of the product into the sales motion, it's something advocacy motions and the self-service side. I do think there's an opportunity to further accelerate customer acquisition. Like, when you look at the raw numbers. Another way of asking the same question is like, hey, 4,000 maybe heading towards 5,000 customers, are you happy with that number as a percentage of the total Kafka count -- and the answer is no, like we think it should be much higher, and that's absolutely what we're building towards. To get to that, it really does mean driving down the kind of amount of friction of adoption of cloud. I think, it's something Mongo has done an excellent job of. And as you do that, you kind of access this broad pool of users, and that's definitely a focus for us. To the second point, yes, -- the key question with AI is exactly what I said. How can I combine the kind of unique and private data of my enterprise, with this kind of general purpose model that has some understanding of the world. And, if you think about how humans do that, sometimes it's by memorizing a lot of facts, i.e., training. But sometimes it's by looking something up and giving the answer out of a book or computer. And both of those strategies are going to be pursued to some extent. The vector databases or direct lookups are kind of looking at up on the fly, and that becomes increasingly important in generative AI. If you look at what people are doing, say, with the open AI, APIs, their newer models don't even allow fine tuning let alone like retraining with your data. So it is all prompt augmentation and what's out either. They're adding some of that stuff in, some of the older models you have it. But it is all about how do I get this data at the right time. And that can be either by direct lookup, where it can be out of a vector database, which allows them or in exact search. So for those who don't follow this stuff, effectively a vector database allows me to take the model of language that by LLM has and apply it on my data for indexing. And so, which of those strategies will win out. I actually don't know. And even within the vector database world, there's like a dozen of them, as well as a number of existing things like elastic and post grass, et cetera, that have kind of vector capabilities. So it's a little hard to predict. The data flow into vector databases is easier to predict. We're predicting Confluent as the mechanism for that flow. And so yes, whether it's an exact lookup -- an exact lookup by aggregating data into a vector database, something else that we haven't thought of, I think that's why I was kind of pointing out that one of the safer bets in what is a very rapidly evolving ecosystem is in the data flow layer, whereas a lot of this other stuff is very much in flux, which makes it a little bit harder to call the future state, whether that's models as a service, models trained by me, vector databases, which vector database, other lookup mechanisms, regardless of which of those kind of forks in the road you take the fact that your data is spread across 117 out-of-date systems in the wrong format and not supporting kind of real-time load and lookups, that's a problem you're going to solve with data flow one sort or another. And, we think the basis for that is going to be component.

Sanjit Singh

analyst

#66

Sanjit Singh, Morgan Stanley again. Thank you for the content today, really appreciate it. I had one question for Erika, then one for you, Steffan, and potentially Rohan as well. For Erica, you talked about really doubling down on higher propensity to consume segments. I [indiscernible] -- define like what the lower propensity ones would be because you've seen the signal an opportunity both in the large enterprise as well as in the mid-market. And so, I wanted to maybe get a sense of potentially where you guys are pulling back from?

Erica Schultz

executive

#67

Yes. I would think of it as segments within of any company size, within enterprise or within commercial, where there's high propensity for data streaming to reduce TCO and have meaningful ROI. So what are the industries that are making big bets on data. So certainly, a lot of start-ups who are digital native, all digital native companies of all sizes. Financial services has been a very high propensity segment for us for a long time. Telco is another one. Public sector has been a great segment for us for a long time across both different branches, including Department of Defense, including different citizen agencies. And so, it's more about -- it's about the size of the company and more about what are the industries and the companies are making big bets on data is how I would think about it.

Sanjit Singh

analyst

#68

Yes. Great. I appreciate that clarification. For Steffan on the margin -- the pace of margin expansion of 5 to 10 points per year. Should I think that as getting to that 5 to 10 up to the target model or is there any sort of considerations on whether some years below that or some years above that? How do I think about that in terms of doing about it? And then, the other part on the expansion business, and I think Rohan on your slides about talking about getting a really high margin expansion segment of revenue, which was encouraging for me to see because there's been other players in the space historically like a Cloudera where it wasn't land and expand, it was land and mini land. It seemed like the customer had to restart its investment, after every sort of use case. So as you go from flood detection to inventory management to whatever the use case was, it seems like it looks very costly for customers to do. So, I was wondering either you could sort of break down the drivers of that margin expansion for the expansion business.

Rohan Sivaram

executive

#69

Sure. I'll take the first one. So the midterm target model of 5% to 10% operating margin is something that we're going to get to just like over time, but the rate and pace once we get to that midterm target model is really going to be dependent upon our ability to grow our top line revenue. And if, we can grow at elevated rates, we're going to show what I would just call, very modest operating expansion on our path towards getting to our long-term model. If revenue growth rate were to slow, fairly dramatically for a long period of time, we would have a faster rate to that long-term operating margin target. We think that the market is big enough where it's a very large market. There's a massive do-it-yourself Kafka market out there as well. We have differentiated technology. We have this great customer growth go-to-market motion coupled with what we think is the best engineering and product team in the space. And so, want to play to win, but we're also delivering this robust growth while improving margins and that rate and pace of margin expansion is really going to be determined on that top line growth rate.

Steffan Tomlinson

executive

#70

Yes. To your question, I think there are a few concepts that you need to put together to come up with the answer. And I'd say the three things I'll talk about. Erica touched on her presentation, 50% of cloud customers start from [indiscernible]. And then when you double-click into that, 90% of our commercial customers are essentially start from [indiscernible]. And then you couple that with our cloud net retention rates are greater than 140%. So that's the three that basically tells you that a lot of our cloud customers start small in a frictionless manner with fairly low sales search, right? And that journey over a period of time is more profitable for the company in general. And typically, as you go to year 2 and year 3 because of the expansion side of it, the incremental dollar of expansion is less -- it's less cost clear than the incremental dollar of land. So those concepts like put together is basically where my comment came from. And I mean, from a customer standpoint, of course, like consumption that we drive the network effects and sharing of use cases, streams of data that being picked up by other teams. That's essentially what drives the incremental consumption that you see.

Unknown Analyst

analyst

#71

To the cloud makes that easier going to Phase 1 use case.

Edward Kreps

executive

#72

Yes, I think that's absolutely -- so if you compare us to like a Cloudera, that's -- I think there's a number of critical differences, but that's one of them is the facts that you can just continue that expansion in the development team with no kind of infrastructure effort, ordering server set up operationalization. So a company can standardize on a technology and it could become just one of the ingredients used in virtually every application to some extent to that expansion can happen over a very long period of time to applications are built and rebuilt and could be done without kind of big set up and wait time period.

Unknown Executive

executive

#73

Okay. We've got a last question here.

Matthew Hedberg

analyst

#74

Matt Hedberg, RBC. This has been great. It seems like you guys are squarely positioned for gen AI. And I guess the question is, in the future, do you see a way to or quantify the potential benefit that customers are attributing Confluent to gen AI? Is it API calls in some form of mechanism of monetization? And secondarily, Erica, when you're talking to these $10 million customers, what -- where are they in their journey? Do you see the path to a $20 million, $30 million, $50 million ARR customer on sort of the high end of the Fortune 500 scale?

Edward Kreps

executive

#75

Yes. I'll speak to the first half and then maybe others can jump in as well. Yes. So we had as Steffan said, we haven't included anything in the TAM around gen AI. And I would say at this point, it's just really hard to put like a number that we felt we could stand behind around that. It doesn't mean, it, won't be valuable. It just means we don't know what it is. And then, I do think that there are opportunities for additional product features that will help capture that make these use cases easier play to the patterns. In part, we want those to kind of co-evolve with customers and with the ecosystem as that kind of settles out, and we see people using it. That is one of the nice things about these kind of open popular development platforms to some extent, your customers and the larger community around the technology kind of shows you what to do, you just kind of watch what they're all doing and then eventually fold in right thing rapid than trying to guess upfront, in a very fast-moving ecosystem, and doing something that's totally off base that you end up supporting for eternity. If anybody wants to add on to that area, answer please.

Steffan Tomlinson

executive

#76

Just one thing that I would add is like when you look at gen AI, it's squarely in relationship to what we've all been talking about streaming and Flink. Consider when you're chatting with a gen AI bot, it needs to know everything about you, but not just everything about you at the moment, you began the conversation and needs to know everything about you, as the conversation is happening. That's the whole point. That's what you talk to an agent for consider an agent for travel or an agent for any other type of business process. So when you, for example, are working with a bot. You're asking about your travel, you're asking it to change your flight, you want to know if your bags moved. You want to know, if your rental car is going to be updated. You want to know all the follow-on effects of the actions that you're taking during your chat. So it might be obvious that you would need a platform that does streaming to bring together the profile of your flights, the flat that exist opportunity for changing flights, your baggage, your rental car, all of that stuff has to be brought together, that might be obvious. Now to do that, you have to have the streams, and then you have to use something like Flink to bring it together to basically bring together all of those existing streams to produce, a view of you. And then, as time goes on, you still have to update that during the conversation. And so, I think I'm just trying to illustrate that streams themselves are one thing. Stream processing is like yin to yang of streaming. It takes these existing data streams and make some super rich, super powerful. And then, giving that to a bot is what makes an experience, feel like it knows you, feel like it can actually take action in the real world and do really powerful things. So, your question is how will we quantify that, monetize that. One thing, you'll see is what we already see, which is that streams are valuable, streams driver use. When streams are joined and enriched. They create even more valuable streams. So, kind of it's a virtuous cycle where you get more and more consumption of those streams and the streams become higher and higher value. Now will we be able to directly kind of productize pieces of that. I think as time goes on, we'll be looking more about how the landscape evolves.

Edward Kreps

executive

#77

And. It's worth mentioning that Flink itself has a pretty rich and popular machine learning tool set already.

Erica Schultz

executive

#78

Great. I'll take the second part of your question around what's the potential we see at the high end within, say, our Fortune 500 customers. Today, I shared the example of a customer who's currently at about $10 million in consumption and we can see in the near term 15, that customer is not a Fortune 500 company, not even close. And so that's just -- just to put that example in context. When we met at current last year, I talked about another customer example. Currently, already in the 10 million-plus cohort, and we -- this is a large global bank. And we shared that we have direct line of sight to 20 million-plus for that customer, and I'm confident that there is potential for us beyond that, but that's kind of what we have line of sight to now. So, we're very comfortable with that $10 million on average ARR potential across the Fortune 500 because we do see it play out customer by customer.

Unknown Executive

executive

#79

All right. Thank you. This is the end of our Q&A session. We will now transition to a customer panel with the three amazing customers.

Stephanie Buscemi

executive

#80

All right. As we wrap up at the day here, saving the best for last, which is our customers. It's always great to hear from Confluent and everything we're up to. And hopefully, you've got a lot out of that today. But ultimately, it's the customers who have the challenges, and are solving the problems and driving value in their business. And today, we have three amazing customers here with us. I'm going to have them do proper introductions in a second. But I just wanted to give you a backdrop of what we're looking to cover on this last session. One, we just want to make sure you know a bit about the people in these companies and how they're engaging with data streaming and Confluent. Many of you have asked us about the personas. Who are you talking to in an organization, who are responsible for these roles. You know the obvious ones like Kafka developers, but executives in the organization. The second is, many of you have asked us about how does this tie to the business. So you can give us all the technical underpinnings, but connect this to the business value and what are the business challenges that you're trying to solve. And that's what we want to have in this conversation here. We then, want to get into why Confluent over an open source option or a competitive option. And these folks are going to speak freely and candidly about that. They're going to talk about the value they're driving in their organizations to the extent that they can publicly here. I will say that they're going to be joining us at the reception afterwards. So they're going to be available to talk to you all one to one as well. And then, we'll look at moving forward with data streaming in terms of what are the possibilities, what are the things that can move forward with [ NEXT ]. So with that, let's dive in. We have here Mahesh from McAfee. We have Sarvant from Penske Transportation Solutions and Babak here from Vimeo. So I will kick off with you, Mahesh, tell us a little bit about yourself.

Mahesh Tyagarajan

attendee

#81

Thank you, Stephanie. I'm Mahesh Tyagarajan. I'm currently the VP of Platform, Architecture and Protection Tech at McAfee. It's all the back-end side of the house. So going back maybe about 10, 12 years, I've been focused on taking monolithic code bases to the cloud, breaking them up into micro services and starting to deploy at scale in the cloud. So walmart.com that you see today was my team between 2012 and 2015. Subsequently, I had a similar role at Kroger. And after Kroger, I've moved to McAfee, and I'm doing something very similar on breaking up, what was there for about 30-odd years. And we have actually transitioned all the workloads to the cloud. But were repeal of that as micro services. And this is the year we're migrating out of the monolith into those micro services that have been built. And I've at 3 hats. One is the platform engineering. The other is architecture, as we actually change for all of the stuff that is going on right now. And the third is all the back end protection stuff like VPN, antivirus, identity protection and so on and so forth.

Stephanie Buscemi

executive

#82

Thank you. We'll come back in a moment on about McAfee, what you guys are up to these days. But, Sarvant tell us about yourself?

Sarvant Singh

attendee

#83

Absolutely. Happy to be here, Stephanie. I'm Vice President, Data and emerging Digital Solutions at Penske Transportation Solutions, where I lead the creation and implementation of data management, IoT, AIML, generative AI and broader IT transformation strategies. I'm an industrial engineer by training from IT [indiscernible], and I got my MBA from Vanderbilt. I work for companies like Tata Steel implementing ERPs. And before Penske, I led information management at Cummins. I've been working in the data analytics and on the cutting edge of technology throughout my career.

Stephanie Buscemi

executive

#84

Thank you. And Babak from Vimeo.

Babak Bashiri

attendee

#85

I'm Babak. Director of Data Engineering at Vimeo. I've been working tech for close to 15 years, 9 years in data analytics data engineering. And currently, I leave the data engineering, data ops and data platform at Vimeo and all those 3 teams in track with Counsel and we're at different capacities.

Stephanie Buscemi

executive

#86

Well, now that you've got a little bit of backdrop on each of them as individuals. Let's dive into each of your organizations. I think some of us -- most of us probably know all these names, but I know in my time interacting with you guys, I learned a lot more about what each of you are up to. So maybe Babak, talk to us about Vimeo. Most of us have seen a video I know a little bit, but you're a pretty innovative tech company, maybe tell a bit about your business priorities, and what you are up to as an organization.

Babak Bashiri

attendee

#87

Absolutely. So Vimeo will take pride in providing an all-in-one end to end solution and video solution for our customers. And, we have transformed from more of a viewing platform to software as a service company and currently, we serve a variety of customers from creative individuals to small -- to marketers, small businesses and enterprises as well. And we have over 206 million active users, 1.7 million subscribers and billions of video views every month. So as you can imagine, we collect and consume a lot of data.

Stephanie Buscemi

executive

#88

Fantastic. And Sarvant, I was guilty when we got on the call and talking of thinking of the yellow trucks for Penske, but I learned here a lot more than that. Can you tell us a little bit about Penske Transportation Solutions as a whole?

Babak Bashiri

attendee

#89

Absolutely. So, at Penske, we love our yellow trucks, but we are definitely more than that. We provide innovative transportation and logistics solutions that we believe are vital to the success of our customers. Our product lines include full-service leasing and contract maintenance. We have a truck rental service that provides both commercial and consumer truck rentals in North America. We also have a logistics business. Penske Logistics is a leader in logistics and supply chain management, and it is our wholly owned subsidiary. We operate more than 400,000 trucks and serve our customers from 1,400-plus locations. We are an information intense business with a lot of information getting exchanged between our customers, associates and partners. And in our industry, vehicle uptime and supply chain visibility is extremely important.

Stephanie Buscemi

executive

#90

Fantastic. Thank you. Mahesh?

Mahesh Tyagarajan

attendee

#91

As many of you might know, McAfee is in the protection of you as a person, your devices. So we started off at [indiscernible], but as your data and your identity and so much of your information is out there online. We provide that holistic protection of you as a person identity, your credit, your data privacy. If you go look up, how far your data is in all the fun places of the web, you will be surprised. So, we take care of you and your identity and your family. And that transformation, we're going to actually 3 different transformations at the same time. One is the product mix that goes from just the devices to protecting the person an account could have a family, could have a household, could have a lot of members associated with it. So that's one aspect of it, which is all aspects of your identity and data apart from just the devices. The second transformation we're undergoing is we have a technology lift and shift that we have done, but we are going through this micro services transformation. We're breaking up the model so that it allows us to have better velocity. And that comes with all of the joys of like taking something and making it a cloud-native architecture. The third is also our end point that is really on the devices have gone through a significant [ updrift ] in the last couple of years. So even if you're using it unbeknown to you, it's gone through a pretty big change, and you just see something -- a new product show up in your hands as we've actually been able to deploy that out. So a lot going on right now at McAfee in that respect.

Stephanie Buscemi

executive

#92

You had shared with me, and I think it's pretty cool that you had over 100 million active endpoints right now. I mean that's a lot of data and maybe we bridge now, into talking about data and data streaming and then why Confluent. So, let's talk about those 100 million active endpoints and how you're thinking about data streaming to address.

Mahesh Tyagarajan

attendee

#93

Absolutely. So, we have active endpoints on PCs, Macs, all the phones. We sell very significantly through our partners in the OEM retail an ISP world. So it comes pre-installed then you become a customer. So there's a huge amount of data that's making its way back. It's both telemetry and business metrics. So, we can know to actually protect you better by looking at samples or by looking at events that happen in the endpoints. But that's one part of it. The main reason we chose Kafka and Confluent was the move to the micro services architecture. We are completely decoupled. We have domains that had 450 microservices right now, we were 300 micro services about 3 months ago. So -- and by the time we're done with the migration would be about 1,200 micro services. And these are completely decoupled. They don't call each other understand the same domain in the domain space. Like user is different from a card, which is different from the subscription and so on and so forth. So in those cases, we actually event between the various domains and various services. So somebody will say there's a subscription and the protection domain will say, "Oh, I have to protect your identity. So we'll subscribe you for identity protection when the subscription domains as like there's a new subscriber that's shown up, things like that. So that was, I think, use case one. We were already using it for the business metrics and the telemetry that was coming in. We are also using it for reporting and analytics. So the microservices body cannot do ELT or ETL very well. So we generate events, and that becomes the source for how the data is viewed from analytics, how we report into our financial systems and so on and so forth. We also do that same thing for real-time business events. And more interesting thing, and like it's usually the forgotten word migration, and it's very difficult to actually migrate large amounts of data. So, we are using the venting system for that as well as we actually go through with the migration.

Stephanie Buscemi

executive

#94

So when you say a venting system, maybe share with the group where you started, what were you using and then your path to Confluent and why Confluent?

Mahesh Tyagarajan

attendee

#95

Sure. The micro services was a greenfield build-out. So we chose Confluent directly for the micro services build-out. It so happened that my architect came from prior experience having worked with you so that helped bridge the gap. But we did look at -- we were running open source Kafka before. We looked at why we would do this. And one of the things that we chose Confluent is the manageability, the governance, the security. And more so, I would rather look put my engineers to look and work on protections that then to be fully brought up to speak Kafka it's difficult to manage. So taking that difficulty out of our hands and putting it to the hands of the experts gives us the ability to rely on it and redeploy our resources into what we do best which is protecting you as a customer.

Stephanie Buscemi

executive

#96

Thank you. Sarvant, path to us in the same vein, the journey at Penske and turns into streaming data and how you got started and then how you arrived at working with Confluent?

Sarvant Singh

attendee

#97

Let me share an interesting statistics first truck fleets deliver approximately 80% of the freight in the U.S. and the supply chains are extremely lean. Customers who are operating trucks, they have a need to know where, when and how their trucks are operating. Similarly, those customers who are waiting for their freight to arrive, they have a need to know where and when the freight will be delivered. And without having access to real-time data. It's a huge issue. Disruption to the movement of freight can have a domino effect, whether it is by truck rail or ship, we all saw during the pandemic how disruption to supply chains had a domino effect and led to scheduling delays at manufacturing plants or stock-outs in a local grocery store. So real-time data helps trucking companies and those who depend upon them in the supply chain make adjustments in real time.

Stephanie Buscemi

executive

#98

Perfect. And maybe share with us about your use cases, how you're utilizing streaming today as well as your journey, starting with open source Kafka and moving to Confluent.

Sarvant Singh

attendee

#99

Sure. So I'll start with the journey of going from traditional data warehousing to real time because this is something that limited us to making decision with a data lag when we are only relying on daily batches of data, and you're limited to midterm and long-term decisions as opposed to being agile to pivot, to change decisions or make decisions quickly after a product launch. I would say we have two major use cases for real-time data. One is Vimeo as a video platform. We have a reputation in the industry as a high-quality video platform, and we want to maintain that and improve that. And part of that is the experience of the user, when they're viewing video when you're playing the video watching the live stream. And that it's important to get that to collect data. And in real time, for example, the network connection of the user or any information from the user platform or the infrastructure in general to adapt the video quality, for example, the packet sizes that are delivered to the customer to provide that seamless video experience in live stream and video playback in general. And secondly, we -- as a SaaS company, we are interested in understanding user behavior and to make product decisions and that is to help improve and optimize the user experience to understand how -- what's the user journey like? And based on that, we can optimize that user experience and also help user with product discovery, for example, so they can get the most out of Vimeo as a software. And this is -- again, we are getting the entire demand from product managers, product marketing people, marketing and analytics folks that they want to understand how a campaign is performing, and in real time. They don't want to wait a day when a campaign is launched, when we launch a new product, when we are running an experiment and AD test, we want to really be able to pivot if we need to or make decisions quickly. That time of the decision is really important for Vimeo as a SaaS company. And overall, with that would drive growth and also would help optimize the cost that eventually impacts Vimeo's profitability and bottom line. So that's why it's getting -- it's important already. It's a mission-critical thing for Vimeo to have access to real-time data and analytics and -- but the importance is growing. .

Stephanie Buscemi

executive

#100

Fantastic. One of the things, as all of you are on your path with streaming data and with Confluent is maybe sharing with this group, how it's impacting not only the customer experience, but the business value it's driving. And I'll caveat by saying this is a public forum, and I know that this question is within limits of what you probably can share. But I think, to the extent you're comfortable with the group here just talking about the changes in your business that you're seeing, the positive changes that you're seeing in your business, the results that you're seeing in your business, as a result of streaming and Confluent. I think that would be helpful. Like what are the gains that you're getting out of your business as a result? Mahesh, do you want [indiscernible] it off?

Mahesh Tyagarajan

attendee

#101

Yes. I think I'll hit the nonfunctional area and then it's a business benefit and allowing my team to redeploy resources from what they were doing to support, what we were managing on our own to be able to free up that resource to be concentrating on looking at other things. That was one part of it. The second part of it is we do real-time metrics far more. We had a small number of real-time metrics, which we were doing through ELTs with low databases and then pull from there and then following some metrics. Now, pretty much everything that we do is through events. So we've built this real-time event like, if you will, and we can run -- if we use [indiscernible]. And we're now in the process of building out, oh, you ask me any question. I'll give you an answer, and I can give you a real-time answer. Does the use case for the long tail and the longer lived analytics in the lake, but there is also this real-time metrics that we can see right away, if card conversions are changing, we call it ATP or ATUs activation to pay. So all these metrics are very important to us. So, it allows us to monitor a lot more and slice and dice a lot more because we are able to bring that into real-time system. And I'm not getting into net numbers per se, but you can imagine, if you had a few metrics and then you have a lot more you can certainly look at all the dials and turn it differently.

Stephanie Buscemi

executive

#102

Thank you. Sarvant?

Sarvant Singh

attendee

#103

Absolutely. So for our customers, vehicle uptime is extremely important. As I mentioned earlier, before real-time data streaming, if there was a problem, and let's say, a driver is experiencing an issue, it was very difficult for us to find out where the truck was located and what was happening to it. Now, with real-time data streaming we can predict when a truck is going to fail. And in case of failure since not all failures are preventable. We can real-time data streaming helps us track where the truck is located, what's happening to it, what kind of halt codes it is generating. And we can monitor the health of the vehicle remotely. And that's where the value lies, real-time data streaming is also helping us deliver a better driver experience. And it is making the technicians job easier which is especially helpful given the technician shortage the transportation industry is facing. So there is definitely value -- significant business value being able to understand this sensor data on a real-time basis.

Stephanie Buscemi

executive

#104

Yes. Thank you. Babak, how about you talk to us a little bit about the benefits that you're deriving from Vimeo?

Babak Bashiri

attendee

#105

Sure. As I mentioned, what we are currently getting value out of is the real-time monitoring of our campaigns, our product launches, experiments and campaigns. But if I go beyond that, what we want to achieve and we're looking into is to go beyond analytics and having a human in the decision-making process and enable real-time on automated flows when -- and by that, I mean streaming the data and analytics back into our product and operational systems, that means adapting the user experience in an automated way by providing recommendations. But, changing the user experience, making it customized and more personalized. And also for marketing campaigns, whether it's an e-mail marketing campaign or an in-product messaging one. You want that to be customized and also in real-time, I give you an example, if the user signs up the first actions matter. And if we wait a day to send a message to user to, for example, upload a video or show them how to do it, how to create a video that wouldn't be as impactful. And that's why in these automated so that can be unlocked and enabled by real-time data time point. So that's an area that we're looking into. Another area is real-time enrichment of data. I mean, we have our data warehouse that has been hosting a wealth of data. But again, what we need when we are talking about enabling operational systems. That's where we want to be able to enrich data in real time because that will takes the load away from our application that improves the performance of our application and also again, as I mentioned, will create that impact in real time, but this is where we are looking into implementing using stream designer and then inflate that would enable these opportunities?

Stephanie Buscemi

executive

#106

Thank you. That's great. As we start to wrap up here, I would love for you guys to share forward path for you guys on your streaming journey. You've talked a little bit about everything from micro services to analytics, to the supply chain. I mean how data streaming and how Confluent are helping to drive your business around those? It would be interesting to hear any future use cases you're thinking about? And your path forward. So maybe Mahesh, let's start with you.

Mahesh Tyagarajan

attendee

#107

Yes. This year, we'd be using it extensively for just our data migration. So, we have an enormous user and subscription base. We have to bring them over, so that could be the most immediate tactical use case. And then, we would like to move our data closer to where our customers are. So I'm hoping that at some point in the future, we use that the geofence the data and various locations. We do business in 46 different countries. We will -- we are in this baby step, right. First get to the cloud, get in one place and get to many locations and so on and so forth. So that's the journey that will be very much looking forward to using this as a methodology to get us there.

Stephanie Buscemi

executive

#108

Fantastic. Thank you. Sarvant?

Sarvant Singh

attendee

#109

Absolutely. Transportation technology is evolving very, very fast. And we will see this industry transform itself, as we see more digital innovations. We will see a lot more gate data getting generated as more EVs, electric vehicles, join the fleet. We will see the change in the behavior in terms of as vehicles get used as more autonomous or say, my autonomous vehicles rollout. There will be a lot more predictive maintenance happening. We may also see what is inside the truck that getting monitored on a real-time basis, for example, real-time monitoring of temperature inside temperature of products inside the refrigerated unit. So we may see that happening. The combination of artificial intelligence, IoT, real-time data processing and computing on the edge, that is going to expedite this race to the left and I see business and technology architecture is evolving accordingly. I believe that's going to unlock a significant amount of economic value in this sector. So overall, I see transfer a lot of very exciting things happening in the transportation technology world..

Stephanie Buscemi

executive

#110

Fantastic. Thank you. And how about Vimeo, what's next, what's on the horizon?

Babak Bashiri

attendee

#111

Well, part of it is that the opportunity exists with providing better video experience for our users. And that's something that we are leveraging, but there's so much more we can do in terms of monitoring infrastructure, for example, looking at the network latency average playback time and to identify issues, if there is an infrastructure issue to identify and resolve it in automated flow. And that requires analytics in real time because we are talking about milliseconds of we can't afford minutes even when we are talking about them providing that experience to users. And the other is, again, related to video is content and delivery. And we have automatic content delivery network selection, for example, understanding and the reading user preferences and adjusting the video according to that is again is something that needs to happen in real time. And when, we are talking about billions of views and millions of users that requires robust and scalable and reliable way of processing that data in the stream.

Stephanie Buscemi

executive

#112

If real-time insights means no buffering of videos I'm happy. And I think we all will be very much.

Babak Bashiri

attendee

#113

Livestream as well. And I have the -- mentioned about the use case of AI and video, currently, it is being used in creating videos you can -- with a few clicks, you can pick a template and create a professional level video within minutes. But talking about the future, the general AI and the concept of text to videos instead of the way that we interact with video creators, we do created tools, editing tools can change where we can describe what we want video as a marketer, for example, or how we want to edit it. And then that is -- and then the tool generates it. And that's, again, go back and forth with the data, the processing of data in real time is important because we can afford that kind of delay in batch processing.

Stephanie Buscemi

executive

#114

Fantastic. Well, I want to thank all three of you for your time today. Again, all of folks here will be with us at the reception for any one-off questions that we can ask, but I know it's a massive investment of your time to come here, be here. I'll probably state the obvious, but it's not always easy for customers to also speak at an investor conference. So many thanks to you guys, and we really appreciate the partnership, if we can give them a round of applause. So with that, we are wrapping up today. A couple of quick housekeeping things for all of you. [Operator Instructions]. Thank you.

Unknown Executive

executive

#115

All the slide decks are now on our IR website. So feel free to go there and download them. And thanks to those joining us again online. Take care.

Confluent, Inc. (CFLT) Earnings Call Transcript & Summary

Earnings Call Speaker Segments

Shane Xie

Edward Kreps

Shaun Clowes

Chad Verbowski

Shaun Clowes

Shane Xie

Brad Zelnick

Edward Kreps

Chad Verbowski

Edward Kreps

Sanjit Singh

Edward Kreps

Michael Turits

Edward Kreps

Michael Turits

Edward Kreps

Jason Ader

Edward Kreps

Shane Xie

Shane Xie

Erica Schultz

Matt Yanchyshyn

Erica Schultz

Matt Yanchyshyn

Erica Schultz

Matt Yanchyshyn

Erica Schultz

Matt Yanchyshyn

Erica Schultz

Matt Yanchyshyn

Erica Schultz

Matt Yanchyshyn

Erica Schultz

Matt Yanchyshyn

Erica Schultz

Matt Yanchyshyn

Erica Schultz

Matt Yanchyshyn

Erica Schultz

Matt Yanchyshyn

Erica Schultz

Matt Yanchyshyn

Erica Schultz

Steffan Tomlinson

Rohan Sivaram

Shane Xie

Michael Turrin

Steffan Tomlinson

James Wood

Edward Kreps

James Wood

Edward Kreps

Robbie Owens

Edward Kreps

Steffan Tomlinson

Gregg Moskowitz

Edward Kreps

Erica Schultz

Michael Cikos

Edward Kreps

Howard Ma

Edward Kreps

Unknown Executive

Tyler Radke

Edward Kreps

Sanjit Singh

Erica Schultz

Sanjit Singh

Rohan Sivaram

Steffan Tomlinson

Unknown Analyst

Edward Kreps

Unknown Executive

Matthew Hedberg

Edward Kreps

Steffan Tomlinson

Edward Kreps

Erica Schultz