Consensus Cloud Solutions, Inc. (CCSI) Earnings Call Transcript & Summary
September 7, 2023
Earnings Call Speaker Segments
Laura Hinson
executiveGood morning, and welcome to the Consensus investor webinar on AI and interoperability. I'm Laura Hinson and joining us today are John Nebergall, COO; and Jeff Sullivan, CTO. Before we begin our presentation, allow me to direct you to the forward-looking statement on Slide 2. This call and webcast may include forward-looking statements. Actual results may differ materially from the forward-looking statement projections. Additional information concerning the factors that could cause results to differ materially is contained within the discussion and documents. Thank you, again, for being with us today as we dive into this complex topic. And now I'll turn the presentation over to John Nebergall.
John Nebergall
executiveHi, my name is John Nebergall, and I'm the Chief Operating Officer of Consensus Cloud Solutions trading on the NASDAQ under CCSI. As a quick introduction to Consensus, we're a relatively new public company, who in October of 2021 was spun out of the former J2 Global, now Ziff Davis. The company has been doing business in secure document exchange for over 25 years. And we go to market with our Conductor, formerly Summit Exchange, interface engine, our eFax brand cloud fax service, jSign digital signature and the Clarity transformation engine. Healthcare is far and away the largest sector that we serve, although we do also do measurable business in financial services, legal and government. Our work on Clarity in particular, has brought us heavily into the automated intelligence space. And given that development direction, today, we'll be talking about AI in healthcare. I will be presenting today, along with my colleague, Jeff Sullivan, Consensus Chief Technology Officer. So let's go. A quick review of the agenda here. I'll talk about the messy state of healthcare interoperability and a specific problem in that space that it impacts millions of patients. Jeff will then walk through how the application of AI technology has offered an elegant solution to a problem that's been in place for at least the last 20 years. Finally, we'll speak about the opportunity here and how we go to market. Let's move to Slide 5 and get started. Healthcare has been struggling with the ability to exchange information for as long as the electronic health record or EHR systems have been in the mainstream. Healthcare is not just a single industry, but really encompasses many commercial areas. You have hospitals, clinic, insurers, public health, social services, pharmacy, labs, home health, skilled nursing facilities, the list goes on. The number of players creating and storing information is huge. Difficulty arises from a wide range of technologies and data structures used by the hundreds of electronic health record, ERP, financial claims and research systems in use today, resulting in incompatibility for information exchange. The effort to establish standards for information exchange in healthcare has been limited in its effectiveness because no single standard has yet emerged as a clear choice for communication. A protocol called Health Language 7, HL7 for short, defined some parameters for information exchange, but leaves a large area open for individual interpretation. A subvariant of HL7 called Fast Healthcare Interoperability Resources or FHIR, was designed with API-to-API connections in mind but remained very finicky in how it's used and only about 10% to 15% of the messages use it. Finally, a closed members-only e-mail service called Direct Secure Messaging, or Direct for short, allows patient records as e-mail attachments. Now none of those 3 technologies really speak to each other, and no single protocol has emerged as the clear standard. As evidenced by the explosion of data you see illustrated from the Stanford Health Research report in the lower left, the amassed repository of information is growing at an accelerating rate, delivering quality care in today's environment relies in part on sharing that data. This valuable information can greatly improve patient care and allow providers to coordinate efforts for the benefit of each and every one of us, but the lack of a standard protocol for EHRs remains a hurdle. If you ask why healthcare uses fax and they fax a lot, easy information sharing is the reason. In contrast to the electronics, [indiscernible] that is our current interoperability environment, fax is reliable, secure, inexpensive and it verifies receipt. While we can define the problem in technical terms and think of the issue as just one more engineering challenge, there is a practical reality here that affects literally every one of us because we're all patients. While in many industries, market forces have driven competitors to compatibility, the health IT community seems as far away as it's ever been to share data across platforms as an industry goal. Another wrinkle here is that EHR systems need to communicate with a whole different set of vendors who provide pharmacy systems, claims adjudication systems, prior authorization systems and systems specialized in the post-acute segment. These core systems are embedded with long roots into the organization. So changing how they communicate can cause a significant ripple effect that breaks workflows or impacts how transactions are fed into billing systems. As patients, the impact is felt as we traverse the system. Getting a referral to a specialist, receiving your explanation of benefits or waiting for approval of a procedure or medication can result in delays of weeks and sometimes months to patients seeking care as they wait for manual processes to be executed because the EHR doesn't talk to the payer system in a language it can understand. Our recent Deloitte survey was clear, the physicians expect that these systems share data efficiently. Now if the difficulty in getting modern systems to talk to each other is enough of a hurdle, there's one more deep institutional wrinkle that comes to bear. Our healthcare system is a network of care settings, different settings for different care delivery, hospitals for acute episodes, doctors' offices for checkups and the urgent care center for manageable industries or sudden bouts of sickness, you get the picture. Now it is likely that most of the places you think about when you think of healthcare providers are working with a modern EHR system are doctors' offices, hospitals, multi-specialty clinics [ and lab. ] In fact, well over 90% of those care settings have an EHR capable of processing structured clinical documents. While we have been just through the abundant and growing use of fax in hospitals, there is an entire additional set of providers that come into play. These are care settings that largely do not have EHRs but still see millions of patients every week. In 2009, the government funded EHR purchases for hospitals, clinics and academic medical centers. At the same time, many care settings were not allowed to participate in the funding specifically excluded from participating in the EHR funding program. No skilled nursing facilities, long-term care sites, midwife birthing centers, home health, hospice or community-based nonphysician practices like an addiction rehabilitation center could qualify. Ironically, the practical outcome here was that the types of care settings that were most likely able to afford EHRs, got them largely for free, while those least able to afford them were excluded. And along those fault lines, you see the use of tools is decisively different. And while fax is used throughout the healthcare industry, the technical core not excluded, you find even more prevalent fax use in post-acute, home-based community clinics in rural areas are able to get the help to afford an EHR, that's the dynamic that sets up a rather interesting problem. So here, we see the traffic pattern that's developed between those within EHR and those without. Importantly, what I'll call the outer rings of healthcare here have a great need to interact with the central core. Many of the expert specialty providers, much of the sophisticated diagnostics equipment payers, surgery centers and hospitals are in that technology core. And while those on the outer rings use fax traffic as their lifeline and fax is what's called unstructured data, which generally readable by humans, but not by machines, that's the language that those outer rings speak. Conversely, structured data or machine readable data is essential to have EHR systems that deliver their optimum value. So in the absence of some trillion-dollar funding initiative for the outer rings, the receivers of this fax traffic typically shoulder the administrative task of manual entry of any of the messages that are received via fax. These messages are then filed electronically into EHR. Sometimes, they'll just directly file the fax document as a document in the EHR but that's still not fully optimized. Generally, the best way to get the most value out of your database is to key that fax in and put it into a state of structured data. This is where the application of AI to interoperability can be positioned. Unstructured data creates administrative burden. Structured data, the language of EHRs, requires expensive and complex systems that are not easy to maintain for those without a dedicated IT department. Care settings without that economic strength still need to communicate with the large organizations using EHR systems and overwhelmingly use fax to fill that need. And while effective for human communication, fax is unstructured data, is not optimized to allow EHRs to operate at their highest efficiency. The difficulty is further amplified when you realize a wide variety of forms, handwritten notes and narrative prose that defy easy standardization. It's into this messy environment that we put our new technology to work. So now let me pass this to Jeff Sullivan, who can tell us the way AI is applied to solve this problem. Jeff?
Jeffrey Sullivan
executiveThanks, John. Let's start with the whirlwind tour of the basics. Artificial intelligence is a blanket term describing the development of computer systems to perform tasks we typically think require human intelligence. At a basic level, AI makes use of certain technological building blocks, the most common of which are expert systems, machine learning, neural networks and deep learning. Expert systems are specifically programmed to deliver expert level performance on a very narrow problem. They're useful for very structured problem spaces. Machine learning on the other hand, is a method of teaching computers to learn from data without being explicitly programmed. There are a number of different types of algorithms used in ML. The 3 most prevalent are neural networks, which are artificial constructs that mimic human brain function and are trained by presenting inputs and expected outputs for the network to learn. Deep learning, which uses multilayer neural networks to learn more complex task and behaviors and genetic algorithms, which mimic evolutionary behavior to select the best solution to a problem from a family of options. These techniques are often combined to construct the more advanced AI technologies like natural language processing, speech recognition, computer vision and especially large language models, which are having quite the moment right now with the explosive popularity of products like ChatGPT. Natural language processing, or NLP, is a major area in AI that focuses on enabling computers to understand and interpret human language, most commonly applied to the written word. NLP is used in a wide variety of applications, including talking with Alexa or Siri, text chatting with bots on a website or reading and responding to customer service e-mails. To convert written language into spoken language or vice versa, we make use of speech synthesis and speech recognition. Unlike earlier technical approaches that made use of explicitly programmed algorithms with limited success, AI speech makes use of machine learning techniques to convert between spoken language and text with much more fluid and natural results. Just as NLP deals with textual or spoken communication, computer vision is a field that empowers computers to recognize objects, detect patterns and make sense of data from images and videos. This technology has broad applications, including autonomous vehicles, surveillance systems, quality control and manufacturing and medical imaging. Computer vision can also be used to convert images of text into actual text, which can then be processed using NLP techniques. And similarly, text generated by NLP systems can be fed into speech generation to produce fluid speech. So you can see how these fields can be combined to perform even more complex tasks and workflows. But one of the biggest breaking topics in AI is something we're going to dig into on the next slide. Large language models are one of the most interesting or at least the buzziest things happening in AI right now. Their advanced AI models that have been trained on vast amounts of text data to understand and generate human language. We're talking millions or even billions of documents. These models use deep learning techniques and neural networks to learn patterns, semantics and context from the data. We think of LLMs as foundation models, which means they can be used as the starting point to develop more specialized models. An important distinction in large language models is the difference between extractive and generative models. These models play a pivotal role in modern natural language processing, automation and understanding the difference between them is crucial. Let's begin with extracted models, like the systems shown here from Google and Facebook. These are specifically designed to as their name suggests, extract information or provide summaries of a body of text. They can identify and select key sentences or phrases, effectively condensing the content. Think of them as efficient filters that capture the essential elements from a given document or body of knowledge. Extractive models are invaluable when large volumes of information need to be distilled into concise summaries. They find widespread use in news aggregation, content curation and even chatbot applications, providing succinct responses based on available information. By focusing on extracting the most crucial details, these models ensure the preservation of the original text's core essence. On the other hand, we have generative models like ChatGPT. Unlike extractive models that rely on summarizing existing content, generative models possess the -- at times startling ability to create entirely original content by leveraging patterns and structures building to their foundational model. Essentially, they have the ability to create language from scratch, simulating human writing. Generative models have attracted massive attention in recent months. They exhibit impressive capabilities in creative tasks, including storytelling, report writing, analysis and even musical composition. Generative models excel in endeavors that demand creativity, imagination and the capacity to produce coherent and contextually relevant text. However, it's important to note that generative models are subject to something called hallucination, where they generate content that appears accurate but is completely fabricated. The danger in hallucination is that it tends to be very plausible, while a hallucination like there's a 20-foot tall Purple Wolf in my backyard is easy to identify [indiscernible] something like there's a man in a hoodie hiding in the shadows is far more problematic. It sounds plausible, but it's completely false. Generative hallucinations tend to fall in the latter category. We'll talk quite a bit more about large language models when it comes to applying AI at healthcare interoperability. Speaking of which, we've seen some of the major challenges facing Interop. Now we want to talk about how AI might be able to help. The most significant burden to interoperability is the huge amount of unstructured data in play when exchanging health data. We understand now why that is, and it's not likely to change anytime soon. But here, there are some clear areas where AI can be productively brought to bear to improve the interoperability story in healthcare. I think there are 4 key facets of the interoperability problem that are most amenable to improvement, and we're going to quickly talk about each of them. A huge challenge is the extraction of data from unstructured sources, and this generally comes in the form of converting this unstructured data into a plain text format suitable for standardization and transformation. This means extracting text from things like faxes, scans and photos, but also to data in video and audio streams. However, it can also apply to what I'd call semi-structured sources like Microsoft Excel or PDF files and the like. In the truly unstructured cases, the best technique available to extract the data and render it into computer readable text is the application of computer vision techniques. These go beyond the limited capabilities of basic optical character recognition and apply techniques derived from human and animal visual processing research to train systems to recognize even poor quality printed text and handwriting which are historically a big challenge for many OCR technologies. Here, we see things like Google Vision AI or Amazon Textract being offered as foundation models, which can be further trained to suit, but it's also an area where there is robust third-party development. This is because there's a lot of heuristic information in visual processing that is domain specific. Here's a somewhat trivial example of this. A computer vision model trained on medical terms is likely to correctly infer that a nearly eligible word is lesion whereas a more general model might conclude that the word is lesson. Now once extracted, this raw text input will almost always need to be standardized or translated or both before it can be used in other healthcare systems. Sometimes you might also have multiple types of content in a single feed, for example, a fax that contains a patient discharge form, a test order and a referral for physical therapy, and you need to organize the parts so that you can only use the bits which are relevant to your current purpose. This can be a fairly complicated task, and there are a number of AI techniques to use in the standardization process. Certainly, natural language processing is a great technology for flexibly parsing textual information and transforming it into a symbolic representation. But as we look at more complex data structures and more nuanced information being represented, we start to see more machine learning techniques being applied. Neural networks with their ability to take specific inputs and produce appropriate outputs are well suited to relatively simple information standardization tasks. But as the inputs get bigger and more varied, large language models tend to shine. Extractive models of the preferred use case in this scenario due to their comparatively low tendency to hallucination. So we prefer those when possible, which would be when you need to generate low to moderate complexity outputs. No matter how you get there, once you've got the data into a standard canonical format, there are a lot of options for use within the realm of healthcare interoperability. But first, we need to address the challenge on the next slide. Note that I just said, getting data into a standardized format, not the standardized format. As John covered, there are dizzying array of different healthcare data standards, most capable of representing the same data in fundamentally different ways. The problem of interoperability is the problem of knowing how to format the data you have so that the other party, whether it's in EHR, human or whatever, can receive, understand and make use of it. The transformation of data from one format to another exists as one of the central problems of healthcare interoperability because there are so -- very many different types of systems in most healthcare settings and crucial data often needs to flow through several of them to reach all the places it's needed. This huge mass of data transformation activity is extremely labor-intensive, either in terms of multiyear integration projects or armies of manual labor used to rekey data from one system to another. We can make use of expert systems and neural networks to facilitate relatively simple transformations but we'll need to rely on deep learning extractive or even generative models for the more sophisticated transformations of data. The latter are especially likely if the final result needs to be in a human readable format. For example, when the recipient is in a rural clinic without access to modern EHR systems. By comparison, the problem of integration that is connecting one system to another, so that data can flow between them is relatively straightforward. However, this final step of the process does tend to be pretty technical and labor-intensive, and AI can lend a hand through the application of AI-driven robotic process automation systems, expert systems to generate or actually perform the integration instructions between 2 systems or neural network models that have been trained so that the output of one system is the inputs of the neural network and the output of a neural network is the input to the other system. So let's take a quick look at a couple of early use cases where we're applying AI to improve interoperability. First, we're going to talk about Clarity on the next slide. Our Clarity platform provides the foundation for our intelligent data extraction capabilities built on our Computer Vision engine, Natural Language Processing and proprietary extractive large language models. The Clarity platform provides flexible context-aware data extraction capabilities without requiring intensive document training sessions or time-consuming [indiscernible] document template so common in last generation data extraction. The flexible and open-ended Clarity platform is we found that our customers are looking for more guidance in solving targeted problems, so we've begun by rolling out business process specific apps to address these challenges. While the Clarity Platform's foundation models are designed to support multiple subject matter domains with broad capabilities, our first 2 Clarity apps are derived from specialized models we focused on the healthcare space. Now before we move on, I wanted to address the obvious question about the 800-pound gorillas in the room, namely tech titans like Amazon, Google and Microsoft, who are investing billions of dollars either in their own offerings on AI or in companies like ChatGPT-maker OpenAI. It's a fair question to ask why these giants won't just gobble up the whole space. I mean they're betting big and have armies of technical talent to throw at the problem. The way we see it, there are 3 main reasons why Clarity is a good approach for us. First, while there is a lot of focus now on general purpose generative AI and a lot of eye catching headlines to go with it, we think that extracted models are the way to go for the foreseeable future. And there are far few of those in play. We see the generative models driving customers to us over time as they like the promise of the technology but can't stomach the hallucination rates. Second, there is plenty of room in this space for a lot of solutions. And while many companies are chasing glamorous headlines bringing AI to patient diagnosis or a medical research or clinical decision support, we're squarely focused on solving administrative and interoperability challenges that are only going to increase as the labor shortage in healthcare continues to trend. We see this as a [ fertile ] uncontroversial and profitable space to compete in. And last but not least, we see these large companies as providing a lot of rock capability that others can use to build solutions, but actually being able to deliver solutions that customers want is about more than just raw technical capability. You also have to be able to bring it to bear on a problem that we're solving and worth paying for. Our extensive experience servicing thousands and thousands of healthcare providers over the full gamut of care settings sets us apart from generic tool providers and our ability to bring advanced technology to bear to create secure, scalable solutions sets us apart from customers who want to roll their own, but lack the experience and resources to get there or continue to maintain and extend it once they do. In fact, we're a big user of AWS, Google and Azure offerings. We just see those as building blocks. We see Clarity as more focused and productized to specific needs. So on the next slide, we're going to look at our first half, which tackles prior authorization, which is when a patient needs to have a treatment or medication approved by their insurer before they can receive it. As you might know, prior auth is a huge challenge in healthcare today. It brings into attention the needs of the patient to get the best healthcare and the insurance payers desire to manage costs and ensure appropriate treatments are being prescribed. First, a couple of quick points about prior auth. According to the American Medical Association, more than 40% of all prescription and treatment prior authorization requests are received by fax, mostly because someone in the request chain doesn't have a compatible EHR system capable of doing end-to-end processing. And over 77 million prior auths required some level of manual handling each year. Another huge challenge is lack of standardization. One of our customers who focuses on prior auth shared with us that they received more than 30,000 distinct form variations every year. Despite these challenges, there are some statutory time requirements on responses to prior auth requests including one that dictates that whenever request is designated as urgent, it must be responded to within 24 hours. To make matters more complicated, there's no standard way to designate this urgency. On the right of this slide is an example of a prior auth request that was faxed in. Highlighted here are important components that we want to extract. You can see that there's printed text and handwriting, a barcode and several types of urgency markings. Now let's talk about how Clarity can help. Our Clarity prior authorization app is defined to simplify the process for prior auth participants. Clarity PA takes in a customer's prior auth faxes, extracts the data in the fax, converts the data into a standardized format like you see on the right and provides that data back to the customer to integrate into their existing workflow. The technology used in this app include deep learning systems for doing document classification and splitting, computer vision for reading those documents and extracting their contents into processable format and an extractive large language model for pulling out all of the various data in the forms into a standard format to be delivered back to our customers. We extract both administrative and clinical details and offer some add-on capabilities that can further accelerate processing based on specific customer workflows. On Slide 22, we look at a daily occurrence in most healthcare settings. Unstructured documents come in via fax, mail or scan. The document relates to a patient who is at that point unidentified. It contains one or more types of clinical documents within it, which are, at this point, unknown. What is the care setting do? Typically, this is a job for human intelligence. There will be someone whose job it is to go through this constant inflow of documents, identify which patient each relates to, understand what kind of clinical information is in the document and render that data into a form that can be attached to the patient record and begin to find workflow for that care setting, whether it's patient discharge, appointment setting, lab test or whatever. In many care settings, the person doing this is a clinical worker, either a nurse or a doctor who needs to perform these administrative tasks to get to the point where the clinical work can actually begin. Our Clarity clinical documentation app is designed to solve for this use case, offering a streamlined way to load these clinical documents directly into the client's EHR system. When the unstructured document comes in, our computer vision models used to extract the content and put it into a textual document that split into discrete documents and classified using a deep learning model. Each document is then adjusted into an extractive model. That model compiles information about the patient and doctor demographics and clinical details contained within the documents. An expert system is used to translate the output of the extractive model into a standardized message called a continuity of care document which is a standard HL7 document type, and this CCD-A has been delivered directly into the customer's EHR by the Direct Secure Message protocol. If the EHR supports it, the clinical and demographic details in the message can be used to automatically match and attach the document to the patient's record and kick off workflows in the EHR. The nurse or doctor can spend their time with patients and not on administrative setup. On Slide 24, we can see an example of a typical CD fax, in this case, a new patient form for a fictitious patient. And on Slide 25, we have the result of running it through Clarity CD. The clinical details are extracted from the facts, they are bundled into a direct secure message and delivered in this case into the NextGen EHR. And all of the details have been loaded into EHR and are ready for consumption by the team member processing patient intake. So we've seen some examples of how the AI in our Clarity platform can be applied to improve the state of interoperability in healthcare. Now let's turn it back to John to talk a little bit about the market for AI in healthcare and our approach to serving it.
John Nebergall
executiveThank you, Jeff. Now that we've described the problem and how AI can solve it. We should have a look at the market. The value of that is created by Clarity and understand how those economics work. Here, you can see an analyst view of AI and the impressive, expected growth. But there is more to the story when you look at the total addressable market for this piece of the operational efficiency subcategory. Now as I've said before, healthcare fax is a lot with that inside baseball knowledge, we looked at this with a different lens. Recent analyst reports have suggested there are about 13 billion faxes sent and received in healthcare annually. From here, this audience can certainly see what the market opportunity looks like, 13 billion faxes, 3.5 pages. That's 45 billion pages plus of healthcare fax on an annual basis, and that really sets the TAM that we're looking at here to create a substantial value for the client, for their patients and a fair economic return for the company. Now let's look very quickly at individual customer economics and the value proposition. Here, this lays out the key issues and advantages of the Clarity Solution as well as the unit economics that clients have to shoulder in order to input structured data into the EHR. Let's start with something more fundamental, velocity. And I'm sure that you're all aware that an entire industry has blossomed in the area of revenue cycle management. The importance of accelerating the time it takes for healthcare providers to get paid for their services create huge value and really is the poster child for time is money. And time is the first use advantage for Clarity. We see the best of our customers' efforts into key information into their systems can take as little as 2 days. Now think of that. I'll get a fax for a patient referral. Now that's money for a provider, but it takes 2 days to get that into my system, 2 days to start scheduling an appointment, 2 days of the patient waiting for a call and 2 days of deceleration of my revenue cycle. And that's the best we've seen. And same is true with things like prior authorizations, claims form, even public health documents, getting structured data into the system more quickly moves the needle on both the economics, and with the delivery of patient care, Clarity eliminates that lag. The next issue, keying errors is even more important. Keying errors increased the potential for treatment errors, which increase the potential for unfavorable outcomes. The National Institutes of Health published the medical error reduction and prevention study that's been updated through May of 2023, and it states that medical errors account for between $4 billion and $20 billion per year and result in approximately 100,000 deaths. The Journal of American Medical Informatics Association published a study on keying errors when entering lab values and found the rate of overall error nearly 4%. And all those errors, over 14% were significantly incorrect. Now with AI, the receiver has given a statistical confidence score that gives them an objective view of accuracy and guides their decision-making. Random keying errors are eliminated. And as the technology gets more document, it continues to learn. It continues to improve, and it builds an ever-increasing level of accuracy. And while the audience today is likely predisposed to just jump straight to the unit economics and hold speed and accuracy as lesser issues, you might be right in many industries, but not healthcare. In this industry, delays in the states can cause real harm. These advantages create significant value. They can be the difference between keeping an accreditation or losing one, can improve outcomes in meaningful ways and are central to any healthcare administrators thinking. Finally, we come to the unit economics, and this one is likely familiar to all of you. I'm probably not spoiling the story here when I jump straight to the punchline, just like John Henry, who laid down his hammer and died, the machine wins. So for the customer, we create value in 3 very concrete ways, speed, accuracy and cost. Our go-to-market is equally straightforward. We design and implement a solution based on customer needs. We then train the machine to a prescribed level of accuracy and are paid at the prevailing rates for that work. Based on customer volumes, we created a subscription service agreement that includes an upper limit of extractions and a per page price if that amount is exceeding. Finally, I'll remind you again of the fundamentals here. 13 billion faxes, 3.5 pages per fax and fax continues to grow. Needless to say, the market opportunity is significant. So to conclude, hopefully, now you have a bit more insight into the technology behind automated intelligence and insight into a real opportunity in healthcare interoperability that was previously thought would always require human intervention, and the thought process it takes to define the solution and the market opportunity. We thank you for the opportunity to share our insights and for your time. And now I believe we've left some room for Q&A.
Operator
operator[Operator Instructions] And the first question today, how is your solution handling language-specific image data in a single document, could page content be mixed with different language data.
Jeffrey Sullivan
executiveThank you. Yes, I'll take that question. So the interesting thing about LLMs is that they understand language at a symbolic level, and so they're able to handle multiple different languages. Our foundational model, which is an extractive LLM, as I mentioned, is able to handle, I think, more than 14 different languages. We currently support English and Spanish in terms of direct application in Clarity PA. We don't do translation right now, but it is something that we're exploring because it is within the LLMs capabilities. But right now, we wanted to preserve the verisimilitude. So if you give an answer to -- in a document in English, the answer will be presented in English when we extract it. If you gave that answer in Spanish, that answer will be presented in Spanish again, but there is an opportunity for us to be translating between those languages and in fact, others as well as we move into the future. Right now, we're focusing on kind of accurately reflecting the answer that came back.
Operator
operatorOkay. The next question is a few questions in one, so I'll ask them altogether. What is the underlying architecture of your LLMs? Are they based on transformer architectures like BERT, GPT or other frameworks? What kind of data was used to train these models. Did you use a diverse and representative data sets? How large is the model in terms of parameters and training data? Do you fine-tune your models for specific tasks? And if so, how do you ensure that the fine-tuned models maintain their general language understanding? And how often do you update your models to incorporate new language patterns, data or address biases?
Jeffrey Sullivan
executiveOkay. So there's a lot to unpack here. I will re-answer or re-state each question as I answer it in sections, just that I can kind of keep it straight in my head. So yes, the underlying architecture of most LLMs is transformer-based as is ours. It is not a generative pre-trained transformer like ChatGPT is, but it is an extractive one. So I would say the lineage traces back more to something like BERT, which is an extractive transformer-based LLM than a generative one. But yes, it is based on transformer architecture as all of the modern LLMs are pretty much. What kind of data we've used to train the models. We train the models with over let's see -- over 1 billion documents were in our training set across a number of commercial and a large amount of open source training sets. The model parameters, I'm not sure that we're necessarily giving out exact details on this, but we have north of 10 billion parameters in our major model, our specialized submodels are optimized, so they can reduce the number of parameters by eliminating irrelevant kind of domain spaces. As I mentioned in the talk, the foundational model is very generalized and is able to handle sort of any domain, and it's quite large at north of 10 billion parameters. The more optimized ones have been tuned -- to answer your question about do you fine-tune your models for specific tasks. The answer is yes. So we derived stuff that was focused on healthcare for these first couple of apps. They use the same basic model, but even the models for Clarity PA and Clarity CD are tuned to their specific purposes. And that allows us to optimize them so that they function more efficiently while still being able to maintain their extracted power. So how do we ensure that those fine-tuning models maintain their language understanding. The general language understanding is all part of the core functionality, and that's not something that gets trained out, but what you might find is that if you're trying to use a -- one of our specialized models that's tuned for healthcare and you fed documents that were heavily say, manufacturing-based it would struggle with manufacturing specific corpus of knowledge because that part of it has been optimized away from in the extracted models in that area. How do we maintain their general language understanding, again, as I said, that's sort of baked into the foundation. So that's not something that we're overly concerned about. The last part of this question had to do with updating the models. We update them at least on a quarterly basis, addressing biases, I think one of the interesting things is that because we're not doing a lot of these generative things, our focus on bias is much reduced. We're really emphasizing extracting the information that's in the document and not making things up. Where you tend to hit the bias a lot more is when you're going to generative aspects and workloads, and it's trying to extrapolate what to say. And that's where the bias in its Corpus can be more fully expressed. So again, quarterly on the updates that incorporates new language patterns, new corpuses, new training sets, all of that stuff done on a quarterly basis.
Operator
operatorOkay. The next question, why aren't the EHR players creating similar AI solutions?
Jeffrey Sullivan
executiveI'm sure that there are EHR players who are doing LLM solutions, you -- certainly, if you're reading the news seen that Epic is incorporating ChatGPT into their EHR. I think the reason why they're not incorporating specific solutions like we're doing in Clarity is that they are very specific to singular use cases. And one of the interesting things about all of these EHRs is that if you've seen 1 EHR implementation, you've seen 1 EHR implementation. And the reality is that whenever you implement 1 of these EHRs, there's a substantial amount of customer-specific implementation that is done there. That is equally true at the AI level. In fact, it's more true because you've got to deal with both training specific to what the customer is looking for and then the specific work that they're doing. EHR providers will probably flow that out into the system integrator workspace and try to allow them to help with that. But then you're really dependent on the skill set of the individual [ SI. ] Our focus here is on delivering tailored solutions that are focused on specific use cases, ones that are broadly applicable in our customer base. And so we feel like our solutions would complement anything that's being done in the EHRs rather than directly compete with it. I will also mention that most of what we're seeing right now when it comes to EHR integration tends to be some variation on integrating ChatGPT, which is a generative large language model with its concomitant challenges, but also with the opportunity for one and the other two both exist in the same space and, in fact, for one to live with and enhance the other. So one of the things, for example, that we're able to demonstrate is if we use our extracted model and we produce our vectorized outputs from, say, a certain set of questions and feed that vectorized output as prompts into a generative model. We can produce that really fluid flexible language but with a vastly reduced hallucination rate because we've dramatically constrained the realm of opportunity for the generative model to go off the rails, if you will. So I think there's lots of space for everybody to play in this thing at the same time. Right now, what we see in EHRs is general capabilities that are going to need to be applied to specific solutions for each customer, and that's where we think we shine.
Operator
operator[Operator Instructions] The next question, what has the reception been like since you have taken this product to market?
John Nebergall
executiveI'll take that one. There's been a good deal of interest, both from end-user providers as well as partners that could use a solution like this in their overall solution set. So we've been pleased with the response that we've gotten. We certainly feel like we're just starting to get our legs under us and the traction started. But again, we're pleased with what we've seen so far in the marketplace.
Operator
operatorOkay. Great. Is the -- is Clarity, the only product that uses AI?
Jeffrey Sullivan
executiveSo I assume we're talking about ours, otherwise, everybody knows there's a gajillion products out there with AI these days. Clarity is our AI platform. That is what we put into our products. And so I think of this in a couple of ways. Clarity is a foundational platform that we're using to embed AI-driven capabilities in a number of our products. The first of these, we're seeing are these apps that we're creating, which interfaces with our eFax Corporate product line. But we do anticipate that Clarity will become a stand-alone product in the future that doesn't feed in any of our particular products and kind of you can feed with other inputs. Having said that, we are certainly looking at a number of other applications, apps that we're going to be building on top of our Clarity platform. And we're also in conversations with a number of customers about direct use of the Clarity platform itself. I think that if we talk about how things ramp and roll out. What we really wanted to do, which I referred to earlier, is give some customers something that they can implement quickly. And Clarity CD is something that is really a turnkey solution. You have a meaningful use certified EHR, you can snap it in and start using it immediately. Clarity PA is more of that traditional integration where it's an API that performs a bunch of functions and then you take those functions, you integrate them into your existing workflow. When you're using the Clarity platform, it's more of that kind of an integration. And so there's a broad array of area with which it can be used. But the way we look at this, we will be delivering a number of Clarity apps over the next couple of quarters. We're not going to announce any of those right now, but I think you can look forward to in the next couple of quarters some interesting additional announcements on Clarity apps, and we'll also be integrating Clarity and the other capabilities that we're doing into our product lines. That's also the case that we'll be using our own Clarity platform internally as well. But from a product perspective, Clarity is our AI product. That platform will deliver multiple capabilities, many of which will be apps that we offer that are more readily consumable, but also the raw Clarity platform capabilities are also going to be available with customers as well.
Operator
operatorOkay. And how many specialized Clarity products does Consensus offer?
Jeffrey Sullivan
executiveWell, we talked about Clarity CD and Clarity PA today. We are working on some others that -- again, not prepared to talk about at this point in time. But I would say look for our announcements in the coming quarters, you will see a number of additional Clarity apps and you will also see more information about what other things are being done on Clarity in the form of use cases and press releases and whatnot.
Operator
operatorOkay. And it seems that, that is all of the questions that we have today. I would now like to hand the floor back to Jeff and John for any closing remarks.
Jeffrey Sullivan
executiveWell, I'd like to thank everybody for paying attention to this. We're really excited about what we're doing with the application of our Clarity platform, in our products and between our products. As I mentioned just a moment ago, look forward to more exciting news to come in our earnings results and our quarterly meetings coming up. And thank you all for your attention. John, any final thoughts?
John Nebergall
executiveI really appreciate everybody's time again this morning, the opportunity to talk about the advancements that we're making in this key area and the way that we are building value, not only for us as an organization, but for the healthcare community at large. And thank you again for your time.
Operator
operatorThank you, ladies and gentlemen. This does conclude today's event. The replay for this event will be available this afternoon. Please use the same link to access the replay. Also, please follow up with Consensus regarding any further questions regarding the content of today's webcast. Thank you for your participation. You may disconnect at this time.
For developers and AI pipelines
Programmatic access to Consensus Cloud Solutions, Inc. earnings transcripts and 32,000+ others is available through the
EarningsCalls.dev REST API. Plans from $24.99/month — full transcripts, speaker segments,
full-text search, and the recently-added /api/v1/transcripts/recent polling endpoint for ETL pipelines.