GSI Technology, Inc. ($GSIT)
Earnings Call Transcript · March 10, 2026
Earnings Call Speaker Segments
Operator
OperatorGood day, and welcome to the iAccess Alpha Virtual Best Ideas Spring Investment Conference 2026. Our next presenting company is GSI Technologies, Incorporate. [Operator Instructions] I'd now like to turn the floor over to today's host, Mr. Didier Lasserre, Vice President of Sales and Investor Relations for GSI Technologies, Incorporated. Sir, please go ahead.
Didier Lasserre
ExecutivesThank you, and thank you for joining us. As the moderator mentioned, my name is Didier Lasserre. I'm Vice President of Sales and Investor Relations here at GSI. GSI has been a semiconductor company for over 30 years, known for our high-performance SRAM products that are used in networking, defense and other demanding applications. That business remains an important financial foundation for GSI being it generates revenue and cash that support the development of our next-generation technology, the Associative Processing Unit. We are excited about our APU due to our proprietary compute and memory architecture. Our current product, the Gemini-II is designed for power and low latency constrained edge environments such as drones, satellites and any other autonomous systems where minimizing data movements enables significantly higher performance per watt than traditional architectures. Gemini-II is already being evaluated in defense and other edge applications as we work towards initial design wins. Today, I'll briefly walk through one of the -- I'm sorry, the technology behind the compute and memory architecture, the edge AI market opportunities we're targeting and how we plan to bring the APU platform into commercial deployments in the upcoming years. The key takeaway I want to leave you with is that edge AI favors architectures that deliver the most compute per watt, and that's exactly what the APU compute and memory is designed for. I will be making some forward-looking statements, so we have included the safe harbor statement here. A quick overview. As we mentioned, we are a leader in the high-density, high-performance memory market. We've been partnered with TSMC for our wafer fabs for over 30 years, and this will be the same partnership that we'll be using for the APU as well. We've developed and invented the APU chip, which is, as I mentioned, a compute in memory CIM technology. We are targeting the edge. We are not looking at the data centers at the moment. And so the APU is really manufactured and designed for edge applications. To date, we spent over $175 million on the APU R&D, which has been funded using our SRAM product line. This past October, we raised net $47 million through an equity raise. If you look at our trailing 12-month revenues, we're just under $25 million. In fact, this month, at the end of this month, we'll finish our fiscal 2026. If you compare what we're running at for this year versus last quarter -- or I'm sorry, last year, fiscal 2025, it will be about a 25% increase in revenue. We outsource the labor-intensive portions of our business, the fab, assembly, day-to-day sales. And so we're able to keep our headcount down to a very efficient 122 employees. So the majority of the employees are either hardware or software engineers. We have a very unique architecture, which I'll be talking about extensively today, and we want to protect that. So we've been aggressive with filing patents. So we now have 87 patents, specifically for the APU. The balance sheet is strong. We have just over $70 million in cash and cash equivalents, market cap over $300 million. I want to say this morning, we're about $320 million in market cap and a high insider ownership of 20%. So just looking at a high level, what's the -- what are really the challenges in the AI market space? What's the bottleneck? And really, the bottleneck is the fact that data needs to be moved and transferred constantly within the system. And when you're moving data from memory to where the compute elements are, it takes time. It takes latency and it also takes a lot of power. For edge environments, which is what we're focused on, really, the compute is constrained because you have a very limited power budget. And this is where the APU fits in perfect because we do the compute or the processing where the data resides, and I'll explain how that works. If you look on the right, that's essentially what a GPU or CPU looks like. You can see that you have the DRAM, which is the memory, which is where all the data is stored. And so if for some reason, a GPU needs to do a compute, it needs to grab the data from DRAM, has to transition and be transferred through L2 cache to L1 cache before it gets to the compute elements. And once that data is used in the compute elements, it has to be written back all the way through the same cycle back through L1 through L2 back to memory. So this constant transfer of data, besides taking time takes a tremendous amount of power. If you look on the left, that's our architecture, very simple. We actually do the compute or the processing in the memory array itself. And so the compute bits or the compute elements, processor bits are physically in the memory array. And so we are -- the memory or the processing is actually where the data resides. So we're not having to go fetch it and then we don't have to write it back. And once we're through using it, it remains in place. So this really significantly increases the performance and lowers the power. Also in our architecture, we have over 1 million bit processors that can work simultaneously. So we have massive parallel processing with our technology. And then lastly, our resolution or our bit width is not predetermined like GPUs. It's not predetermined to be an 8-bit or 16-bit. It's -- we're a bit engine and you can configure it any way you want, and it can change from cycle to cycle. So if this cycle, you want to do some processing at 8-bit, that's fine. Next cycle, let's say, you have a model that's most efficient at 3-bit, you can go ahead 3-bit, no problem. So what's interesting is a lot of the AI workloads are moving to the edge for several reasons. Mainly, what you're finding is that it's real-time responses are required. And so you need to start doing the processing where the data is collected. And so that transition is happening. Now the other reasons for that transition is the fact that cloud compute besides becoming expensive, it also takes time and it's also not private. Some of the military and defense applications are working on the data that is not allowed to leave the device for security reasons. And so therefore, there's really a huge demand now at the edge for this real-time inference. So why does AI at the edge require a new architecture? If you look at the traditional methods, again, as we talked about, the data is separated from the compute. And so you have to go get the data. Again, this constant transferring of data requires time and power. So if you look at the GSI APU, with the data in the memory array where the processing is happening, we're able to, again, lower the latency and lower the power at the same time. So GPUs are good for data centers. I mean they're great for training, and they're very good for large data centers. But when you get to the edge, power, the performance per watt is critical. And that's where the APU really shines. So this is a true case POC. In fact, this is the POC that we announced last quarter. And this is for a drone perimeter security program. And so at the time, the drone manufacturer needed specific parameters. They needed the time-to-first-token to be no more than 3 seconds, and they needed at the time a system power to be no more than 50 watts, preferably 30 watts. And so this drone manufacturer went to NVIDIA first and we looked at the Jetson and the Jetson gave them the time-to-first-token that they're required, but it was significantly over 100 watts for that performance, and that did not work for this drone power budget. So then they looked at a Snapdragon from Qualcomm, and they were able to achieve the power requirements, but it took 12 seconds to get the first token out. That's 4x slower than they could afford. And at that point, they looked at the Gemini-II from GSI, GSI was able to give them the performance of 3 seconds on the time-to-first-token, along with the 30 watts on the power budget. And so this was a critical win for us, and we were chosen as the hardware solution for this program. If you look at the market sizes, a lot of folks are really concentrated on the data center. Everything is about the data center. But if you look at the edge AI market, it's going to be exploding in the future. Right now, it's estimated to be about $20 billion, growing to $120 billion by 2030. If you look at the markets that we want to address, we're about $7 billion right now, and it will be more than doubling in the next 5 years. And these markets include drones, SAR satellites, in autonomous systems, smart cities, automated warehouses, anything in that category. So we've talked about Plato in the past. So Plato is our next-generation part. It's going to be designed specifically for LLMs at the edge. And I want to emphasize at the edge. So certainly, there are GPUs today that are working on LLMs for the data center. Those require over a kilowatt of power consumption. If you look at the Plato, Plato is going to be designed to be around 10 watts and in a lot of cases, less than 10 watts to deliver an LLM. So it's really designed again for the edge. We started the design this past quarter, and we're anticipating having the design done about a year from now. So we'll be taping out in the first half of 2027. If you look at our strategy on how we want to monetize the APU family, we're going to be starting with the Gemini-II. And we're starting with applications like drones and smart cities and anything basically physical AI at the edge. We're doing this through POCs and through some of our government contacts, which I'll talk about how we're doing that. But really, it's these advantages with the CIM architecture for things like time-to-first-token and other surrounding awareness that are really going to be leveraged for all these different applications. And again, the APU gives you that unique architecture for that low power. We anticipate Plato to kick in sometime in 2028. And then we're already having discussions with partners on what the generation after Plato looks like. Some of our successes have come from the mill defense areas. We've been very successful with SBIRs, which are essentially grants from the government. To date, we've been awarded $4.4 million worth of SBIRs. I kind of -- I'll start at the bottom about these SBIRs. We had our first Phase 1 win with the U.S. Army second half of last year for $250,000. We also had a win with both the U.S. Air Force Research Labs and the Space Development Agency. Both were worth over $1 million. The Space Defense Agency SBIR was extended for another $751,000. And the purpose for that extension is SDA wanted to see what our commercial chip look like on a robust level. So we're taking this grant to put our commercial Gemini-II under radiation beam and other ionization type testing to see what it can do. And then another grant, which will be coming in is the POC that we discussed with our partner, G2 Tech in Israel for a program called Sentinel that is for DoD or DoW and another foreign defense agency. And this is for a drone camera perimeter security environment. And this is something that we'll be together with G2 Tech demoing to these agencies in the summertime. As far as future opportunities for SBIRs, we have a pipeline somewhere between $6 million and $10 million that we have submitted. We feel especially good about one of the submissions that we've done for the U.S. Army on the Phase 2. It's for a ruggedized edge node that can be used for a lot of different applications from SAAR to object detection to drones. We are hoping to hear back on that one shortly. And then we're also going after other funding sources are a little larger than SBIRs. We're looking at ones like STRATFI and TACFI and other ones like BAA, which will give tens of millions of dollars of grant money. And of course, we're also looking at partnerships with potential customers to have some other strategic funding as well. As far as the financial overview, the revenues have been growing nicely over the last 1.5 years or so. We dropped off a little bit last quarter, but we're running over $6 million. Operating expenses have been running about $7 million a quarter. You could see a bump up there in our December quarter. That was with the purchase of the IP required for the Plato design to start. That was just over $3 million worth of IP that we purchased that quarter. And as far as the cash and cash equivalents between the $47 million raise that we did in October and some of the ATM purchases during that quarter, our cash went up significantly to just over $70 million. Just on a quick overview of the legacy product line of the SRAMs. We do have the highest density, highest performance memories in the market space. We are at least 1 to 2 generations ahead of our nearest competitor. And good news is all of our competitors have frozen their road maps. So we'll continue to enjoy that leadership position for the future. The SigmaRAM, SigmaQuad family have really been driving the gross margins and the revenue because they attribute over 50% of our revenues from those families. We're taking that legacy SRAM and we're hardening it. We have hardened it. We've made Rad-Hard and Rad-Tolerant products. These are for satellites anywhere from the LEO satellites to the GEO satellites. What's unique about this market is it's very high ASPs and gross margins. If you look at the kind of the range of ASPs, a lower density Rad-Tolerant might be a few thousand dollars and a high-density Rad-Tolerant could be as high as $30,000. And again, with gross margins north of 90%. So this is a market that we've talked about in the past. We've sent out lots of different samples and prototypes for several programs, and we're just waiting for those to go into production. It's a long design-in cycle for these markets. So in summary, again, this computer memory device we have, this architecture really is unique in the fact that it allows us to really decrease latency, decrease power consumption makes us a perfect use case for edge applications. And again, anything from drones to satellites, any autonomous systems, smart cities, anything like that is a perfect market to address. We have proven advantages. The SBIR wins, the Cornell paper that went out last quarter that showed -- compared us to a GPU for RAG application, and it showed that we were more than 95% less power for the same performance. And then also the POC, the drone surveillance POC program I mentioned, that was a bake-off between us, Qualcomm and NVIDIA that we won. So it's certainly proven advantages. So we like to refer to ourselves as kind of an AI start-up. But in a lot of cases, I mean, in a lot of cases, we're really not a semiconductor start-up. So remember, we've been in the business for 30 years selling and shipping SRAMs. And during that time, we shipped over 100 million SRAMs. The manufacturing process we're using for SRAMs will be the same for our APU. I mean we'll be using the same wafer foundry. We'll be using the same assembly house. We'll be using the same testing. And so we have a proven model, 30 years' experience for when we ramp the APU. And again, those SRAMs have been funding nondilutive the APU R&D. And lastly, we have a strong balance sheet with, again, over $70 million in cash with no debt. At this point, I'll open it up to questions and -- or Q&A.
Didier Lasserre
ExecutivesOkay. So the first question, given Gemini-II's ultra low power, low latency performance, where do you see the strongest initial deployment verticals across drones and surveillance? So yes, certainly, drones, we've done a significant amount of software work on SAAR, object detection and now time-to-first-token. And then again, this POC that we're doing, it's with the demo going in the summer, it's with the intent to sell these to the DoD and other government agencies. So certainly, that will be the first area. What portion of future revenues do you expect to come from defense versus commercial edge applications over the next 2 to 3 years? That's a good question. It's going to start -- it's going to be leaning more towards the defense versus commercial. That's because that's where we've seen our early successes between the interest from the DoD, SBIRs that we've won along with this POC, we certainly see that, that's going to be the first entry into revenue. But again, we're trying to follow that up quickly with other applications like smart cities. But for the next 2 to 3 years, it will be leaning much more towards the defense side. You see here what is the size of the Gemini-II cache? Right now, it's 96 megabyte of memory on the Gemini-II. That's 8x more than Gemini-I. So candidly, on the Plato, it will be less on the internal cache. But as you recall, with Plato, it's going to be a different application. It's going to be LLMs. So LLMs, by definition, Large Language Models, won't fit on inside of a chip. And so what we've done is we've actually lowered the cache on that one in order to be able to really increase the pipeline, the bandwidth to be able to get extra data in the chip. But 96 megabyte for Gemini-II. Did you publish the details for where the 3-second figure for the Jetson Thor came from? That came from the drone manufacturer who did the bake-off. The numbers that we presented were the numbers that they did in a benchmark program that they did. So -- I'm sorry, are you going to move from DDR4 to HBM2 to avoid external memory bottlenecks? No. So we're actually going to be going to GDDR5 for Plato. So we -- again, these are edge applications. And so we need to keep them low power and lower cost as well. So it's prohibitive of moving to HBM for that. So we'll be going with the GDDR5. Comment -- please comment on the ATM. So right now, the ATM is not active. I think there is a couple of million dollars that were left on it. But at this point, it's not active. Let me see. Okay. So are potential partners already driving decisions making the Plato road map? Yes. And again, the current Plato program as well as, we'll call it, Plato 2, a future Plato. So yes, those are definitely with partners. Okay. So we hear, okay. So is the 98% power savings reported by Cornell based on simulated HBM external memory interface? Candidly, I don't believe it was simulated. I think that they actually did a program, but the Cornell Paper is out there for you to be able to read it and go through that. But candidly, I don't remember what the memory interface was. Is manufacturing in Taiwan? Yes. So we use, as I mentioned, TSMC as our fab. We use a company called ASEK to do our assembly, and we do our own testing. We have a facility in Sunnyvale, California, which we do all the R&D testing. And then when it goes into mass production, it goes to our Taiwan facility for the testing. Okay. Let me see here. Okay. You received a reported 3-second time-to-first-token for Gemma-3 12B on Gemini-II. Can you confirm that the benchmark was achieved on the final product silicon? And are there plans to release a live demo for third-party audit? So yes, the 3-second time-to-first-token was done on the current production -- I'm sorry, current silicon, which is going to be our production silicon. And as I mentioned, the demo right now is going to be done in -- sometime in the summer is when it's scheduled to be. Let me see here. Do you expect to raise more money anytime soon? And are you looking -- are you talking to any strategic investors? Also, is the SRAM business potentially up for sale? So certainly, to scale, we're not actively looking to raise more money, but more money will be needed, I'm sure, in the future. Is the SRAM business potentially up for sale? Well, that's a good question. Certainly, if the right opportunity came, we would certainly entertain that. At this point, I am actually out of time. Operator?
Operator
OperatorThank you, sir. Ladies and gentlemen, that concludes GSI Technologies, Incorporated's presentation. You may now disconnect, and please consult the conference agenda for the next presenting company.
For developers and AI pipelines
Programmatic access to GSI Technology, Inc. earnings transcripts and 32,000+ others is available through the
EarningsCalls.dev REST API. Plans from $24.99/month — full transcripts, speaker segments,
full-text search, and the recently-added /api/v1/transcripts/recent polling endpoint for ETL pipelines.