Seagate Technology Holdings plc (STX) Earnings Call Transcript & Summary
June 5, 2024
Earnings Call Speaker Segments
Unknown Attendee
attendeeNow let's move on to our next speaker, Ms. Rosalina Hiu, the Global Vice President of Brand Strategy at Seagate, a leader in hard drive innovation. She will discuss demands of the AI era. Welcome, Ms. Hiu. The floor is yours.
Rosalina Hiu
executive[Foreign Language]. I'm so happy to be here in COMPUTEX. And I'm Rose Hiu. I manage the global brand strategy for Seagate Technology. And I'm here to share with you how AI storage, specifically hard drive shape the AI future. And before I do that, because I can, I want to actually say this, I want to celebrate all the female in the tech industry in this room. So give it applause. Keep growing and keep believing in your career. So you and I are living in amazing time. This is the time of the biggest disruption potentially in the humankind history. And this kind of a moment had happened before in the past. It's comparable to -- if you look at the history of invention, steam engine accelerating the global industrialization, the personal computer, the Internet, the smartphone, it accelerated information sharing and gathering and also shape how we interact with each other, how we learn and how we do business. And GenAI potentially can do that big of a transformation or even bigger. It will change the fabric of our life. Now with the past innovation, it has connection to data but GenAI is rooted in lots, lots of data. You heard from the other speakers about trillions of token, parameter, et cetera, those are data and data fuel AI. And AI requires more data to make it better. And AI also will fuel data generation, new data generation. And this new data generated by AI, the GenAI specifically, will needs to be grounded with that original data they are being trained as a source of truth, meaning that data really becomes so valuable than ever in this AI era. And storage is super critical to preserve the data for AI. According to the IDC, the global known analysts, they're predicting that by 2028, there's going to be about 394 zettabytes of data. It's a lot of data. And they already included some prediction of GenAI in here. It essentially double in size in 5 years. This forecast, however, is actually understated because it does not include yet the potential of multimodality LLM or what we typically call also the Vision model because it's really hard to predict. We're still in the very beginning of this Vision model. And as it goes ubiquitous, the growth rate maybe become more than what we expected. GenAI is truly a data creation force multiplier. The prediction from GenAI alone, it will create about 100 zettabytes of data in 4 years. So if you remember the number in the previous slide, that's approximately about 25%. But again, it's understated. And some sources say it's about 170% to 200% CAGR for GenAI to generate data. That's actually significantly more than when the era of smartphone and PC in terms of data generation. And why is it that big? #1, it's that richer content with the multimodality GenAI that involves image and video, no surprise, the data size is huge. So then most likely, the data that's going to be replicated also going to get bigger and we need this replication because in order for all this new generated data from image and video to be used, it needs to get closer and closer to wherever the use cases are or the users are. So more applications are going to be needed. And additionally, we believe that more retention is needed for the data. I mentioned about the source of truth. If you hear or read some of the recent news, AI can hallucinate. It's not something that we need to be negative about. It's just something that we need to go through as the technology getting mature but it is concerning if there's no source of truth. One of the funny example that I remember is, the Gemini, when it's being asked about who are the founding fathers of America, it was highly, highly more biased towards diversity answers, which is completely wrong. And with -- if we don't preserve our historical information, then most likely the distortion of truth can happen years, years after. So it is critical to actually retain source of truth of data. And the governments all over the countries and the world requiring also data retention policy. So we believe with GenAI that retention policy is going to get longer and it could be indefinitely. So the conclusion here is that it's just going to be more exabytes that we have to deal with in the world. So for every technology that are introduced in the human mankind, there's the adoption that's called the S-curve. And GenAI is actually following this S-curve too and it actually accelerates. And we believe that when it gets to the mass adoption, the critical thing that everyone has to be prepared for is that scale of storage demand. Now there are technically 4 stages here and let me just go through quickly the stages. The first 2 stages essentially where today we are now, it's about building and training the model and then also deploying the model and creating application on top of this model. It's a lot of excitement. A lot of the start-ups are actually proliferating in every country trying to build new application. And then after that, it's the early adoption of those application and then the mass adoption is going to happen and this is where the storage is going to take off. But in each of these phases, technically, even the first 2 phases, storage are critical too. And you can think about all the data that needed, like I mentioned to train, it required a data lake to support it. And most likely, it's using the existing capacity of storage. However, with the Vision model, we believe that more and more of this data lake is going to have to grow, even the existing capacity will not be enough. For AI to reach full potential for consumer businesses, it does need to be deployed everywhere, at the cloud or the core, at the edges, which is typically an on-prem or colocation and also at the endpoint where the consumers are using the application. So cloud is where the foundational large language model are being stored and trained but you also heard lots of the announcement from Intel, AMD and Nvidia about NPU that will enable compute for AI PC and AR workstation and also Qualcomm Snapdragon for mobile AI compute. With all this multimodality large language model development, the data size is going to get bigger and that means also in each of this deployment state, we will need mass storage capacity at the cloud, the edge and the end point. So as AI going everywhere, storage will also be everywhere. So it is not enough for company or any company that are data intensive to think only about compute and networking requirement to be AI-ready. Storage is the backbone for data preservation that is needed for AI. Existing storage capacity will not be sufficient, so enterprise will need to be strategic to plan ahead for that storage growth as the mass deployment going to happen. So data centers must quickly also increase storage capacity. And they have to balance these 2 forces going against each other, the explosive data creation and resource scarcity. The explosive data creation competes against the resource capacity constantly. And let me go through the resource scarcity that I mentioned here. Real estate, I think you heard from everyone already how data centers are growing like crazy. It's the hottest asset because of AI. I read an article that says that the availability of colocation and data center in Singapore, Tokyo and Hong Kong is below 2%. And all the companies are growing their data centers and looking for land and it jack-up the price of real estate. The CAGR of data centers globally is about 10%. However, the hyperscalers grow at 20% rate. So everyone is competing. The colocation space essentially is very competitive in terms of prices and the cost to build data center is also growing up to USD 1.5 billion. Second is the power. It is very energy hungry. I think everyone before me already mentioned this as well. It's about 160% more power demand, AI is going to need by 2030. And considering 1 GenAI image is being generated, it takes about the same as the energy of 1 fully charged smartphone. So yes, we do need to figure out how this power issue is going to be. Now when it comes to storage decision, besides the sustainability aspect is the budget. There is a TCO, the terms of total cost optimization for storage decision process. And that is acquisition of the devices, the power and other costs. But the largest cost out of the 3 is actually acquisition of the devices. So how do enterprises and hyperscalers decide on this? Now let's use hyperscalers because they are the poster child as an example of the best in terms of TCO optimization. So I'll let you know the secret. It is not SSD that they use the most. It's actually -- 90% of their data, they store in hard drive. And why is that? Because of the TCO of the acquisition cost of hard drive, it's 6: 1. It will forever be 6:1 ratio for SSD for the foreseeable future. So given that, hard drive is a fundamental component for cloud storage and any data center that needs to scale out for GenAI. With that, we do need to innovate and figure out how to support them. So hard drive areal density is the key to support the key challenges of AI, data center and AI storage. How do we increase the capacity of hard drive? Technically, there are 2 ways here. The easy way to think about it is that in the same form factor, with the current PMR or Perpendicular Magnetic Recording, you can add more platter. At the maximum right now, there's only 10 platter or 10 disks inside the hard drive. You can add 11th platter to increase the capacity. It sounds simple. However, it actually adds more cost to the materials and it also increase the operating cost, power consumption as well as resource usage. And the other way is the most sophisticated one and not easy to get to, is to actually pack more bits in the same materials of the form factor that we have. And this is the most sustainable solution and it actually helped the company to lower the power consumption as well. So Seagate in early of this year, had introduced Mozaic 3+ platform. This is the highest areal density you can achieve in hard drive industry right now. The areal density that we have achieved is 3 terabytes per disk, which is far more compared to the past platform, which is around 2 to 2.4 terabytes per disk. Mozaic 3+ is a composite of the most complex nanoscale recording technologies and material science breakthroughs on the industry. It is as complicated or even more with the semiconductor industry. It is our decades of R&D to achieve this density and is extensible beyond 3 TB per disk. More bits are packed in the same form factor 3.5-inch hard drive, no additional platter head. It's compatible with existing system configuration, so it's easy to deploy and transition for any data centers. So what is the value of areal density increase? It's simply scale again, TCO optimization and sustainability. And let me quickly go through that. The impact of areal density at scale is profound. Think about this example. If a data center, in average, they have a fleet of 16 terabyte in their data center's rack and they upgrade it to 30 terabyte, which is basically our 3 TB per disk platform, Mozaic 3+. It essentially delivers double the capacity of storage in the same data center with the same floor space. And what about the power? With the same scenario from 16 terabyte upgrade to 30 terabyte, they will see at least 45% plus power savings. You can see the difference of the number here, the 16 terabyte consume about 0.59 watt per terabyte, whereas the Mozaic 3+ consume about 0.32 watt per terabyte. So roughly, it's about 45%. And we also observed greater than 55% embodied carbon reduction and it's great for the sustainability aspect. So it's about storing more without increasing consumption of space, power and natural resources. Additionally, the new platform is -- was made with 28% less of recycled material by weight and that also reduces the packaging and shipping impact. Areal density is truly an enabler for a sustainable datasphere and circular economy. The areal density breakthrough is not just what you see with -- what we introduced, Mozaic 3+. Tomorrow's advancement is already here now in our road map, in our lab, we already achieved the 4-terabyte per disk and 5-terabyte per disk density. We have the prototypes and we believe that we can deliver 50-terabyte drives by 2028. So that's about doubling the capacity for every 4 years. In the world of AI, data upholds AI and storage, hard drive upholds data. Hard drive areal density technology breakthrough is supercritical for AI storage to scale and to be sustained. It is where we believe the future of AI is read and written. Thank you and we hope to see you in the fourth floor.
Unknown Attendee
attendeeThank you, Ms. Hiu, for showing us for the future is being read and written.
This call discussed
For developers and AI pipelines
Programmatic access to Seagate Technology Holdings plc earnings transcripts and 32,000+ others is available through the
EarningsCalls.dev REST API. Plans from $24.99/month — full transcripts, speaker segments,
full-text search, and the recently-added /api/v1/transcripts/recent polling endpoint for ETL pipelines.