by Sara Fedele, Senior Manager, PR and Communications at Kickstart Innovation
Over the coming weeks of our 11-week program, we will be showcasing the journeys of several Kickstart participants. Today, we shine the spotlight on YData, a company transforming the data and AI landscape with its innovative synthetic data solutions. Since joining Kickstart, YData has continued to make remarkable progress, from securing major partnerships to advancing privacy-preserving AI technologies. In this interview with the CEO, Gonçalo Martins Ribeiro, we dive into the company’s recent milestones, the growing demand for synthetic data, and its vision for the future of data-driven innovation.
Gonçalo, what inspired you to start YData, and how did you identify the gap in the market for synthetic data solutions? How has your vision for the company evolved since its early days?
When we founded YData, we recognized the growing challenges in accessing data for Data and AI projects. The need for large, diverse, and high-quality datasets was clear, but obtaining them often came with barriers like privacy concerns, bias, poor quality and limited access, just to name a few. This gap was particularly painful in industries like finance, telecom, pharma and healthcare, where privacy regulations restrict the use of sensitive data and where data collection and labelling are often faulty. Our vision started as a solution to simplify data access, but, over time, it has evolved into a mission to empower data teams with high-quality data that is both privacy-preserving and high-fidelity, so they can drive innovation faster and more ethically.
YData focuses on generating high-quality synthetic data. How do you see this technology transforming industries, and what excites you most about its potential?
Synthetic data has transformative potential across many industries. It can significantly accelerate the development of Data and AI projects by providing diverse, high-quality data, especially in areas where data is scarce, sensitive, or of poor quality. As AI adoption becomes inevitable, those who don’t adopt this technology risk being left behind, and data is the foundational element of AI systems. Ensuring access to high-quality data is not just crucial; it is imperative. What excites me most is how synthetic data can democratize access to quality data. By making it easier and safer to obtain, we can empower businesses and researchers to innovate faster, without being hindered by regulatory challenges or concerns over data scarcity.
Data privacy is a critical issue today. How does YData balance innovation in synthetic data with the need to ensure privacy and compliance with global regulations?
Data privacy is central to YData’s mission. Our synthetic data generation technology is designed to ensure that the data is non-identifiable and non-reversible to the original datasets, making it privacy-preserving by design. While maintaining the statistical integrity of the real data, we ensure no sensitive information is present. YData’s Fabric platform allows companies to safely innovate and train models without the risk of data breaches or non-compliance. Additionally, YData is SOC 2 Type 2 certified and consistently aligns with global regulations such as GDPR, CCPA, and HIPAA, ensuring that our solutions not only meet but exceed privacy and data protection standards.
As the demand for data-driven solutions grows, what trends in the industry are you most excited about? How is YData preparing to stay ahead in this rapidly changing landscape?
One of the most significant trends today is the tightening of data privacy regulations, alongside massive investments in AI, particularly in Generative AI. This combination of stricter regulatory oversight and rapid AI advancements presents both challenges and opportunities. On one hand, companies must exercise greater caution in handling sensitive data; on the other hand, AI is progressing rapidly, offering enormous potential to transform industries. At YData, we see a tremendous opportunity in this space. Our synthetic data technology enables businesses to develop and train AI models without compromising privacy. By using privacy-preserving synthetic data, organizations can remain compliant with regulations while harnessing the power of AI and Generative AI to drive innovation.
YData has achieved a lot in a short time. What are some key milestones that you believe have been pivotal to the company’s growth and success?
YData has experienced remarkable growth, reaching 1.5 million monthly downloads, with many Fortune 500 companies among our users. Today, over 13,000 data scientists regularly use our solutions, solidifying YData’s position as a leading provider for data professionals. Beyond widespread product adoption, we are pioneers and research leaders in the field. We have submitted three patents and published numerous papers in recognized journals, showcasing our ongoing commitment to innovation and leadership. These milestones underscore our dedication to the data science community and our mission of advancing responsible, privacy-preserving AI development. In addition, we’ve established key partnerships with industry leaders such as Microsoft, Amazon, and Databricks. These collaborations have significantly expanded our presence in the data space, opening new opportunities and enhancing our ability to deliver impactful AI solutions.
How did YData enter the Swiss market, and what has your experience been so far? Were there any specific challenges or opportunities unique to this market?
Switzerland presents a very promising market for YData, given its strong focus on data privacy, innovation, and high-quality standards. While our presence here is still in its early stages, we are excited about the opportunities and are working to establish a stronger foothold. We are forming partnerships with technology providers, academic institutions (such as our research collaboration with ETH Zurich), and industry leaders to drive innovation in Data and AI. The Swiss market aligns perfectly with our mission, and the Kickstart Innovation program has been instrumental in helping us navigate this environment, offering a soft landing as we build relationships and explore collaborations with key industry players. That said, YData already has a solid presence in other markets, particularly in the USA and UK, where we have formed strong partnerships and achieved rapid growth. These markets have been pivotal for our expansion, and we are leveraging our experiences there to inform our efforts in Switzerland. With its emphasis on privacy and data protection, Switzerland represents an ideal environment for our solutions. We are confident that, with continued focus, we will be able to replicate our success in this market.
What role do you think data will play in solving some of the world’s biggest challenges, such as climate change or healthcare?
Data plays a crucial role in addressing global challenges. It can help model the impact of policies, develop more efficient renewable energy systems, and predict climate-related risks. In healthcare, it can accelerate drug discovery, improve diagnosis, and personalize treatments. Our strong focus on privacy ensures that data-driven solutions can be developed without compromising individual rights, enabling ethical progress while leveraging state-of-the-art technology.
Looking ahead, where do you see YData in the next five years? What impact do you hope the company will have on the data science industry and beyond?
In the next five years, I envision YData becoming a key player in the Data & AI space, leading the adoption of responsible and privacy-preserving solutions across various industries. Our impact will go beyond data science, as we aim to shape the standards for data access, quality, and responsible AI development. As the leader of the data vertical in the Center for Responsible AI, we’re committed to promoting the responsible use of data, mitigating bias, and ensuring the ethical application of technology. Ultimately, our goal is to democratize access to high-quality data, empowering companies and researchers with the tools they need to innovate responsibly and efficiently.
About Gonçalo Martins Ribeiro
Gonçalo Martins Ribeiro is the CEO of YData, a San Francisco-based company specializing in synthetic data generation. His entrepreneurial journey began with a passion for bridging the gap between education, research and business. At YData, he has been at the forefront of innovation and advocacy for responsible AI practices. Ribeiro’s vision centers around creating artificial data that mirrors real-world scenarios while addressing privacy and ethical concerns.
About YData
YData is a pioneering company focused on improving access to high-quality data for AI and data science teams. Its mission is to help organizations better understand and improve their data, with a strong emphasis on privacy and security. YData’s solutions are built to handle, process, and generate synthetic data, enabling innovation while maintaining data privacy. The company is SOC 2 Type 2 certified and collaborates with industry leaders, contributing to cutting-edge advancements in synthetic data and data-centric AI.
If you are a startup active in finance, insurance and deep tech, or interested in collaboration with corporates such as AXA, you can learn more about our program and pre-register.