Loading…
DeveloperWeek Management 2024 + AI DevSummit 2024 (+ DW...
Attending this event?
Thursday, May 30 • 11:00am - 11:25am
PRO TALK (AI): Crafting Realities: The Role of Synthetic Data in AI Training

Sign up or log in to save this to your schedule, view media, leave feedback and see who's attending!

Arpit Shrivastava, Meta, Product Leader

Problem: The challenge at the heart of this presentation is the efficient training of AI models in scenarios where real-world data is limited, sensitive, or expensive to acquire. This issue is particularly pressing in fields such as autonomous vehicle development and medical research, where the quality and diversity of training data directly influence the performance and reliability of AI systems. Addressing this problem is crucial for advancing AI capabilities while ensuring ethical standards and privacy are upheld.

Methodology: To tackle this challenge, our approach involves the creation and use of synthetic data. The methodology encompasses techniques for generating high-fidelity, diverse synthetic datasets that mimic real-world complexities without compromising privacy or incurring high costs. Key techniques include Generative Adversarial Networks (GANs), simulation-based synthesis, and rule-based data generation. The presentation will detail these methods, along with strategies for validating the realism and utility of synthetic data in training robust AI models.

Conclusions: Preliminary results demonstrate that synthetic data can significantly enhance AI model training, especially in constrained environments. By leveraging synthetic datasets, we've observed improvements in model accuracy, robustness, and generalizability across several applications. The presentation will outline these findings, showcasing examples where synthetic data has successfully bridged the gap between the data needs of AI systems and the limitations of real-world datasets.

Implications for Practitioners and Business: The insights from this work are highly relevant to AI practitioners, data scientists, and business leaders seeking innovative solutions to data scarcity and privacy concerns. The application of synthetic data opens new avenues for AI development across various sectors, offering a cost-effective, scalable, and privacy-compliant approach to training sophisticated AI models. This presentation will provide attendees with practical knowledge on leveraging synthetic data, encouraging its adoption in business strategies to drive AI innovation and competitive advantage.

Speakers
avatar for Arpit Shrivastava

Arpit Shrivastava

Product Leader, Meta
Customer Obsessed Product Leader with a proven track record at tech giants like Meta and Cisco Systems, where I've led the charge in product innovation and managed multi-billion dollar portfolios. My expertise lies in driving Machine Learning-focused product strategies and spearheading... Read More →


Thursday May 30, 2024 11:00am - 11:25am PDT
AI DevSummit Main Stage
Feedback form isn't open yet.