DeveloperWeek Management 2024 + AI DevSummit 2024 (+ DW...
Thursday, June 6 • 10:00am - 10:25am
[Virtual] OPEN TALK (Europe/Latin America): AI-powered Data Observability in Data Engineering

Anandaganesh Balakrishnan, American Water, Principal Software Engineer

AI-powered data observability is a transformative approach in data engineering that focuses on monitoring, managing, and understanding the health of an organization's data. It uses artificial intelligence (AI) and machine learning (ML) algorithms to automate issue detection and diagnosis, helping ensure data quality, reliability, and trustworthiness. Essential aspects of this integration include Automated Anomaly Detection, Predictive Analytics, Root Cause Analysis, Data Quality Scoring, and Real-time Monitoring. Together, these features identify and promptly address data discrepancies, analyze historical data patterns to predict future issues, and evaluate data quality across various dimensions, so that problems can be handled as they arise.
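As a rough illustration of Data Quality Scoring across several dimensions, the sketch below scores a batch of records on completeness, uniqueness, and freshness, then combines them into a weighted average. This is not the speaker's method: the function name `quality_score`, the three dimensions chosen, the equal default weights, and the sample records are all assumptions made for the example.

```python
from datetime import datetime, timedelta, timezone


def quality_score(rows, required, ts_field, max_age, now, weights=None):
    """Return per-dimension scores in [0, 1] plus a weighted overall score.

    Hypothetical helper for illustration; the dimensions and weights are assumptions.
    """
    if not rows:
        raise ValueError("rows must not be empty")
    weights = weights or {"completeness": 1.0, "uniqueness": 1.0, "freshness": 1.0}
    n = len(rows)

    # Completeness: share of rows where every required field is present and non-empty.
    complete = sum(
        all(r.get(f) not in (None, "") for f in required) for r in rows
    )
    # Uniqueness: share of distinct rows when compared on the required fields.
    distinct = len({tuple(r.get(f) for f in required) for r in rows})
    # Freshness: share of rows whose timestamp is no older than max_age.
    fresh = sum(
        1 for r in rows
        if r.get(ts_field) is not None and now - r[ts_field] <= max_age
    )

    scores = {
        "completeness": complete / n,
        "uniqueness": distinct / n,
        "freshness": fresh / n,
    }
    total_weight = sum(weights[k] for k in scores)
    overall = sum(scores[k] * weights[k] for k in scores) / total_weight
    return {**scores, "overall": overall}


# Made-up sample batch: one incomplete duplicate pair and one stale record.
now = datetime(2024, 6, 6, tzinfo=timezone.utc)
sample_rows = [
    {"id": 1, "value": 10.0, "ts": now - timedelta(hours=1)},
    {"id": 2, "value": None, "ts": now - timedelta(hours=2)},
    {"id": 2, "value": None, "ts": now - timedelta(days=3)},
    {"id": 4, "value": 7.5, "ts": now - timedelta(minutes=30)},
]
sample_scores = quality_score(
    sample_rows, required=["id", "value"], ts_field="ts",
    max_age=timedelta(days=1), now=now,
)
```

A single overall number is convenient for dashboards and alert thresholds, but the per-dimension scores are usually what point to the kind of problem involved, so the sketch returns both.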

Adopting AI in data observability yields significant benefits such as increased operational efficiency, enhanced data quality, reduced system downtime, improved decision-making capabilities, and considerable cost savings. These advantages stem from reducing manual monitoring requirements, maintaining the high data quality that analytical processes depend on, resolving issues rapidly, and providing high-quality data to support strategic business decisions.

However, successfully implementing AI-powered data observability requires attention to several factors: integrating with existing data systems, customizing and tuning AI models for the specific data environment and business needs, and providing adequate training for teams. Given the growing complexity and pivotal role of data environments in business operations, AI's role in data observability is poised for expansion, promising innovative solutions for ensuring data integrity and enhancing business value.

Implementing AI-powered data observability in data engineering requires adherence to several best practices that make monitoring, diagnosis, and health assurance of data systems more effective. These practices aim to bolster data quality and operational efficiency and to achieve better business outcomes. Key strategies include:

- Setting clear objectives and measurable KPIs aligned with business goals.
- Monitoring the entire data ecosystem in real time.
- Leveraging advanced anomaly detection techniques through machine learning for precise issue identification.

Additionally, automating root cause analysis, ensuring the scalability and flexibility of the observability solution, and prioritizing data quality management are crucial. Encouraging cross-functional collaboration, addressing privacy and security concerns, and maintaining a continuous evaluation and improvement cycle are also vital. By embracing these practices, organizations can use AI-powered data observability to manage data proactively, minimize operational risks, and support informed decision-making based on high-quality data.
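To make the anomaly detection strategy listed above more concrete, here is a minimal sketch that flags unusual values in a pipeline metric, such as daily row counts. It uses a trailing-window z-score, a simple statistical stand-in for the machine learning techniques the abstract refers to, not the speaker's approach. The function name `detect_anomalies`, the window size, the threshold, and the sample counts are all assumptions made for illustration.

```python
from statistics import mean, stdev


def detect_anomalies(values, window=7, threshold=3.0):
    """Return indices whose value deviates from the trailing-window mean
    by more than `threshold` sample standard deviations.

    Hypothetical helper; window and threshold defaults are assumptions.
    """
    anomalies = []
    for i in range(window, len(values)):
        history = values[i - window:i]  # the `window` points before index i
        mu = mean(history)
        sigma = stdev(history)
        if sigma == 0:
            # Flat history: any change at all counts as a deviation.
            if values[i] != mu:
                anomalies.append(i)
            continue
        if abs(values[i] - mu) / sigma > threshold:
            anomalies.append(i)
    return anomalies


# Made-up daily row counts with one sudden drop at index 8.
row_counts = [1000, 1010, 990, 1005, 995, 1002, 998, 1001, 120, 1003]
flagged = detect_anomalies(row_counts)
```

Note that a flagged value still enters the trailing window for later points, which inflates the standard deviation and can hide follow-on anomalies. A production system would likely exclude flagged points from the history or use a model that is robust to outliers.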

Speakers
Anandaganesh Balakrishnan

Principal Software Engineer, American Water
Anandaganesh Balakrishnan has 15+ years of experience in data engineering, data virtualization, database development, infrastructure development, and data analytics. He has held leadership roles across diverse industries, including Banking, Trading, Biotech, Real Estate, and Utilities.


Thursday June 6, 2024 10:00am - 10:25am PDT
VIRTUAL DeveloperWeek Europe / Latin America Main Stage
  DW Europe/Latin America: Artificial Intelligence Innovation
  • Talk Type OPEN TALK
  • Track or Conference Artificial Intelligence Innovation, North America, DW Europe/Latin America, VIRTUAL, Developer Technology Innovation
  • In-Person/Virtual Virtual
  • about Anandaganesh Balakrishnan has 15+ years of experience in data engineering, data virtualization, database development, infrastructure development, and data analytics. He has held leadership roles across diverse industries, including Banking, Trading, Biotech, Real Estate, and Utilities. He currently leads the development and optimization of data virtualization infrastructure and data engineering strategies. He supports application developers, data product teams, database developers, data scientists, and other key stakeholders on data initiatives. He ensures optimal data delivery architecture by benchmarking the capabilities and performance of different tools. His current focus is AI on unstructured data, large language models, Generative AI, self-service data analytics, and data catalogs.