Mastering the Art of Productionizing GenAI Solutions

S1 E012
|
DEVOPS
Jan 4, 2024

About Speaker

Sreedhar Gade

Vice President, Engineering, Freshworks

Sreedhar, the VP of Engineering at Freshworks, holds an impressive career spanning nearly two decades in leading technology, financial, and internet organizations like Yahoo! Inc and Microsoft. As a recognized industry leader, entrepreneur, speaker, and author, he's known for his insights on emerging trends and leadership. Sreedhar actively shares his expertise through blogs, social channels, and conferences, focusing on career guidance and leadership. He collaborates with global industry leaders, emphasizing rapid career growth strategies. He believes in democratizing success principles, ensuring equal access to information, execution plans, and mentorship for all, regardless of location or financial status.

Sreedhar, the VP of Engineering at Freshworks, holds an impressive career spanning nearly two decades in leading technology, financial, and internet organizations like Yahoo! Inc and Microsoft. As a recognized industry leader, entrepreneur, speaker, and author, he's known for his insights on emerging trends and leadership. Sreedhar actively shares his expertise through blogs, social channels, and conferences, focusing on career guidance and leadership. He collaborates with global industry leaders, emphasizing rapid career growth strategies. He believes in democratizing success principles, ensuring equal access to information, execution plans, and mentorship for all, regardless of location or financial status.

Sreedhar, the VP of Engineering at Freshworks, holds an impressive career spanning nearly two decades in leading technology, financial, and internet organizations like Yahoo! Inc and Microsoft. As a recognized industry leader, entrepreneur, speaker, and author, he's known for his insights on emerging trends and leadership. Sreedhar actively shares his expertise through blogs, social channels, and conferences, focusing on career guidance and leadership. He collaborates with global industry leaders, emphasizing rapid career growth strategies. He believes in democratizing success principles, ensuring equal access to information, execution plans, and mentorship for all, regardless of location or financial status.

About Host

Sathya Narayanan Nagarajan

Co-founder and CTO, Amnic

Sathya is an experienced technologist with over two decades in Artificial Intelligence (AI), Electric Vehicles (EV), and Distributed Systems. As the Co-founder and CTO of Amnic, he drives the development of a cloud Intelligence Platform, emphasizing efficiency, cost reduction, and reliability. Sathya's leadership spans roles at Ola Electric Mobility, Ola Cabs, Yahoo and many internet companies. With 11 patents in AI, EV, and Distributed Systems, he is committed to knowledge sharing and guiding industry thought leaders.

Summary of Podcast

In the podcast "Mastering the Art of Productionizing GenAI Solutions," Sheedhar Gade, Vice President at Freshworks, discusses the evolution of computing technology over the past two decades, focusing on the recent transition from cloud computing to generative AI (GenAI). Startups and small businesses are currently focusing on use case development and some model tuning, while mid-sized companies are performing industry-specific use case development and model tuning. Enterprise companies, however, are building proprietary models and investing in large data centers and GPU inventory for long-term benefits. This podcast explores the importance of carefully evaluating models before taking them to production, focusing on factors such as functionality, cost, performance, and accuracy. There are challenges in choosing the right model and cloud provider, and potential token usage to increase exponentially as users and user-generated content grow. Scaling up GenAI solutions and integrating them with existing systems pose further challenges, necessitating collaboration with cloud partners and centralizing the platform as a broker between traditional engineering systems and the cloud. Sreedhar emphasize the importance of continuous monitoring and improvement in GenAI solutions, as models or use cases can drift from good ROI to lower ROI. They also address the use of GenAI models in Edge Computing, the resource requirements for adopting GenAI via chatbot for Q&A and support, and the comparison between model-centric and data-centric AI. The reliance on Nvidia's dominance in required hardware for advanced AI solutions is highlighted, with Microsoft and AWS attempting to challenge this through their chipsets. The first episode of the series concludes by expressing appreciation for audience insights and announcing plans to explore scaling in subsequent episodes.

In the podcast "Mastering the Art of Productionizing GenAI Solutions," Sheedhar Gade, Vice President at Freshworks, discusses the evolution of computing technology over the past two decades, focusing on the recent transition from cloud computing to generative AI (GenAI). Startups and small businesses are currently focusing on use case development and some model tuning, while mid-sized companies are performing industry-specific use case development and model tuning. Enterprise companies, however, are building proprietary models and investing in large data centers and GPU inventory for long-term benefits. This podcast explores the importance of carefully evaluating models before taking them to production, focusing on factors such as functionality, cost, performance, and accuracy. There are challenges in choosing the right model and cloud provider, and potential token usage to increase exponentially as users and user-generated content grow. Scaling up GenAI solutions and integrating them with existing systems pose further challenges, necessitating collaboration with cloud partners and centralizing the platform as a broker between traditional engineering systems and the cloud. Sreedhar emphasize the importance of continuous monitoring and improvement in GenAI solutions, as models or use cases can drift from good ROI to lower ROI. They also address the use of GenAI models in Edge Computing, the resource requirements for adopting GenAI via chatbot for Q&A and support, and the comparison between model-centric and data-centric AI. The reliance on Nvidia's dominance in required hardware for advanced AI solutions is highlighted, with Microsoft and AWS attempting to challenge this through their chipsets. The first episode of the series concludes by expressing appreciation for audience insights and announcing plans to explore scaling in subsequent episodes.

In the podcast "Mastering the Art of Productionizing GenAI Solutions," Sheedhar Gade, Vice President at Freshworks, discusses the evolution of computing technology over the past two decades, focusing on the recent transition from cloud computing to generative AI (GenAI). Startups and small businesses are currently focusing on use case development and some model tuning, while mid-sized companies are performing industry-specific use case development and model tuning. Enterprise companies, however, are building proprietary models and investing in large data centers and GPU inventory for long-term benefits. This podcast explores the importance of carefully evaluating models before taking them to production, focusing on factors such as functionality, cost, performance, and accuracy. There are challenges in choosing the right model and cloud provider, and potential token usage to increase exponentially as users and user-generated content grow. Scaling up GenAI solutions and integrating them with existing systems pose further challenges, necessitating collaboration with cloud partners and centralizing the platform as a broker between traditional engineering systems and the cloud. Sreedhar emphasize the importance of continuous monitoring and improvement in GenAI solutions, as models or use cases can drift from good ROI to lower ROI. They also address the use of GenAI models in Edge Computing, the resource requirements for adopting GenAI via chatbot for Q&A and support, and the comparison between model-centric and data-centric AI. The reliance on Nvidia's dominance in required hardware for advanced AI solutions is highlighted, with Microsoft and AWS attempting to challenge this through their chipsets. The first episode of the series concludes by expressing appreciation for audience insights and announcing plans to explore scaling in subsequent episodes.

About Amnic

Amnic is a cloud cost observability platform, helping businesses measure and rightsize their cloud costs. Amnic helps businesses visualize, analyze and optimize their cloud spends, in turn building a lean cloud infrastructure. Amnic offers out of the box solutions that help breakdown cloud bills and provide greater visibility and understanding into cloud costs along with recommendations to lower spends, alerts and anomaly detection.

Amnic delivers a wide range of features including K8s visibility, cost analyzer, alerts and custom reporting, budgeting, forecasting and smart tagging. DevOps and SRE teams rely on Amnic to deliver a simplified view into their cloud costs, allowing them to maintain governance and build a culture of cost optimization. Setup in 5-minutes and get 30-days of free trial.

Visit www.amnic.com to get started.

MORE EPISODES

Melissa Parker

S2 E005

Switching lanes and finding a calling in DevOps with Melissa Parker

Sep 20, 2024

Satender Patwal

S2 E004

Unlocking key learnings in DevOps and SRE roles

Sep 6, 2024

Kelly Looney

S2 E003

DevOps through the decades with Kelly Looney

Aug 9, 2024

Smita Ojha

S2 E002

Bringing best practices from large corporations into younger companies

Jun 26, 2024

Aankur Khare

S2 E001

Five principles to align your engineering organization with frugal excellence

May 30, 2024

Vishwanath Kolki

S1 E019

Empowering Your Cloud Cost Optimization Journey

Apr 18, 2024

Avinash Kumar Gupta

S1 E018

Optimizing Costs : A Platform Engineers Playbook

Apr 11, 2024

Subhadeep Guin

S1 E017

Unleashing the Full Potential of IDP

Feb 22, 2024

Turja N Chaudhuri

S1 E016

GenAI Mastery: Crafting the Art of Productization

Feb 8, 2024

Sreedhar Gade

S1 E015

Unleashing Governance in GenAl

Jan 25, 2024

Namrata Dixit

S1 E014

GenAI in DevOps: Transforming Developer Culture

Jan 20, 2024

Sreedhar Gade

S1 E013

Scaling GenAI: Production Challenges Unveiled

Jan 11, 2024

S1 E012

Mastering the Art of Productionizing GenAI Solutions

Jan 4, 2024

Sathish K S

S1 E011

Building a resilient data platform

Dec 22, 2023

Suresh Kumar Khemka

S1 E010

Designing Resilient Systems

Dec 15, 2023

Manoj Kumar

S1 E009

Software Engineering with DevSecOps Excellence

Dec 8, 2023

Dr. Maneesha Asundi

S1 E008

Demystifying the Journey from Code to Cloud to FinOps

Nov 21, 2023

Shirsha Ray Chaudhuri

S1 E007

Decoding Gen AI From Prototype to Production

Nov 10, 2023

Gopal Sharma

S1 E006

Cloud Reliability Blueprint for a trustworthy infrastructure

Oct 20, 2023

Sunil Kumar

S1 E005

From Start-Up to Scale-Up

Oct 14, 2023

Suyog Shetty

S1 E004

Optimizing Google Cloud Spend like a Pro

Oct 4, 2023

Abhishek Gaurav

S1 E003

Balancing On-Premises and Cloud

Sep 24, 2023

Nilesh Gule

S1 E002

Mastering Cloud Reliability: Best Practices

Sep 9, 2023

Rajdeep Bhattacharya

S1 E001

Tech Evolution: Challenges to DevOps Excellence

Aug 31, 2023

Build a culture of cloud cost optimization

Build a culture of

cloud cost observability

Build a culture of

cloud cost observability

Build a culture of

cloud cost observability