Emerging Stack for LLM-powered Products
What does it take to go from PoC to Production
Kuldeep Yadav, PhD
https://www.linkedin.com/in/kyadav/
Real World
AI Bootcamp
Agenda
Why should you think about LLM stack?
Lifecycle of a LLM product
Design patterns to take LLM products from PoC to Production
Key libraries/platforms that you need
Infrastructure/tooling that are yet to be built
Real World
AI Bootcamp
What is the most important ingredient of a successful LLM-powered product?
Slido: #3067587
Real World
AI Bootcamp
Few years back …
Now…
Everyone has access to same model
How do you differentiate?
Real World
AI Bootcamp
Why there is a different stack for LLMs vs ML?
Real World
AI Bootcamp
Typical ML Pipeline
Weeks
Weeks
Weeks
Infrequent ( Months)
Secret Sauce
Real World
AI Bootcamp
LLM Lifecycle
Days
Days
Days
Very Frequent
What is the secret sauce?
Real World
AI Bootcamp
User Expectations
Ref: https://www.sh-reya.com/blog/ai-engineering-short/
Real World
AI Bootcamp
Typical Development Lifecycle with LLMs
Scope
Does your product/feature really need LLMs?
Build
Iterate to design, develop, and test the product
Deploy
Deploy it for real users; system reliability; plans to scale
Monitor
Monitor the performance, availability closely
Iterate to improve
To ask the right questions is already half the solution of a problem – Carl Jung
Real World
AI Bootcamp
Scope: Key Question to #ASK
Scope
“Testing the Waters”
Does this task/feature/product need LLMs?
What are business metrics that it will impact?
Where does this fit in user journey?
How will I present the output to the users?
Real World
AI Bootcamp
Scope: Things to Do
Real World
AI Bootcamp
What are agents really?
Real World
AI Bootcamp
Scope: Tools to Use
ChainForge (Open-source, easy to install)
Athina IDE (SaaS, easy to try)
Several other tools: LangChain, Jupyter Notebooks, ChatGPT, and so on….
Real World
AI Bootcamp
What are the common mistakes that people make in scoping?
Slido: #3067587
Real World
AI Bootcamp
Build: Making it real
Real World
AI Bootcamp
Build: Benchmarking Datasets
LabelStudio (Open-source, easy to use templates)
Real World
AI Bootcamp
Build: Workflow Orchestration
https://www.techtarget.com/searchenterpriseai/definition/LangChain
Real World
AI Bootcamp
Build: Workflow Orchestrators
Real World
AI Bootcamp
Build: Prompting
Social Media
LangChain, PromptHub, Athina
DsPy, Adelflow
Real World
AI Bootcamp
Build: Testing/Evaluating LLM Products
Real World
AI Bootcamp
Build: Investing in create a data fly-wheel
https://www.sh-reya.com/blog/ai-engineering-flywheel/
Continuous Development
Continuous Integration
Continuous Improvement
Real World
AI Bootcamp
Build: Let's take a deeper dive in Evaluation
Evaluation has the most action in LLM Tooling Space; 8-10 good SaaS providers including LangSmith, Galileo, BrainTrust, Arize, etc
Unit Tests
Model Evaluations
Human Evaluations
A/B Testing
Real World
AI Bootcamp
Methodological evaluation is a major difference between mediocre and great apps!
Real World
AI Bootcamp
Deployment and Monitoring�
Key Tools: PortKey, Galileo, Redis, GPTCache, Helicone
Real World
AI Bootcamp
Design Patterns of Successful Builders
Make data your friend
Focus deeply on your user, workflow, domain, and the use-case
Evaluation should be omni-present in your entire lifecycle
Iterate quickly
Observability
Tools and Stack
Real World
AI Bootcamp
Thank You!
Real World
AI Bootcamp