README

Hi, I’m Almog - an LLM engineering expert and a startup entrepreneur passionate about bringing AI innovation to
life. For the past couple years, I’ve been:
- Author of The LLM Triangle Principle - a framework for building reliable LLM applications in production.
- Helped to build and deploy dozens of LLM apps from idea to production as a hands-on consultant.
- Founder of GenAI Israel - the largest GenAI community for practitioners (Over 5000+
engineers, CTOs, researchers, and data scientists)
- Serial tech entrepreneur; ex-AI infrastructure founder with extensive cloud-native experience (Kubernetes
maintainer)
I love turning LLM dreams into reality - whether it’s high-level strategy or diving deep into code. Let’s build
something amazing with AI!
Areas of Expertise
- 🧠 Advanced LLM Application Development and Strategy
- 🏗️ Production-Ready AI/ML Infrastructure Architecture
- 🌟 Scalable Generative AI Systems Implementation
- ☁️ Cloud-Native AI Solutions Optimized for Performance
- 🚀 Large-Scale AI Engineering for Real-World Impact
- 🖥️ Startup and entrepreneurial experience in AI and cloud technologies
Publications
- The LLM Triangle Principle: Software Design Principles for Reliable LLM Apps
An innovative approach to designing robust LLM-based applications for real-world use, derived from hands-on project
Software design principles for thoughtfully designing reliable, high-performing LLM applications. A framework to
bridge the gap between potential and production-grade performance.
- Building LLM Apps: A Step-By-Step Guide
A comprehensive guide to LLM application development, from experimentation to production, based on personal
implementation experience.
- 8 Practical Prompt Engineering Tips for Better LLM Apps
Essential tips for effective prompt engineering in LLM applications, based on direct implementation experience.
- Effective AI Infrastructure Explained
Exploring modern AI infrastructure and its impact on the ML lifecycle, informed by hands-on project work and cloud
native expertise.
- Talks
From time to time, I give talks on various meetups, podcasts and conferences. You can find some of them on my
LinkedIn profile. Make sure to follow me to get updates on upcoming talks.
As Seen On
I’ve been featured in various podcasts, meetups, and conferences. If you’re interested in having me as a guest speaker
or panelist, please reach out via Email.
Recent appearances include:
- AI Dev TLV ‘24 - Talk (Hebrew) / The LLM Triangle: Engineering
Principles for Robust AI Applications - a talk I gave at the AI Dev TLV ‘24 conference about the LLM Triangle
principles and how to architect reliable AI apps in a production-grade manner.
- LangTalk E35 (Hebrew) / LLM
Applications Developer Guide - An end to end guide on how to get started and deploy to production your llm app
- Making Software (Osim Tochna) E165 (Hebrew) / From a PoC to a
product - the hidden challenges of deploying LLM applications
- **AI In Production Conference
** - Talk (
English) / How to Build LLM-native Apps with The LLM Triangle Blueprint - a talk I gave at the AI In Production
Conference about the LLM Triangle principles and how to architect reliable AI apps in a production-grade manner.
- The MLOps Podcast /🫣 Is Data Science a dying job? (
English) - About Kubernetes, Large Language Models (LLMs), how to get them into production, and how data is becoming a
more central piece of the ML landscape.
- AI Infra Stories (English) - A podcast about AI
infrastructure, where I hosted world-class AI infrastructure experts to discuss the latest trends and challenges in AI
infrastructure.
- And many more… 🚀
Open Source Contributions
I’ve been an active contributor to open source projects for over 15 years, regularly participating in various projects.
My contributions range from creating new tools to maintaining major projects, or just sending PRs for bugs 🙃
Notable contributions:
- Creator of Raptor.ml: An AI infrastructure project that helps to build and
deploy AI to production - the gap between data science and engineering.
- Author of openai-streaming: A Python library simplifying
interactions with LLM Streaming API, including for tool using purposes.
- Kubernetes Maintainer: Active contributor since 2016, focusing on cloud-native big data
solutions and Kubernetes Native architectures.
- pytest-evals: A pytest plugin for running and analyzing LLM
evaluation tests.
- LLM Playground: An interface to play/compare different LLM models
directly from your browser.
- Various Contributions: Ongoing involvement in multiple open source projects, consistently pushing for advancements
in technology and knowledge sharing.
Connect with me on LinkedIn, GitHub, or via
Email to discuss how we can collaborate.