Almog Baku - LLM engineering and entrepreneur

README

Hi, I’m Almog - an LLM engineering expert and a startup entrepreneur passionate about bringing AI innovation to life. For the past couple years, I’ve been:

Author of The LLM Triangle Principle - a framework for building reliable LLM applications in production.
Helped to build and deploy dozens of LLM apps from idea to production as a hands-on consultant.
Founder of GenAI Israel - the largest GenAI community for practitioners (Over 5000+ engineers, CTOs, researchers, and data scientists)
Serial tech entrepreneur; ex-AI infrastructure founder with extensive cloud-native experience (Kubernetes maintainer)

I love turning LLM dreams into reality - whether it’s high-level strategy or diving deep into code. Let’s build something amazing with AI!

Areas of Expertise

🧠 Advanced LLM Application Development and Strategy
🏗️ Production-Ready AI/ML Infrastructure Architecture
🌟 Scalable Generative AI Systems Implementation
☁️ Cloud-Native AI Solutions Optimized for Performance
🚀 Large-Scale AI Engineering for Real-World Impact
🖥️ Startup and entrepreneurial experience in AI and cloud technologies

Publications

The LLM Triangle Principle: Software Design Principles for Reliable LLM Apps
An innovative approach to designing robust LLM-based applications for real-world use, derived from hands-on project Software design principles for thoughtfully designing reliable, high-performing LLM applications. A framework to bridge the gap between potential and production-grade performance.
Building LLM Apps: A Step-By-Step Guide
A comprehensive guide to LLM application development, from experimentation to production, based on personal implementation experience.
8 Practical Prompt Engineering Tips for Better LLM Apps
Essential tips for effective prompt engineering in LLM applications, based on direct implementation experience.
Effective AI Infrastructure Explained
Exploring modern AI infrastructure and its impact on the ML lifecycle, informed by hands-on project work and cloud native expertise.
Talks
From time to time, I give talks on various meetups, podcasts and conferences. You can find some of them on my LinkedIn profile. Make sure to follow me to get updates on upcoming talks.

As Seen On

I’ve been featured in various podcasts, meetups, and conferences. If you’re interested in having me as a guest speaker or panelist, please reach out via Email.

Recent appearances include:

AI Engineer Summit (Online track) - Talk (English) / The LLM Triangle: Engineering Principles for Robust AI Applications - a talk I gave at the AI Engineer Summit about the LLM Triangle principles and how to architect reliable AI apps in a production-grade manner.
AI Dev TLV ‘24 - Talk (Hebrew) / The LLM Triangle: Engineering Principles for Robust AI Applications - a talk I gave at the AI Dev TLV ‘24 conference about the LLM Triangle principles and how to architect reliable AI apps in a production-grade manner.
LangTalk E35 (Hebrew) / LLM Applications Developer Guide - An end to end guide on how to get started and deploy to production your llm app
Making Software (Osim Tochna) E165 (Hebrew) / From a PoC to a product - the hidden challenges of deploying LLM applications
**AI In Production Conference ** - Talk ( English) / How to Build LLM-native Apps with The LLM Triangle Blueprint - a talk I gave at the AI In Production Conference about the LLM Triangle principles and how to architect reliable AI apps in a production-grade manner.
The MLOps Podcast /🫣 Is Data Science a dying job? ( English) - About Kubernetes, Large Language Models (LLMs), how to get them into production, and how data is becoming a more central piece of the ML landscape.
AI Infra Stories (English) - A podcast about AI infrastructure, where I hosted world-class AI infrastructure experts to discuss the latest trends and challenges in AI infrastructure.
And many more… 🚀

Open Source Contributions

I’ve been an active contributor to open source projects for over 15 years, regularly participating in various projects. My contributions range from creating new tools to maintaining major projects, or just sending PRs for bugs 🙃

Notable contributions:

Creator of Raptor.ml: An AI infrastructure project that helps to build and deploy AI to production - the gap between data science and engineering.
Author of openai-streaming: A Python library simplifying interactions with LLM Streaming API, including for tool using purposes.
Kubernetes Maintainer: Active contributor since 2016, focusing on cloud-native big data solutions and Kubernetes Native architectures.
pytest-evals: A pytest plugin for running and analyzing LLM evaluation tests.
LLM Playground: An interface to play/compare different LLM models directly from your browser.
Various Contributions: Ongoing involvement in multiple open source projects, consistently pushing for advancements in technology and knowledge sharing.

Get in Touch

Connect with me on LinkedIn, GitHub, or via Email, follow me on X for updates, or simply self-service schedule a meeting with me during my Office Hours.

Office Hours

I offer free Office Hours to assist engineers, entrepreneurs, and investors with AI and LLM strategies. Let’s connect to discuss your challenges and opportunities.