<?xml version="1.0" encoding="utf-8" standalone="yes"?><rss version="2.0" xmlns:atom="http://www.w3.org/2005/Atom" xmlns:content="http://purl.org/rss/1.0/modules/content/"><channel><title>Jatin Bansal — Backend, Distributed Systems &amp; AI Engineering on Jatin Bansal</title><link>https://jatinbansal.com/</link><description>Recent content in Jatin Bansal — Backend, Distributed Systems &amp; AI Engineering on Jatin Bansal</description><generator>Hugo</generator><language>en-us</language><lastBuildDate>Sat, 16 May 2026 00:00:00 +0000</lastBuildDate><atom:link href="https://jatinbansal.com/index.xml" rel="self" type="application/rss+xml"/><item><title>Query Transformations: Rewriting, HyDE, and Multi-Query</title><link>https://jatinbansal.com/ai-engineering/query-transformations/</link><pubDate>Sat, 16 May 2026 00:00:00 +0000</pubDate><guid>https://jatinbansal.com/ai-engineering/query-transformations/</guid><description>The query-side preprocessing layer for RAG: how rewriting, HyDE, multi-query, decomposition, and step-back prompting trade cost for recall.</description></item><item><title>Reranking: Cross-Encoders and Cascades</title><link>https://jatinbansal.com/ai-engineering/reranking/</link><pubDate>Thu, 14 May 2026 00:00:00 +0000</pubDate><guid>https://jatinbansal.com/ai-engineering/reranking/</guid><description>Why cross-encoders dominate the precision stage of retrieval, when a reranker pays off, and how to compose cascades that respect the latency budget.</description></item><item><title>Hybrid Search: BM25 Meets Dense Vectors</title><link>https://jatinbansal.com/ai-engineering/hybrid-search/</link><pubDate>Wed, 13 May 2026 00:00:00 +0000</pubDate><guid>https://jatinbansal.com/ai-engineering/hybrid-search/</guid><description>Why dense retrieval misses rare terms and exact matches, how BM25 and embeddings fuse via RRF, and the hybrid patterns that ship in production.</description></item><item><title>Chunking Strategies for Retrieval</title><link>https://jatinbansal.com/ai-engineering/chunking-strategies/</link><pubDate>Tue, 12 May 2026 00:00:00 +0000</pubDate><guid>https://jatinbansal.com/ai-engineering/chunking-strategies/</guid><description>Why chunk size is RAG&amp;#39;s most undertuned variable, how recursive, semantic, and structural chunking differ, and when parent-document retrieval wins.</description></item><item><title>LLM Inference: Tokens, Context, and Sampling</title><link>https://jatinbansal.com/ai-engineering/llm-inference-fundamentals/</link><pubDate>Mon, 11 May 2026 00:00:00 +0000</pubDate><guid>https://jatinbansal.com/ai-engineering/llm-inference-fundamentals/</guid><description>How LLMs process text: BPE tokenization, the context window as working memory, KV caching, and sampling parameters that shape output variance.</description></item><item><title>Text Embeddings: Turning Meaning into Geometry</title><link>https://jatinbansal.com/ai-engineering/text-embeddings/</link><pubDate>Mon, 11 May 2026 00:00:00 +0000</pubDate><guid>https://jatinbansal.com/ai-engineering/text-embeddings/</guid><description>How embedding models encode text as dense vectors, why cosine similarity captures meaning, and how to build semantic search in Python and TypeScript.</description></item><item><title>Vector Databases &amp; ANN Indexes</title><link>https://jatinbansal.com/ai-engineering/vector-databases-ann/</link><pubDate>Mon, 11 May 2026 00:00:00 +0000</pubDate><guid>https://jatinbansal.com/ai-engineering/vector-databases-ann/</guid><description>How HNSW, IVF, and ScaNN trade recall for speed, why exact KNN doesn&amp;#39;t scale, and how to pick between pgvector, Qdrant, and Pinecone in production.</description></item><item><title>Writing Event Loops with Java Virtual Threads</title><link>https://jatinbansal.com/blog/event-loops-with-java-virtual-threads/</link><pubDate>Fri, 01 May 2026 00:00:00 +0000</pubDate><guid>https://jatinbansal.com/blog/event-loops-with-java-virtual-threads/</guid><description>A practical guide to writing small event loops in Java 21 and Java 25 using virtual threads, blocking queues, direct control flow, and graceful shutdown.</description></item><item><title>Context vs Prompt Engineering: The Evolution from Instructions to Intelligence</title><link>https://jatinbansal.com/blog/context-vs-prompt-engineering/</link><pubDate>Sun, 31 Aug 2025 00:00:00 +0000</pubDate><guid>https://jatinbansal.com/blog/context-vs-prompt-engineering/</guid><description>Exploring the shift from prompt engineering to context engineering in AI systems, understanding context rot, and why managing context is becoming more critical than crafting prompts.</description></item><item><title>Claude Code Commands Reference</title><link>https://jatinbansal.com/notes/claude-code-commands/</link><pubDate>Thu, 10 Jul 2025 00:00:00 +0000</pubDate><guid>https://jatinbansal.com/notes/claude-code-commands/</guid><description>Comprehensive guide to Claude Code CLI commands including Docker MCP Gateway, authentication, scripting, advanced server management, and troubleshooting for efficient AI-powered development workflows</description></item><item><title>Deep Work</title><link>https://jatinbansal.com/notes/deep-work/</link><pubDate>Sun, 06 Jul 2025 00:00:00 +0000</pubDate><guid>https://jatinbansal.com/notes/deep-work/</guid><description>Deep work is a state of peak, distraction-free concentration that enables you to learn difficult things and produce high-quality work quickly.</description></item><item><title>✨ Neural Net, LLM, AI Learning Resources</title><link>https://jatinbansal.com/notes/ai-llm-reading/</link><pubDate>Sat, 05 Jul 2025 00:00:00 +0000</pubDate><guid>https://jatinbansal.com/notes/ai-llm-reading/</guid><description>A curated collection of resources that I find useful for learning about neural networks, LLMs, and AI development.</description></item><item><title>StampedLock: How to Use Locks with Near Lock-Free Reads in Java</title><link>https://jatinbansal.com/blog/stamped-lock/</link><pubDate>Sat, 05 Jul 2025 00:00:00 +0000</pubDate><guid>https://jatinbansal.com/blog/stamped-lock/</guid><description>Learn how Java&amp;rsquo;s StampedLock enables near lock-free reads with optimistic locking, why it&amp;rsquo;s useful for virtual threads and read-heavy workloads, and how to use it safely.</description></item><item><title>Scaling PostgreSQL Databases with Spring Boot: A Journey into Application-Level Sharding</title><link>https://jatinbansal.com/blog/scaling-postgres/</link><pubDate>Tue, 05 Sep 2023 00:00:00 +0000</pubDate><guid>https://jatinbansal.com/blog/scaling-postgres/</guid><description>Learn how we scaled PostgreSQL to handle millions of inserts per hour using application-level sharding with Spring Boot, combining table partitioning and host-level sharding for robust performance.</description></item></channel></rss>