RAG on a Budget: Building a Knowledge Base with Ollama & ChromaDB
Learn how to build a local RAG system using Ollama and ChromaDB for free. Step-by-step guide with Docker Compose, Python code, chunking strategies, and real-world examples.
All the articles with the tag "ai".
Learn how to build a local RAG system using Ollama and ChromaDB for free. Step-by-step guide with Docker Compose, Python code, chunking strategies, and real-world examples.
Compare Stable Diffusion (A1111 & Forge), ComfyUI, and Fooocus for local AI image generation. GPU requirements, Docker setups, workflows, and beginner picks explained.
Compare Text Generation Web UI and KoboldCpp for local LLM inference. Covers setup, model formats, APIs, samplers, performance, and which tool fits your workflow best.
Confused by AI agent frameworks? Compare LangGraph, CrewAI, and AutoGen with real Python examples, a no-nonsense breakdown, and zero hype. Pick the right one.
Open WebUI vs LibreChat: two self-hosted ChatGPT alternatives compared. We cover setup, Ollama integration, multi-user support, RAG, plugins, and which one fits you.
Learn LLM fine-tuning with LoRA and QLoRA on a consumer GPU. Practical guide covering dataset prep, Hugging Face, Unsloth, VRAM needs, and common pitfalls.
Master Ollama with Modelfiles, GPU tuning, API usage, and performance tricks. Stop running 70B models on 8GB VRAM and wondering why everything is slow.
Run OpenAI Whisper or Faster-Whisper locally with Docker. Better privacy, zero API costs, and surprisingly good accuracy — even on a potato CPU.
Compare Continue.dev, Cody, and Tabby — three self-hosted AI code assistants that keep your code private, cost nothing per token, and work offline.
CUDA vs ROCm for AI on Linux: NVIDIA's easy path, AMD's emotional journey, and why CPU inference isn't dead yet. Real Docker setups included.
Flowise vs Langflow compared: self-hosted, Docker-ready visual LLM workflow builders. Build no-code AI pipelines, RAG chatbots, and more — without losing your mind.
Temperature, top-p, top-k, context length — LLM inference parameters explained so you stop guessing why the model gives weird output.