Practical guides for working with Hugging Face Transformers and running large language models locally.

Models & storage

Run LLMs locally

Build models from scratch

Continue with Generative AI, AI Agents, and the daily AI Rankings of top AI repositories, papers, and packages.

FAQ

How do I stop Hugging Face from filling up my default drive? Point the cache elsewhere with cache_dir / HF_HOME — see downloading models to a custom cache folder.

How do I run a Hugging Face model offline / locally? Convert it to GGUF in Google Colab, then run it locally, or use a local Gradio app for FLAN-T5 / GPT.