GUIDE

Hugging Face & Local LLMs

Practical tutorials for Transformers, local models, embeddings, GGUF conversion, and open-source LLM workflows.

Ruslan Magana Vsevolodovna
Ruslan Magana Vsevolodovna
Machine Learning Engineer · Data Scientist · Physicist
Genoa, Italy

Latest tutorials

FAQ

What is Hugging Face used for?

Hugging Face provides models, datasets, libraries, and tools for building and deploying machine learning and LLM workflows.

Can I run Hugging Face models locally?

Yes. Many models can be downloaded and run locally with Transformers, llama.cpp, Ollama, Gradio, or FastAPI depending on the format and hardware.

What is GGUF and why is it useful?

GGUF is a model format commonly used with llama.cpp and local inference tools. It helps run quantized models efficiently on local machines.