Back to Blogs

My Local LLM Journey

AIJun 4, 20265 min read

Like most people exploring AI today, I started with cloud-based tools.

They were convenient, powerful and easy to access. But over time, I became increasingly interested in a simple question.

What if I could run capable AI models entirely on my own machine?

Initially, the motivation was privacy. I wanted to experiment with personal notes, analytics data and workflows without sending information to external servers. What started as curiosity eventually led me down the path of local LLMs, Ollama and Mistral 7B.

Along the way, I discovered that running AI locally is not just about privacy. It is about ownership, flexibility and building workflows that fit your needs rather than someone else's platform.

Why I Started Exploring Local Models

Most AI tools today operate as services.

You send data to a remote model, receive a response and move on. For many use cases, that approach works perfectly well.

However, I found myself wanting more control over the entire process.

I wanted to experiment with personal datasets, automate analysis workflows and understand what was happening behind the scenes. Running models locally meant that I could work without worrying about rate limits, API costs or sensitive information leaving my machine.

Owning the model means owning the data it sees.

That idea became the foundation for many of my experiments.

Building an AI Analytics Assistant

One of the most interesting projects I built involved automating Google Analytics analysis.

Like many product and growth teams, I found myself regularly opening dashboards, checking metrics and trying to identify what had changed. The data was available, but extracting useful insights still required manual effort.

I wanted to see whether a local AI model could help automate this process.

The first step was building a pipeline that connected directly to Google Analytics 4 through its API. Rather than simply pulling raw metrics, I focused on identifying pages that were losing users, underperforming or showing growth opportunities.

Numbers alone rarely tell a useful story.

The challenge was converting analytics data into something an AI model could reason about.

Converting Data Into Context

After pulling data from GA4, I generated Python scripts that cleaned, structured and processed the information.

The output was transformed into a simplified JSON format that contained only the most relevant metrics and trends.

This turned out to be one of the most important lessons from the project.

Many people assume model quality is the primary factor behind useful AI outputs. In reality, the quality of the input often matters more.

A smaller model provided with clean, structured context frequently performs better than a larger model forced to interpret messy data.

By reducing noise and focusing on the most meaningful signals, I was able to create prompts that the model could process effectively.

Running Mistral 7B Locally With Ollama

For the AI layer, I used Ollama running Mistral 7B on my Linux machine.

I chose Mistral because it offered a good balance between performance and hardware requirements. It was capable enough to generate useful observations while remaining practical to run on consumer hardware.

The workflow looked something like this:

  • Pull analytics data from GA4.
  • Process and clean the data using Python.
  • Convert important metrics into structured JSON.
  • Pass the information to Mistral 7B through Ollama.
  • Generate plain English insights and recommendations.

Instead of reviewing dashboards manually, I received summaries explaining what was happening, which pages were declining and where opportunities existed.

The experience felt surprisingly useful for a locally hosted model.

The Trade-Offs Are Real

Running local models is rewarding, but it comes with limitations.

A 7 billion parameter model is not comparable to frontier models from OpenAI, Anthropic or Google. There are situations where the reasoning quality simply cannot match larger systems.

Hallucinations still occur.

Complex analysis can sometimes produce inconsistent recommendations.

Long context windows remain challenging depending on the model and hardware available.

The key lesson is understanding where local models provide value.

For lightweight analysis, summarisation, classification and personal workflows, models such as Mistral 7B perform surprisingly well.

For deep reasoning tasks, larger cloud-based models still have a significant advantage.

What Surprised Me Most

The biggest surprise was how much impact retrieval and structured context had on output quality.

Initially, I assumed better models would automatically produce better insights.

Instead, I discovered that improving the quality of the data pipeline often produced larger gains than upgrading the model itself.

A well-structured dataset paired with a modest model frequently outperformed poorly prepared data sent to a larger model.

This shifted my perspective on AI systems.

The model is only one component. The surrounding workflow matters just as much.

Automating Everything

The final step was eliminating manual intervention altogether.

I scheduled the entire workflow using a Linux cron job.

At predefined intervals, the pipeline would:

  • Pull fresh analytics data.
  • Process and structure the results.
  • Generate AI-powered insights.
  • Send a summary email automatically.

The result was a lightweight analytics assistant running entirely on my own machine.

Instead of checking dashboards every day, I received actionable summaries that highlighted areas worth investigating.

It was not perfect, but it significantly reduced the effort required to monitor product performance.

Final Thoughts

My journey into local LLMs started with privacy concerns, but it quickly became an exploration of ownership and automation.

Running models locally taught me that successful AI systems are rarely just about the model itself. Data quality, retrieval, workflow design and automation often contribute more value than raw model size.

Local models are not going to replace frontier models for every use case. However, for personal knowledge systems, analytics workflows and privacy-focused automation, they are already incredibly capable.

As hardware improves and open-source models continue to evolve, I believe local AI will become an increasingly important part of how individuals and teams build intelligent products.