Building a Lightweight LLM-Powered Q&A API Using Ollama and Node.js

Modern AI applications often require integrating local or hosted Large Language Models (LLMs) into web backends. In this article, we walk through the structure of a simple yet efficient Node.js API that uses Ollama to run LLMs such as Mistral locally. The setup answers questions about a predefined text file by splitting it into chunks and selecting the most relevant ones with keyword matching.
project-root/
├── src/
│   ├── config/
│   │   └── llms.js               # LLM configuration
│   ├── controllers/
│   │   └── api/
│   │       └── llmsController.js # Core logic for question answering
│   ├── services/
│   │   └── llms/
│   │       └── ollamaService.js  # Communication with Ollama API
│   ├── utils/
│   │   ├── llms.js               # Text chunking and relevance logic
│   │   └── utils.js              # Utility functions (loadTXT, apiResponse)
│   └── routes/
│       └── api/
│           └── llms.routes.js    # API route definition
└── uploads/
    └── docs/
        └── example.txt           # The source document for Q&A
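Before looking at the configuration, it helps to see how the service layer talks to Ollama. The snippet below is only a sketch of what src/services/llms/ollamaService.js could contain: the askOllama helper name and its error handling are assumptions, while the POST /api/generate endpoint and the { model, prompt, stream } payload follow Ollama's REST API. It relies on the global fetch available in Node.js 18+.

// src/services/llms/ollamaService.js (sketch)
const { OLLAMA_URL, MODEL_NAME } = require("../../config/llms");

// Send a prompt to the local Ollama server and return the generated text.
async function askOllama(prompt) {
  const res = await fetch(`${OLLAMA_URL}/api/generate`, {
    method: "POST",
    headers: { "Content-Type": "application/json" },
    body: JSON.stringify({ model: MODEL_NAME, prompt, stream: false }),
  });

  if (!res.ok) {
    throw new Error(`Ollama request failed with status ${res.status}`);
  }

  const data = await res.json();
  return data.response; // Ollama returns the completion in the "response" field
}

module.exports = { askOllama };

With stream set to false, Ollama returns the full completion in a single JSON object, which keeps the controller logic simple at the cost of time-to-first-byte.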
This config file, src/config/llms.js, defines the Ollama endpoint, the model name, and the chunking parameters used for splitting documents into manageable pieces.
module.exports = {
  OLLAMA_URL: "http://localhost:11434",            // Local Ollama REST endpoint
  MODEL_NAME: process.env.MODEL_NAME || "mistral", // Model to run, overridable via env
  // Larger chunks keep more context per piece, at the cost of longer prompts:
  // CHUNK_SIZE: 1000,
  // OVERLAP: 100,
  CHUNK_SIZE: 300, // Characters per chunk
  OVERLAP: 50,     // Characters shared between consecutive chunks
};
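To show how CHUNK_SIZE and OVERLAP come into play, here is a minimal sketch of the chunking and relevance logic that src/utils/llms.js is responsible for. The chunkText and findRelevantChunks names and the scoring details are assumptions made for illustration, not the article's exact implementation.

// src/utils/llms.js (sketch)
const { CHUNK_SIZE, OVERLAP } = require("../config/llms");

// Split a document into overlapping character chunks so each piece stays
// small enough for the prompt while preserving some surrounding context.
function chunkText(text) {
  const chunks = [];
  for (let start = 0; start < text.length; start += CHUNK_SIZE - OVERLAP) {
    chunks.push(text.slice(start, start + CHUNK_SIZE));
  }
  return chunks;
}

// Naive keyword matching: score each chunk by how many query words it contains,
// then keep the top-scoring chunks to build the prompt context.
function findRelevantChunks(chunks, question, topN = 3) {
  const keywords = question.toLowerCase().split(/\W+/).filter(Boolean);
  return chunks
    .map((chunk) => ({
      chunk,
      score: keywords.filter((k) => chunk.toLowerCase().includes(k)).length,
    }))
    .sort((a, b) => b.score - a.score)
    .slice(0, topN)
    .map((entry) => entry.chunk);
}

module.exports = { chunkText, findRelevantChunks };

The overlap means consecutive chunks share a small window of text, so a sentence that straddles a chunk boundary still appears intact in at least one chunk.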
