Document RAG API

Description

A FastAPI-based application designed to ingest, process, and query documents stored in markdown format. It leverages a Retrieval-Augmented Generation (RAG) approach to provide relevant document sections based on user queries.

Usage

Querying the API

Ingest Documents from Files:

curl -X POST "http://localhost:8000/ingest/files"

Ingest Documents Directly:

curl -X POST "http://localhost:8000/ingest" \
  -H "Content-Type: application/json" \
  -d '{
    "documents": [
      {
        "content": "# Document Title\n\n## Section 1\nConfidence: 5/5 Source: Primary\n\nThis is the content...",
        "filename": "document1.md",
        "source": "api_upload"
      }
    ]
  }'

Query Documents:

curl -X POST "http://localhost:8000/query" \
  -H "Content-Type: application/json" \
  -d '{
    "query": "your query here",
    "n_results": 5,
    "min_confidence": 0.8
  }'

API Documentation

Access the interactive API documentation at http://localhost:8000/docs after starting the server.

API Endpoints

POST /ingest
- Ingest one or more documents directly via API
- Accepts JSON payload with document content
- Documents must include content and filename
- Optional source field for tracking origin
POST /ingest/files
- Ingest all markdown files from the data directory
- No request body needed
- Returns count of ingested documents
POST /query
- Search through ingested documents
- Supports filtering by confidence score
- Returns most relevant sections
GET /health
- Basic health check endpoint
- Returns server status
GET /stats
- Returns metadata about vector store contents
- Includes document counts, confidence levels, and section statistics
- Useful for monitoring and debugging

Installation

Prerequisites

Python 3.8+
Virtual environment (recommended)

Setup

Clone the repository:

git clone <repository-url>
cd <repository-directory>

Create and activate a virtual environment:

python -m venv venv
source venv/bin/activate  # On Windows use `venv\Scripts\activate`

Install the required packages:
```
pip install -r requirements.txt
```

Start the server:

# Normal start
python src/main.py

# Start with initial data ingestion
python src/main.py --init

Offline Support

The system includes offline capability using a local LLM:

Download the model (4GB):
```
python src/download_model.py
```
The system will automatically switch between:
- Online mode: Uses cloud LLM for best quality
- Offline mode: Uses local LLM for reliability
Offline features:
- Vector search works without internet
- Local text generation available
- Automatic mode switching
- All data stored locally

Development

Setting Up

Ensure you have a /data directory in the project root with markdown files to ingest.
Use watchdog to automatically restart the server on code changes:
```
python watchdog_script.py
```

Debugging

Check the console output for debug information about the data directory and file loading.
Ensure the data directory contains markdown files and is correctly referenced in the code.
Monitor the server logs for ingestion and query details
Use the /health endpoint to verify server status

Contributing

Fork the repository.
Create a new branch for your feature or bugfix.
Submit a pull request with a detailed description of your changes.

Query Endpoint (/query)

Success Response:

{
  "results": [
    "Olvana's cyber capabilities are sophisticated, known for subversion operations. They target sectors including healthcare, financial services, defense industries, energy, and government systems. Known malicious activities include healthcare system targeting, financial sector attacks, and critical infrastructure targeting.",
    "The cyber warfare program is active and known for limited attacks targeting adversarial nations and institutions. Effectiveness has been demonstrated through multiple operations."
  ],
  "metadata": [
    {
      "source": "data/olvana.md",
      "filename": "olvana.md",
      "section_title": "Cyber Capabilities",
      "confidence": "5/5",
      "confidence_score": 1.0,
      "source_document": "Country Overview Document"
    },
    {
      "source": "data/north_torbia.md",
      "filename": "north_torbia.md",
      "section_title": "Cyber Capabilities",
      "confidence": "5/5",
      "confidence_score": 1.0,
      "source_document": "Country Overview Document"
    }
  ]
}

Error Response (Invalid Confidence Score):

{
  "detail": "min_confidence must be between 0 and 1"
}

Error Response (Empty Vector Store):

{
  "detail": "No documents found in vector store. Please ingest documents first."
}

Ingest Endpoint (/ingest)

Success Response:

{
  "status": "success",
  "message": "Successfully ingested 3 documents"
}

Error Response (Invalid Document Format):

{
  "detail": [
    {
      "loc": ["body", "documents", 0, "content"],
      "msg": "field required",
      "type": "value_error.missing"
    }
  ]
}

Error Response (Processing Error):

{
  "detail": "Failed to process document: Invalid markdown structure"
}

Ingest Files Endpoint (/ingest/files)

Success Response:

{
  "status": "success",
  "message": "Successfully ingested 11 documents from files"
}

Warning Response (No Files):

{
  "status": "warning",
  "message": "No documents found in data directory"
}

Error Response (Directory Access):

{
  "detail": "Failed to access data directory: Permission denied"
}

Health Check Endpoint (/health)

Success Response:

{
  "status": "healthy"
}

Stats Endpoint (/stats)

Success Response:

{
  "status": "success",
  "stats": {
    "total_sections": 157,
    "unique_documents": 11,
    "confidence_levels": {
      "5/5": 98,
      "4/5": 45,
      "3/5": 14
    },
    "sections_by_document": {
      "olvana.md": 25,
      "north_torbia.md": 22,
      "sungzon.md": 18,
      "operation_steel_sentinel.md": 15
    },
    "sections_by_title": {
      "Military Capabilities": 11,
      "Geographic Data": 11,
      "Cyber Capabilities": 8,
      "Economic Data": 7
    },
    "average_confidence": 0.89,
    "sources": {
      "Country Overview Document": 89,
      "Military Capability Analysis": 45,
      "Analysis": 23
    }
  }
}

Empty Store Response:

{
  "status": "empty",
  "message": "No documents in vector store",
  "count": 0
}

Name		Name	Last commit message	Last commit date
Latest commit History 22 Commits
data		data
extra_data		extra_data
models		models
src		src
structured		structured
templates		templates
.env.example		.env.example
.gitignore		.gitignore
README.md		README.md
project_structure		project_structure
requirements.txt		requirements.txt
watchdog_script.py		watchdog_script.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Document RAG API

Description

Usage

Querying the API

API Documentation

API Endpoints

Installation

Prerequisites

Setup

Offline Support

Development

Setting Up

Debugging

Contributing

Query Endpoint (/query)

Ingest Endpoint (/ingest)

Ingest Files Endpoint (/ingest/files)

Health Check Endpoint (/health)

Stats Endpoint (/stats)

About

Releases

Packages

Contributors 2

Languages

programbo/battlenerd

Folders and files

Latest commit

History

Repository files navigation

Document RAG API

Description

Usage

Querying the API

API Documentation

API Endpoints

Installation

Prerequisites

Setup

Offline Support

Development

Setting Up

Debugging

Contributing

Query Endpoint (/query)

Ingest Endpoint (/ingest)

Ingest Files Endpoint (/ingest/files)

Health Check Endpoint (/health)

Stats Endpoint (/stats)

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Contributors 2

Languages

Packages