mrosati/Chromy

Latest commit: mrosati bd08c2bda3 "add readme" (2026-04-21 20:13:28 +02:00)

File                 Last commit               Date
handlers             complete refactor         2026-04-21 17:42:37 +02:00
.gitignore           ignore DS_Source          2026-04-21 19:50:48 +02:00
.python-version      init                      2026-04-21 13:32:09 +02:00
chroma_functions.py  add metadata (file_name)  2026-04-21 18:24:49 +02:00
chunk_functions.py   add documents             2026-04-21 15:28:20 +02:00
cli_app.py           complete refactor         2026-04-21 17:42:37 +02:00
cli_parser.py        code format. add nuitka.  2026-04-21 17:55:11 +02:00
embed.py             add chunk and embed       2026-04-21 15:06:04 +02:00
main.py              complete refactor         2026-04-21 17:42:37 +02:00
pyproject.toml       code format. add nuitka.  2026-04-21 17:55:11 +02:00
README.md            add readme                2026-04-21 20:13:28 +02:00
utilities.py         add metadata (file_name)  2026-04-21 18:24:49 +02:00
uv.lock              code format. add nuitka.  2026-04-21 17:55:11 +02:00

README.md

chroma

A small command-line utility for working with a local Chroma database. It lets you create collections, ingest file contents as chunked embeddings, and run similarity queries against stored documents.

What it does

manages local Chroma collections
chunks files with semchunk
generates embeddings with Chroma's default embedding function
stores chunk text plus source file metadata
queries collections and prints readable results

Requirements

Python 3.12+
an environment that can install the dependencies listed in pyproject.toml

Installation

Using uv:

uv sync

Or with pip:

python -m venv .venv
source .venv/bin/activate
pip install -e .

Running the CLI

The project entrypoint is main.py.

uv run python main.py --help

Commands

list-collections | lc
create-collection | cc <collection>
delete-collection | dc <collection>
count | co <collection>
add-data | ad <collection> <file>
query | q <collection> <query_text>
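
The command table above maps naturally onto argparse subcommands with aliases. The following is a minimal sketch of that idea, not the contents of cli_parser.py: the function name build_parser and the dest name "command" are assumptions made for illustration.

```python
import argparse


def build_parser() -> argparse.ArgumentParser:
    """Build a parser mirroring the command table (hypothetical layout)."""
    parser = argparse.ArgumentParser(prog="main.py")
    sub = parser.add_subparsers(dest="command", required=True)

    # list-collections takes no positional arguments.
    sub.add_parser("list-collections", aliases=["lc"])

    # These three commands take only a collection name.
    for name, alias in [
        ("create-collection", "cc"),
        ("delete-collection", "dc"),
        ("count", "co"),
    ]:
        cmd = sub.add_parser(name, aliases=[alias])
        cmd.add_argument("collection")

    add = sub.add_parser("add-data", aliases=["ad"])
    add.add_argument("collection")
    add.add_argument("file")

    query = sub.add_parser("query", aliases=["q"])
    query.add_argument("collection")
    query.add_argument("query_text")
    return parser


args = build_parser().parse_args(["q", "notes", "How do I configure this project?"])
print(args.command, args.collection, args.query_text)
```

Note that argparse stores whichever spelling was typed (here "q", not "query"), so a dispatcher built on this sketch would need to accept both the long name and the alias.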

Examples

Create a collection:

uv run python main.py create-collection notes

Add a file:

uv run python main.py add-data notes ./docs/example.txt

Count stored records:

uv run python main.py count notes

Search the collection:

uv run python main.py query notes "How do I configure this project?"

List collections:

uv run python main.py list-collections

Delete a collection:

uv run python main.py delete-collection notes

How ingestion works

When you run add-data, the file is:

1. read from disk
2. split into chunks
3. embedded
4. inserted into the target collection with the original file path stored as metadata
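
The steps above can be sketched without any third-party dependencies. This is an illustration under stated assumptions, not the project's code: chunk_text is a word-count stand-in for semchunk, and the id scheme, the build_records helper, and the "file_name" metadata key are hypothetical (the key is guessed from the commit message "add metadata (file_name)").

```python
from pathlib import Path


def chunk_text(text: str, max_words: int = 50) -> list[str]:
    """Stand-in chunker: group words into fixed-size runs.

    The project itself uses semchunk; this placeholder only keeps the
    sketch self-contained.
    """
    words = text.split()
    return [" ".join(words[i:i + max_words]) for i in range(0, len(words), max_words)]


def build_records(file_path: str, text: str, max_words: int = 50) -> dict[str, list]:
    """Turn one file's text into parallel lists of ids, documents, and metadatas."""
    chunks = chunk_text(text, max_words)
    name = Path(file_path).name
    return {
        # Hypothetical id scheme: file name plus chunk index.
        "ids": [f"{name}-{i}" for i in range(len(chunks))],
        "documents": chunks,
        # Every chunk carries the path it came from (assumed key name).
        "metadatas": [{"file_name": file_path} for _ in chunks],
    }


records = build_records("./docs/example.txt", "alpha beta gamma delta epsilon", max_words=2)
print(records["ids"])
```

In a real run the text would come from something like Path(file_path).read_text(), and the three lists could be passed to a Chroma collection's add(ids=..., documents=..., metadatas=...) method, which embeds the documents with the collection's embedding function when no embeddings are supplied.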

Query results include the stored document chunk, its id, distance, and file name when available.
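
As a rough illustration of how such output could be produced: a Chroma query returns a dictionary of nested lists, with one inner list per query text. The formatter below is a sketch under assumptions; format_results, the output layout, and the "file_name" metadata key are hypothetical, not taken from the project.

```python
def format_results(result: dict) -> list[str]:
    """Render the hits for the first query text, one readable entry each."""
    ids = result["ids"][0]
    documents = result["documents"][0]
    distances = result["distances"][0]
    # Metadata may be absent entirely or missing for individual hits.
    metadatas = result.get("metadatas") or [[None] * len(ids)]

    lines = []
    for doc_id, doc, dist, meta in zip(ids, documents, distances, metadatas[0]):
        file_name = (meta or {}).get("file_name", "unknown")
        lines.append(f"[{doc_id}] distance={dist:.3f} file={file_name}\n  {doc}")
    return lines


# Hand-written sample in the shape Chroma uses for query results.
sample = {
    "ids": [["example.txt-0", "example.txt-1"]],
    "documents": [["first chunk", "second chunk"]],
    "distances": [[0.25, 0.5]],
    "metadatas": [[{"file_name": "./docs/example.txt"}, None]],
}
for line in format_results(sample):
    print(line)
```

Falling back to "unknown" covers the "when available" case: chunks stored without metadata still print cleanly.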

Notes

collections are stored in a local persistent Chroma database
add-data requires the target collection to already exist
the CLI prints friendly messages for common errors such as missing collections or missing files