add nocmem: auto memory recall + ingest via NuoNuo hippocampal network
- nocmem Python service (mem/): FastAPI wrapper around NuoNuo's Hopfield-Hebbian memory, with /recall, /ingest, /store, /stats endpoints - NOC integration: auto recall after user message (injected as system msg), async ingest after LLM response (fire-and-forget) - Recall: cosine pre-filter (threshold 0.35) + Hopfield attention (β=32), top_k=3, KV-cache friendly (appended after user msg, not in system prompt) - Ingest: LLM extraction + paraphrase augmentation, heuristic fallback - Wired into main.rs, life.rs (agent done), http.rs (api chat) - Config: optional `nocmem.endpoint` in config.yaml - Includes benchmarks: LongMemEval (R@5=94.0%), efficiency, noise vs scale - Design doc: doc/nocmem.md
This commit is contained in:
19
mem/nocmem.service
Normal file
19
mem/nocmem.service
Normal file
@@ -0,0 +1,19 @@
|
||||
[Unit]
|
||||
Description=nocmem — NuoNuo memory service for NOC
|
||||
After=network.target
|
||||
|
||||
[Service]
|
||||
Type=simple
|
||||
WorkingDirectory=/data/src/noc/mem
|
||||
ExecStart=/home/fam/.local/bin/uv run uvicorn server:app --host 0.0.0.0 --port 9820
|
||||
Restart=on-failure
|
||||
RestartSec=5
|
||||
|
||||
Environment=NOCMEM_LLM_ENDPOINT=http://100.84.7.49:8000/v1
|
||||
Environment=NOCMEM_LLM_MODEL=QuantTrio/gemma-4-31B-it-AWQ
|
||||
Environment=NOCMEM_LLM_API_KEY=unused
|
||||
Environment=NOCMEM_DATA_DIR=/data/src/noc/mem/data
|
||||
Environment=NOCMEM_DEVICE=cuda
|
||||
|
||||
[Install]
|
||||
WantedBy=default.target
|
||||
Reference in New Issue
Block a user