

CodebaseMemory is a thin HTTP client around the ReasonBlocks findings store. It lets an agent persist observations about a codebase — bugs found, file summaries, architectural notes — and retrieve them on future runs via semantic search against a per-codebase vector collection. The client never raises on transport errors. Every method returns an empty list, None, 0, or False when the backend is unreachable, so a missing or slow API endpoint never breaks your agent loop.
Framework wrappers in reasonblocks.integrations.langchain_tools and reasonblocks.integrations.claude_tools expose these methods as LLM-callable tools. Use CodebaseMemory directly when you need programmatic control; use the integration wrappers when you want to hand the tool to an LLM.

Constructor

from reasonblocks import CodebaseMemory

memory = CodebaseMemory(
    codebase_id="myorg_myrepo",
    api_key="rb_live_...",
    base_url="https://rb-api.mycompany.com",
)
codebase_id
string
required
Identifies which codebase’s findings collection to read from and write to. Use a stable, unique string — typically the org and repo names joined with an underscore, such as "pydantic_pydantic" or "myorg_myrepo". Raises ValueError if empty.
api_key
string
required
Your ReasonBlocks API key, used as a Bearer token on every request.
base_url
string
The API origin to connect to. Set this when pointing to a self-hosted deployment. When omitted, the SDK connects to the ReasonBlocks hosted API. The trailing slash is stripped automatically.
timeout
number
default:"10.0"
HTTP request timeout in seconds. The underlying HTTP client keeps TCP connections warm across calls, so a modest timeout is safe for most deployments.

store()

Persists a single finding to the codebase’s findings collection. The content is embedded server-side and indexed for semantic recall.
finding_id = memory.store(
    "BaseModel.validate raises ValidationError when extra=forbid",
    file_path="pydantic/main.py",
    finding_type="behavior",
    metadata={"reviewer": "agent-1", "pr": 42},
)
content
string
required
The finding text to store. Truncated server-side to 8,000 characters. Returns None immediately if the content is empty or whitespace-only.
file_path
string
default:""
The file this finding is about. Truncated to 512 characters. Stored alongside the embedding so you can filter or invalidate by path later.
finding_type
string
default:"note"
A label that categorizes the finding. Truncated to 64 characters. Common values include "behavior", "bug", "summary", "architecture", and "note".
metadata
object
Arbitrary key–value pairs stored with the finding. Useful for tagging findings with run IDs, reviewer names, PR numbers, or any other context you want to retrieve later.
return
string | None
The finding_id string assigned by the server, or None if the store call failed or the content was empty.

recall()

Performs a semantic search over the codebase’s stored findings and returns the most relevant results.
results = memory.recall(
    "validator error handling",
    top_k=5,
    score_threshold=0.3,
)

for r in results:
    print(r["file_path"], r["score"])
    print(r["content"])
query
string
required
The search query. Embedded server-side and compared against stored finding embeddings. Truncated to 2,000 characters. Returns an empty list immediately if the query is empty or whitespace-only.
top_k
integer
default:"5"
Maximum number of results to return. Results are ordered by descending relevance score.
score_threshold
number
default:"0.25"
Minimum cosine similarity score a finding must reach to be included in the results. Raise this to get only high-confidence matches; lower it to cast a wider net.
return
list[dict]
A list of finding dicts ordered by descending relevance score. As the example above shows, each dict includes at least the content, file_path, and score keys.

list_all()

Returns every finding currently stored for this codebase, without any filtering or ranking.
all_findings = memory.list_all()
print(f"{len(all_findings)} findings in cache")
return
list[dict]
A flat list of all finding dicts for this codebase. Each dict has the same shape as results from recall(). Returns an empty list on transport error.
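Since each result dict carries a file_path, list_all() lends itself to a quick audit of where findings cluster. A small sketch, assuming the documented dict shape:

```python
from collections import Counter

def findings_per_file(findings):
    # count stored findings per file_path; `findings` is the list returned
    # by list_all(), each dict shaped like a recall() result
    return Counter(f.get("file_path", "") for f in findings)

sample = [
    {"file_path": "app/auth.py", "content": "..."},
    {"file_path": "app/auth.py", "content": "..."},
    {"file_path": "app/models.py", "content": "..."},
]
print(findings_per_file(sample).most_common(1))  # [('app/auth.py', 2)]
```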

invalidate()

Deletes all findings whose file_path matches any path in the given list. Use this to keep the cache consistent when files change.
deleted = memory.invalidate(["pydantic/main.py", "pydantic/validators.py"])
print(f"Removed {deleted} stale findings")
Pair invalidate() with ImportGraph.blast_radius() to automatically widen invalidation to all files that import the changed files — not just the changed files themselves.
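The widening step can be sketched as a pure function. ImportGraph.blast_radius() is assumed here to return the files that import the changed ones (its exact signature is not shown on this page), so a stub stands in for it:

```python
def widened_paths(changed, blast_radius):
    # union of the changed files and everything blast_radius reports as
    # importing them, ready to pass to memory.invalidate()
    return sorted(set(changed) | set(blast_radius(changed)))

def _stub_blast_radius(paths):
    # stand-in for ImportGraph.blast_radius(); a real import graph would
    # be computed from the codebase
    importers = {"pydantic/main.py": ["pydantic/__init__.py"]}
    return [f for p in paths for f in importers.get(p, [])]

paths = widened_paths(["pydantic/main.py"], _stub_blast_radius)
print(paths)  # ['pydantic/__init__.py', 'pydantic/main.py']
```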
file_paths
string[]
required
A list of file paths to invalidate. Returns 0 immediately when the list is empty.
return
integer
The number of findings deleted. Returns 0 on transport error.

clear()

Drops the entire findings collection for this codebase. This is irreversible.
success = memory.clear()
clear() deletes all findings for the codebase permanently. There is no undo. Use invalidate() for targeted removal.
return
boolean
True if the collection was cleared successfully, False on transport error or if the server did not confirm the operation.

close()

Releases the underlying httpx.Client and its connection pool. Call this when you are done with the CodebaseMemory instance and want to free resources immediately.
memory.close()
You do not need to call close() if the process is about to exit. The HTTP client will be cleaned up automatically. close() is most useful in long-running servers where you want to explicitly reclaim connections.
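Because close() is an ordinary close method, the standard library's contextlib.closing gives deterministic cleanup in long-running code. A sketch (CodebaseMemory is not documented as a context manager itself, so the wrapper does the work; a stub keeps the example standalone):

```python
from contextlib import closing

# closing() works for any object with a close() method; substitute a real
# CodebaseMemory instance for the stub below.
class _StubMemory:
    def __init__(self):
        self.closed = False
    def close(self):
        self.closed = True

with closing(_StubMemory()) as memory:
    pass  # use memory.store() / memory.recall() here

# memory.closed is True once the with-block exits
```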

store_many()

Convenience method that calls store() for each item in a batch. Use this instead of manually looping when you want to persist multiple findings at once.
findings = [
    {
        "content": "auth.py validates JWT expiry before every request",
        "file_path": "app/auth.py",
        "finding_type": "behavior",
    },
    {
        "content": "models.py uses __slots__ for all dataclasses",
        "file_path": "app/models.py",
        "finding_type": "architecture",
    },
]

stored_count = memory.store_many(findings)
print(f"Stored {stored_count} findings")
findings
object[]
required
A list of finding dicts. Each dict must have at least a content key; the optional keys file_path, finding_type, and metadata match the parameters of store(). Dicts with a missing or empty content are silently skipped.
return
integer
The count of findings successfully stored. Items that were skipped or failed do not count toward this total.
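Because store_many() silently skips dicts without usable content, it can help to pre-filter a batch and see how many items will actually be stored. A client-side sketch mirroring that rule:

```python
def storable(findings):
    # keep only dicts whose "content" is a non-empty, non-whitespace string,
    # mirroring the items store_many() will not skip
    return [
        f for f in findings
        if isinstance(f.get("content"), str) and f["content"].strip()
    ]

batch = [
    {"content": "auth.py validates JWT expiry", "file_path": "app/auth.py"},
    {"content": "   "},          # whitespace-only: skipped
    {"file_path": "app/x.py"},   # missing content: skipped
]
print(len(storable(batch)))  # 1
```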

coverage_for()

Returns the fraction of the given file paths that have at least one finding stored in memory. Useful for deciding whether the cache is rich enough to answer a query or whether the agent needs additional tooling to compensate.
paths = ["app/auth.py", "app/models.py", "app/views.py"]
ratio = memory.coverage_for(paths)

if ratio < 0.5:
    print("Cache coverage is low — consider running a full review first")
A soft path match is applied: a finding stored under pydantic/main.py counts as covering a query for main.py.
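One plausible reading of the soft match — the exact rule is not specified on this page — is a suffix comparison on path components:

```python
def soft_match(stored_path, query_path):
    # hypothetical reconstruction: a finding stored under pydantic/main.py
    # counts as covering a query for main.py (suffix match on components)
    return stored_path == query_path or stored_path.endswith("/" + query_path)

print(soft_match("pydantic/main.py", "main.py"))      # True
print(soft_match("pydantic/main.py", "app/main.py"))  # False
```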
file_paths
string[]
required
The list of file paths to check coverage for. Returns 1.0 immediately when the list is empty.
return
number
A float between 0.0 and 1.0 representing the fraction of paths covered. Returns 0.0 when there are no findings in memory at all.

format_recall()

Runs recall() and formats the results as a human-readable string suitable for returning directly from an LLM tool. Avoids duplicating the formatting logic in every agent.
digest = memory.format_recall(
    "error handling in the auth module",
    top_k=3,
    score_threshold=0.3,
    max_len=400,
)
# Returns a formatted string like:
# Found 2 relevant prior findings:
#
# --- [behavior] app/auth.py  (relevance 0.81) ---
# auth.py raises AuthError with code 401 when ...
query
string
required
The search query passed to recall().
top_k
integer
default:"5"
Maximum number of results to return, passed through to recall().
score_threshold
number
default:"0.25"
Minimum similarity score, passed through to recall().
max_len
integer
default:"300"
Maximum characters to include from each finding’s content in the formatted output. Longer content is truncated.
return
string
A formatted multi-line string with a header and one section per result. Returns "(no relevant findings in cache yet)" when there are no matches.
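The layout above can be approximated client-side when you want the same digest from list_all() or pre-filtered results. A sketch matching the shown format, assuming result dicts carry finding_type, file_path, score, and content:

```python
def format_findings(results, max_len=300):
    # approximates the format_recall() output shown above
    if not results:
        return "(no relevant findings in cache yet)"
    lines = [f"Found {len(results)} relevant prior findings:", ""]
    for r in results:
        lines.append(
            f"--- [{r['finding_type']}] {r['file_path']}  "
            f"(relevance {r['score']:.2f}) ---"
        )
        lines.append(r["content"][:max_len])  # truncate long content
    return "\n".join(lines)

digest = format_findings([{
    "finding_type": "behavior",
    "file_path": "app/auth.py",
    "score": 0.81,
    "content": "auth.py raises AuthError with code 401 when the token expires",
}])
print(digest.splitlines()[0])  # Found 1 relevant prior findings:
```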