Measuring whether agent messages mean what the receiver thinks. Ontology adherence + schema fidelity, scored where it matters.
AI & ML interests
Independent research lab. Agent infrastructure, evaluation, and the systems under LLMs. Correctness where the code executes. Receipts, not punditry.
Recent Activity
Organization Card
Skelf Research
Independent research lab. Agent infrastructure, evaluation, and the systems under LLMs. Correctness where the code executes. Receipts, not punditry.
Work: huggingface.co/dipankarsarkar ยท github.com/skelf-research
models 0
None public yet
datasets 0
None public yet