conversation_id stringclasses 10
values | dataset stringclasses 1
value | first_query stringclasses 10
values | category stringclasses 3
values | domain stringclasses 3
values | lang stringclasses 1
value | is_haystack bool 1
class | mode stringclasses 1
value | length int64 8 12 | turns stringclasses 10
values | fact_ledger stringclasses 9
values | cross_turn_checks stringclasses 10
values | holistic_score stringclasses 1
value |
|---|---|---|---|---|---|---|---|---|---|---|---|---|
0af360b2-0ad8-4d10-8a63-01be0588a597 | seeds | I am depressed and anxious a lot of the time. I am tired of it and don't know what to do. What do you recommend for me to do to be less depressed and anxious? | general | general | en | false | reasoning | 8 | [{"turn_index": 0, "segments": [{"segment_type": "query", "text": "I am depressed and anxious a lot of the time. I am tired of it and don't know what to do. What do you recommend for me to do to be less depressed and anxious?"}, {"segment_type": "reasoning", "text": "The user is expressing feelings of depression and an... | [] | {"delayed_recall": [], "standing": [{"constraint_turn": 2, "checked_turn": 3, "type": "json_valid", "passed": true, "detail": "valid JSON"}, {"constraint_turn": 2, "checked_turn": 4, "type": "json_valid", "passed": true, "detail": "valid JSON"}, {"constraint_turn": 2, "checked_turn": 5, "type": "json_valid", "passed": ... | null |
d8d75a10-6577-4b27-80b5-46b02297080a | seeds | Organize these words in alphabetic order and also from shortest to longest words.
-Casa
-House
-Weapon
-Computer
-Dimension
-Fire | general | general | en | false | reasoning | 8 | [{"turn_index": 0, "segments": [{"segment_type": "query", "text": "Organize these words in alphabetic order and also from shortest to longest words.\n\n-Casa\n\n-House\n\n-Weapon\n\n-Computer\n\n-Dimension\n\n-Fire"}, {"segment_type": "reasoning", "text": "The user wants me to organize a list of words in two ways:\n1. ... | [{"fact_id": "f1", "value": "Tier-3", "planted_turn": 2, "fact_type": "subscription_tier", "value_history": [{"turn": 2, "value": "Sapphire"}, {"turn": 2, "value": "Tier-3"}]}] | {"delayed_recall": [], "standing": [{"constraint_turn": 6, "checked_turn": 7, "type": "json_valid", "passed": true, "detail": "valid JSON"}], "update_overwrite": [{"turn_index": 2, "fact_id": "f1", "passed": true, "detail": "current value present, no stale values"}]} | null |
cffd1dea-8b17-4403-8a38-82307e91746f | seeds | What are the key differences between classical and operant conditioning in psychology? | general | general | en | false | reasoning | 11 | [{"turn_index": 0, "segments": [{"segment_type": "query", "text": "What are the key differences between classical and operant conditioning in psychology?"}, {"segment_type": "reasoning", "text": "The user is asking for the key differences between classical and operant conditioning in psychology. This is a straightforwa... | [{"fact_id": "f1", "value": "Sergeant", "planted_turn": 8, "fact_type": "pet_name", "value_history": [{"turn": 8, "value": "Sergeant"}]}] | {"delayed_recall": [], "standing": [{"constraint_turn": 5, "checked_turn": 6, "type": "markdown_format", "passed": true, "detail": "markdown formatting present"}, {"constraint_turn": 5, "checked_turn": 7, "type": "markdown_format", "passed": true, "detail": "markdown formatting present"}, {"constraint_turn": 5, "checke... | null |
18df4597-1077-4675-9cea-a0069d4e3647 | seeds | What's up doc? | general | general | en | false | reasoning | 10 | [{"turn_index": 0, "segments": [{"segment_type": "query", "text": "What's up doc?"}, {"segment_type": "reasoning", "text": "The user is greeting me with \"What's up doc?\" which is a casual, friendly greeting (famous Bugs Bunny catchphrase). This is the start of our conversation, so I don't have any prior context to re... | [{"fact_id": "f1", "value": "Salta", "planted_turn": 9, "fact_type": "office_city", "value_history": [{"turn": 9, "value": "Salta"}]}] | {"delayed_recall": [], "standing": [], "update_overwrite": []} | null |
4f87bf6e-65be-48b5-ae9a-1d320384848d | seeds | what are the risks of using AI to create code for a professional programmer | coding | coding | en | false | reasoning | 8 | "[{\"turn_index\": 0, \"segments\": [{\"segment_type\": \"query\", \"text\": \"what are the risks of(...TRUNCATED) | "[{\"fact_id\": \"f1\", \"value\": \"crimson\", \"planted_turn\": 3, \"fact_type\": \"favorite_color(...TRUNCATED) | "{\"delayed_recall\": [], \"standing\": [{\"constraint_turn\": 1, \"checked_turn\": 2, \"type\": \"l(...TRUNCATED) | null |
b3739a61-1ed2-4f85-8006-465ee002aa42 | seeds | "What are some unique things about the 37th largest city in Japan? Does it have anything in common w(...TRUNCATED) | general | general | en | false | reasoning | 10 | "[{\"turn_index\": 0, \"segments\": [{\"segment_type\": \"query\", \"text\": \"What are some unique (...TRUNCATED) | "[{\"fact_id\": \"f1\", \"value\": \"77364\", \"planted_turn\": 3, \"fact_type\": \"badge_number\", (...TRUNCATED) | "{\"delayed_recall\": [{\"turn_index\": 5, \"fact_id\": \"f1\", \"passed\": true, \"detail\": \"f1 l(...TRUNCATED) | null |
b7c27e81-42f0-4594-a438-4d95864bcb20 | seeds | "explain in first principles the business model behind politicians such as senators, their incentive(...TRUNCATED) | general | general | en | false | reasoning | 12 | "[{\"turn_index\": 0, \"segments\": [{\"segment_type\": \"query\", \"text\": \"explain in first prin(...TRUNCATED) | "[{\"fact_id\": \"f1\", \"value\": \"Nimbus\", \"planted_turn\": 11, \"fact_type\": \"pet_name\", \"(...TRUNCATED) | "{\"delayed_recall\": [], \"standing\": [{\"constraint_turn\": 2, \"checked_turn\": 3, \"type\": \"j(...TRUNCATED) | null |
e9c91df8-79b8-456e-9393-4a33c1e3ff75 | seeds | On average, how tall is a robot vacuum cleaner? | general | general | en | false | reasoning | 8 | "[{\"turn_index\": 0, \"segments\": [{\"segment_type\": \"query\", \"text\": \"On average, how tall (...TRUNCATED) | [] | "{\"delayed_recall\": [], \"standing\": [{\"constraint_turn\": 6, \"checked_turn\": 7, \"type\": \"m(...TRUNCATED) | null |
6054b496-3237-466d-8eb6-a1965fc31a30 | seeds | can you tell me who mark seifter is? especially focusing on his game design and creation history | humanities | humanities | en | false | reasoning | 11 | "[{\"turn_index\": 0, \"segments\": [{\"segment_type\": \"query\", \"text\": \"can you tell me who m(...TRUNCATED) | "[{\"fact_id\": \"f1\", \"value\": \"Salta\", \"planted_turn\": 4, \"fact_type\": \"office_city\", \(...TRUNCATED) | "{\"delayed_recall\": [{\"turn_index\": 10, \"fact_id\": \"f1\", \"passed\": true, \"detail\": \"f1 (...TRUNCATED) | null |
205e6547-bdfb-4bda-a543-9a7908f1dbc3 | seeds | What kind of safety measures does your programming include? | general | general | en | false | reasoning | 9 | "[{\"turn_index\": 0, \"segments\": [{\"segment_type\": \"query\", \"text\": \"What kind of safety m(...TRUNCATED) | "[{\"fact_id\": \"f1\", \"value\": \"8929\", \"planted_turn\": 3, \"fact_type\": \"lucky_number\", \(...TRUNCATED) | "{\"delayed_recall\": [], \"standing\": [{\"constraint_turn\": 2, \"checked_turn\": 3, \"type\": \"l(...TRUNCATED) | null |
README.md exists but content is empty.
- Downloads last month
- 11