bailey kuehl
baileyk
AI & ML interests
None yet
Recent Activity
updated a model 28 days ago
allenai/Olmo-Hybrid-7B new activity 29 days ago
allenai/Olmo-Hybrid-7B:update tokenizer and transformers version new activity 29 days ago
update tokenizer and transformers version
#3 opened 29 days ago
by
baileyk
Clarification regarding Mix <> Pool of this dataset (size seems off)
3
#1 opened 2 months ago
by
ivoschaper
Update README.md
2
#6 opened 2 months ago
by
ivoschaper
`text` vs `original_text`
1
#1 opened 2 months ago
by
nastyafilippova
Inappropriate Contents in the dataset
1
#5 opened 3 months ago
by
xingpeng001
understanding file naming
1
#3 opened 4 months ago
by
nastyafilippova
Installation Video and Testing - Step by Step
❤️ 2
1
#1 opened 4 months ago
by
fahdmirzac
There are a rare few incomplete files in the recently-uploaded webcrawl data
4
#4 opened 4 months ago
by
lukemerrick
cranemath/shard_00000138-withid.jsonl.zst file corrupted
5
#1 opened 4 months ago
by
yangwang92
CraneCode has corrupted shards
3
#4 opened 5 months ago
by
bcui19
Almost all examples in the dataset viewer preview are explicit
3
#4 opened 4 months ago
by
zleizzo
Incomplete common crawl data?
3
#3 opened 4 months ago
by
lukemerrick
Inquiry Downloading the PDF Dataset
2
#1 opened 7 months ago
by
KikiQi
load_dataset appears broken
9
#2 opened 5 months ago
by
karpathy
Cranemath might have some corrupted shards
2
#2 opened 6 months ago
by
bcui19
Damaged Files
2
#1 opened 6 months ago
by
osama24sy
Full Dataset
17
#3 opened 6 months ago
by
JamshidJDMY
AutoModelForSequenceClassification
1
#5 opened 6 months ago
by
L-G
[issue] incorrect data card, missing weborganizer labels
❤️ 1
3
#3 opened 6 months ago
by
glennmatlin
Is the dataset complete?
3
#2 opened 7 months ago
by
koajoel