AI & ML interests
Exploring foundation models for DNA and beyond 🧬
Recent Activity
HuggingFaceBio/Carbon-3B-GGUF
HuggingFaceBio/Carbon-500M-GGUF
Factorized Nucleotide Supervision
Explore DNA tokenization and factorization
Conditioning the base distributions: fix a base, the rest adapt
Visualize DNA tokenization and conditional predictions
Next-token prediction on a real sequence
Predict the next DNA base in a given sequence
Per-base, BPE, and 6-mer on the same DNA sequence
Visualize DNA tokenization by base, BPE, and 6‑mer
Prefix ambiguity, in a vocabulary that holds five plausible next tokens
Explore DNA tokenization and model scoring