Open to Collab
Mohammed Hamdy
mmhamdy
74 followers · 265 following
https://surfingmanifolds.substack.com/
mhamdy_res
mmhamdy
mmhamdy
mmhamdy.bsky.social
AI & ML interests
AI4Sci | NLP | Reinforcement Learning
Recent Activity
replied to their post 6 days ago
Things rarely go as we expect! In 2017, Google released the Transformer architecture. While it was clear the model was promising, absolutely no one (including its authors) anticipated the pervasive global revolution it would create! The authors actually viewed the Transformer as just a stepping stone for a much more ambitious project: the MultiModel. Their ultimate goal, back in 2017, was to build a single deep learning architecture capable of jointly learning massive, diverse tasks across entirely different domains: One Model To Learn Them All. In fact, the MultiModel paper was published in the exact same month as Attention Is All You Need! But history had other plans. The building block eclipsed the grand design! So, have you heard about the MultiModel before?
posted an update 6 days ago
posted an update 5 months ago
The new DeepSeek Engram paper is super fun! It also integrates mHC, and I suspect they're probably releasing all these papers to make the V4 report of reasonable length. Here's a nice short summary from Gemini
mmhamdy's datasets (1)
mmhamdy/Arabic-OpenHermes-Filtered • Viewer • Updated Mar 7, 2024 • 78.1k • 49