AI & ML interests

AI4Science

Recent Activity

cgeorgiaw updated a Space about 12 hours ago
LeMaterial/LeMat-GenBench
cgeorgiaw published a Space about 1 month ago
LeMaterial/LeMat-GenBench
cgeorgiaw updated a Space about 1 month ago
LeMaterial/LeMat-GenBench

cgeorgiaw
posted an update about 2 months ago
cgeorgiaw
posted an update 4 months ago
🚀🚀🚀 The largest ever dataset of co-folded 3D protein-ligand structures just dropped on HF!!

Meet SAIR (Structurally Augmented IC₅₀ Repository): 5M+ AI-generated complexes with experimentally measured drug potency data from SandboxAQ. 🚀🚀🚀

Check it out and explore here: SandboxAQ/SAIR

cgeorgiaw
posted an update 5 months ago
cgeorgiaw
posted an update 7 months ago
cgeorgiaw
posted an update 7 months ago
Snooping on HF is the best because sometimes you just discover that someone (in this case, Earth Species Project) is about to drop terabytes of sick (high-quality animal sounds) data...

EarthSpeciesProject/NatureLM-audio-training
cgeorgiaw
posted an update 7 months ago
Just dropped two bigger physics datasets (both on photonics)!

NUMBA 1: SIB-CL
Datasets for Surrogate- and Invariance-Boosted Contrastive Learning (SIB-CL), covering two scientific problems:
- PhC2D: 2D photonic crystal density-of-states (DOS) and bandstructure data.
- TISE: 3D time-independent Schrödinger equation eigenvalue and eigenvector solutions.

NUMBA 2: 2D Photonic Topology
Symmetry-driven analysis of 2D photonic crystals: 10k random unit cells across 11 symmetries, 2 polarizations, 5 contrasts. Includes time-reversal breaking cases for 4 symmetries at high contrast.

Check them out: cgeorgiaw/sib-cl & cgeorgiaw/2d-photonic-topology
clefourrier
posted an update 8 months ago
Always surprised that so few people actually read the FineTasks blog, on
✨how to select training evals with the highest signal✨

If you're serious about training models without wasting compute on shitty runs, you absolutely should read it!!

A high-signal eval actually tells you precisely, during training, how well & what your model is learning, allowing you to discard the bad runs/bad samplings/...!

The blog covers in depth prompt choice, metrics, and datasets, across languages/capabilities, and my fave section is "which properties should evals have" 👌
(so you know how to select the best evals for your use case)

Blog: HuggingFaceFW/blogpost-fine-tasks
thomwolf
posted an update 9 months ago
If you've followed the progress of robotics in the past 18 months, you've likely noticed how robotics is increasingly becoming the next frontier that AI will unlock.

At Hugging Face, in robotics and across all AI fields, we believe in a future where AI and robots are open-source, transparent, and affordable; community-built and safe; hackable and fun. We've had so much mutual understanding and passion working with the Pollen Robotics team over the past year that we decided to join forces!

You can already find our open-source humanoid robot platform Reachy 2 on the Pollen website and the Pollen community and people here on the hub at pollen-robotics

We're so excited to build and share more open-source robots with the world in the coming months!