Getting WebRTC and WebSockets right in Python is very tricky. If you've tried to wrap an LLM in a real-time audio layer, then you know what I'm talking about.
That's where FastRTC comes in! It makes WebRTC and WebSocket streams super easy with minimal code and overhead.
New state-of-the-art BERT-size retrieval model: *cde-small-v2* 🥳🍾
Hi everyone! We at Cornell are releasing a new retrieval model this week. It uses the contextual embeddings framework, is based on a ModernBERT backbone, and gets state-of-the-art results on the MTEB benchmark for its model size (140M parameters). cde-small-v2 gets an average score of 65.6 across the 56 datasets and improves on our previous model in *every* task domain (retrieval, classification, etc.).
We made a lot of changes to make this model work. First of all, ModernBERT has a better tokenizer, which probably helped this work out of the box. We also followed the principles from the CDE paper and used harder clusters and better hard-negative filtering, which showed a small performance improvement. And we made a few small changes that have been shown to work on the larger models: we disabled weight decay, masked out the prefix tokens during pooling, and added a residual connection from the first stage to the second stage for better gradient flow.
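To make the "masked out the prefix tokens during pooling" step concrete, here is a minimal sketch of mean pooling that skips both padding and a fixed number of leading prefix positions. It is an illustration only, not the actual cde-small-v2 code: the function name `masked_mean_pool`, the `num_prefix_tokens` parameter, and the choice of mean pooling are assumptions, and it uses plain Python lists rather than tensors to stay self-contained.

```python
from typing import List, Sequence


def masked_mean_pool(
    token_embeddings: Sequence[Sequence[float]],
    attention_mask: Sequence[int],
    num_prefix_tokens: int,
) -> List[float]:
    """Average token vectors, ignoring padding and the leading prefix positions.

    All names here are hypothetical; this only illustrates the idea of
    excluding prefix tokens from the pooled sentence embedding.
    """
    dim = len(token_embeddings[0])
    total = [0.0] * dim
    count = 0
    for position, (vector, keep) in enumerate(zip(token_embeddings, attention_mask)):
        # Skip prefix positions (assumed to come first) and padded positions.
        if position < num_prefix_tokens or not keep:
            continue
        for i, value in enumerate(vector):
            total[i] += value
        count += 1
    if count == 0:
        raise ValueError("no tokens left to pool after masking")
    return [value / count for value in total]


# Toy input: one prefix token, two real tokens, one padding token.
toy_embeddings = [[10.0, 10.0], [1.0, 2.0], [3.0, 4.0], [100.0, 100.0]]
toy_mask = [1, 1, 1, 0]
pooled = masked_mean_pool(toy_embeddings, toy_mask, num_prefix_tokens=1)
```

In the toy input only the two middle vectors contribute to the average, so the prefix and padding values have no influence on the pooled embedding.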
We're still looking for a compute sponsor to help us scale CDE to larger models. Since it's now state-of-the-art at the 100M parameter scale, it seems like a reasonable bet that we could train a state-of-the-art large model if we had the GPUs. If you're interested in helping with this, please reach out!
🏁 Now it is possible to chat with telemetry data from real Formula 1 races!
This is an AI-powered solution for analyzing and generating detailed reports on Formula 1 racing sessions. This project combines the power of ReAct agents from LangChain with a RAG approach to pull data from a SQL database.
At the core of this system is a text-to-SQL capability that allows users to ask natural language questions about various aspects of F1 races, such as driver performance, weather impact, race strategies, and more. The AI agent then queries the database, processes the information, and generates comprehensive reports tailored to the user's needs.
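As a rough sketch of the text-to-SQL flow described above (question in, SQL out, query executed, result returned), here is a self-contained example using an in-memory SQLite database. Everything in it is assumed for illustration: the `laps` table, the sample rows, and the `generate_sql` stub, which stands in for the LLM agent with a canned lookup. It is not the project's actual schema or agent code.

```python
import sqlite3
from typing import List, Tuple


def build_demo_db() -> sqlite3.Connection:
    """Create a tiny in-memory table with made-up lap times (hypothetical schema)."""
    conn = sqlite3.connect(":memory:")
    conn.execute("CREATE TABLE laps (driver TEXT, lap INTEGER, lap_time_s REAL)")
    conn.executemany(
        "INSERT INTO laps VALUES (?, ?, ?)",
        [("VER", 1, 92.4), ("VER", 2, 91.8), ("HAM", 1, 93.1), ("HAM", 2, 92.0)],
    )
    return conn


def generate_sql(question: str) -> str:
    """Placeholder for the LLM call: maps a known question to a canned query."""
    canned = {
        "Who set the fastest lap?": (
            "SELECT driver, lap_time_s FROM laps ORDER BY lap_time_s ASC LIMIT 1"
        ),
    }
    return canned[question]


def is_read_only(sql: str) -> bool:
    """Very rough guard: allow a single SELECT statement and nothing else."""
    body = sql.strip().rstrip(";")
    return body.upper().startswith("SELECT") and ";" not in body


def answer(conn: sqlite3.Connection, question: str) -> List[Tuple]:
    """Turn a question into SQL, check it, run it, and return the rows."""
    sql = generate_sql(question)
    if not is_read_only(sql):
        raise ValueError(f"refusing to run non-SELECT statement: {sql!r}")
    return conn.execute(sql).fetchall()


conn = build_demo_db()
rows = answer(conn, "Who set the fastest lap?")
```

The read-only check is there because generated SQL should not be trusted blindly; a real deployment would more likely rely on a database user with read-only permissions than on string inspection.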
The reports can be exported in various formats, making it easy to share insights with team members, race fans, or the broader motorsports community.