Latest posts from Cool GitHub repositories (@github_repositories_bds) on Telegram

Cool GitHub repositories
Sharing the coolest GitHub repositories.

Free courses: @bigdataspecialist

Free programming books: @programming_books_bds

Data science: @datascience_bds

Interview preparation: @coding_interview_preparation

Contact me: @mldatascientist
8,391 subscribers
11 photos
1 video
Last updated 01.03.2025 07:22

Latest content shared by Cool GitHub repositories on Telegram


Crawl4AI

Crawl4AI is the #1 trending GitHub repository, actively maintained by a vibrant community. It delivers blazing-fast, AI-ready web crawling tailored for LLMs, AI agents, and data pipelines. Open source, flexible, and built for real-time performance, Crawl4AI empowers developers with unmatched speed, precision, and deployment ease.

Creator: Unclecode
Stars ⭐️: 26.5k
Forked by: 2.1k

GitHub Repo:
https://github.com/unclecode/crawl4ai

#AI #gpt

Join @github_repositories_bds for more cool repositories.
*This channel belongs to @bigdataspecialist group

Ultravox

Ultravox is a new kind of multimodal LLM that can understand text as well as human speech, without the need for a separate Automatic Speech Recognition (ASR) stage. Building on research like AudioLM, SeamlessM4T, Gazelle, SpeechGPT, and others, Ultravox is able to extend any open-weight LLM with a multimodal projector that converts audio directly into the high-dimensional space used by the LLM. We've trained versions on Llama 3, Mistral, and Gemma. This direct coupling allows Ultravox to respond much more quickly than systems that combine separate ASR and LLM components. In the future this will also allow Ultravox to natively understand the paralinguistic cues of timing and emotion that are omnipresent in human speech.

The current version of Ultravox (v0.4), when invoked with audio content, has a time-to-first-token (TTFT) of approximately 150ms, and a tokens-per-second rate of ~60 using a Llama 3.1 8B backbone. While quite fast, we believe there is considerable room for improvement in these numbers.
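To put the quoted figures in perspective, here is a rough back-of-the-envelope sketch in Python. It assumes a simple model in which the first token arrives after the TTFT and every later token streams at the steady rate; the function name and the model itself are illustrative assumptions, not something taken from the Ultravox project.

```python
def estimated_response_seconds(
    num_tokens: int,
    ttft_s: float = 0.150,        # ~150 ms time-to-first-token quoted in the post
    tokens_per_s: float = 60.0,   # ~60 tokens/s quoted in the post
) -> float:
    """Rough wall-clock time to stream `num_tokens` tokens.

    Assumption: first token lands after TTFT, the remaining
    (num_tokens - 1) tokens arrive at a constant rate.
    """
    if num_tokens <= 0:
        return 0.0
    return ttft_s + (num_tokens - 1) / tokens_per_s


if __name__ == "__main__":
    for n in (1, 30, 120):
        print(f"{n:>4} tokens: ~{estimated_response_seconds(n):.2f} s")
```

Under this model the TTFT dominates short replies, while the tokens-per-second rate dominates long ones.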

Ultravox currently takes in audio and emits streaming text. As we evolve the model, we'll train it to be able to emit a stream of speech tokens that can then be converted directly into raw audio by an appropriate unit vocoder.

Creator: Fixie-ai
Stars ⭐️: 3k
Forked by: 190

GitHub Repo:
https://github.com/fixie-ai/ultravox

#github


Moondream
Moondream is a highly efficient open-source vision language model that combines powerful image understanding capabilities with a remarkably small footprint. It's designed to be versatile and accessible, capable of running on a wide range of devices and platforms.

The project offers two model variants:
* Moondream 2B: The primary model with 2 billion parameters, offering robust performance for general-purpose image understanding tasks including captioning, visual question answering, and object detection.
* Moondream 0.5B: A compact 500 million parameter model specifically optimized as a distillation target for edge devices, enabling efficient deployment on resource-constrained hardware while maintaining impressive capabilities.
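As a rough illustration of what "small footprint" can mean for the two variants, the sketch below estimates the memory needed for the weights alone at a few common precisions. The parameter counts are the nominal figures from the post; the helper name and the choice of precisions are assumptions for illustration, and real usage will be higher once activations and runtime overhead are included.

```python
# Bytes needed to store one parameter at each precision.
BYTES_PER_PARAM = {"fp32": 4.0, "fp16": 2.0, "int8": 1.0, "int4": 0.5}


def weight_footprint_gb(num_params: float, dtype: str = "fp16") -> float:
    """Approximate size of the model weights in gigabytes (1 GB = 1e9 bytes)."""
    return num_params * BYTES_PER_PARAM[dtype] / 1e9


if __name__ == "__main__":
    # Nominal parameter counts as stated in the post.
    variants = [("Moondream 2B", 2e9), ("Moondream 0.5B", 0.5e9)]
    for name, params in variants:
        for dtype in ("fp16", "int8", "int4"):
            size = weight_footprint_gb(params, dtype)
            print(f"{name} @ {dtype}: ~{size:.2f} GB of weights")
```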

Creator: Vikhyat
Stars ⭐️: 7k
Forked by: 549
https://github.com/vikhyat/moondream

#github


ton

Main TON monorepo, which includes the code of the node/validator, lite-client, tonlib, FunC compiler, etc.

Creator: ton-blockchain
Stars ⭐️: 3.4k
Forked by: 1.1k
https://github.com/ton-blockchain/ton

#github


WrenAI

Open-source GenBI AI Agent that empowers data-driven teams to chat with their data to generate Text-to-SQL, charts, spreadsheets, reports, and BI. 📈📊📋🧑‍💻

Creator: Canner
Stars ⭐️: 4.9k
Forked by: 403

GitHub Repo:
https://github.com/Canner/WrenAI

#AI #texttoSQL #bi #charts


RealtimeSTT
A robust, efficient, low-latency speech-to-text library with advanced voice activity detection, wake word activation and instant transcription.
* AudioToTextRecorderClient class, which automatically starts a server if none is running and connects to it. The class shares the same interface as AudioToTextRecorder, making it easy to upgrade or switch between the two. (Work in progress: most parameters and callbacks of AudioToTextRecorder are already implemented in AudioToTextRecorderClient, but not all. Also, the server cannot handle concurrent (parallel) requests yet.)
* Reworked CLI ("stt-server" to start the server, "stt" to start the client; see the "server" folder for more info)

Creator: KoljaB
Stars ⭐️: 4.1k
Forked by: 346
https://github.com/KoljaB/RealtimeSTT

#github


MoneyPrinter Version 2
An application that automates the process of making money online. MPV2 (MoneyPrinter Version 2) is, as the name suggests, the second version of the MoneyPrinter project. It is a complete rewrite of the original project, with a focus on a wider range of features and a more modular architecture.

Creator: FujiwaraChoki
Stars ⭐️: 5.6k
Forked by: 580
https://github.com/FujiwaraChoki/MoneyPrinterV2

#github


MiniCPM-o
MiniCPM-o is the latest series of end-side multimodal LLMs (MLLMs) upgraded from MiniCPM-V. The models can now take image, video, text, and audio as inputs and provide high-quality text and speech outputs in an end-to-end fashion. Since February 2024, we have released 6 versions of the model, aiming to achieve strong performance and efficient deployment. The most notable models in the series currently include:

* MiniCPM-o 2.6: 🔥🔥🔥 The latest and most capable model in the MiniCPM-o series. With a total of 8B parameters, this end-to-end model achieves comparable performance to GPT-4o-202405 in vision, speech, and multimodal live streaming, making it one of the most versatile and performant models in the open-source community. For the new voice mode, MiniCPM-o 2.6 supports bilingual real-time speech conversation with configurable voices, and also allows for fun capabilities such as emotion/speed/style control, end-to-end voice cloning, role play, etc. It also advances MiniCPM-V 2.6's visual capabilities, such as strong OCR capability, trustworthy behavior, multilingual support, and video understanding. Due to its superior token density, MiniCPM-o 2.6 can for the first time support multimodal live streaming on end-side devices such as iPad.

* MiniCPM-V 2.6: The most capable model in the MiniCPM-V series. With a total of 8B parameters, the model surpasses GPT-4V in single image, multi-image and video understanding. It outperforms GPT-4o mini, Gemini 1.5 Pro and Claude 3.5 Sonnet in single image understanding, and can for the first time support real-time video understanding on iPad.

Creator: OpenBMB
Stars ⭐️: 16223
Forked by: 1173
https://github.com/OpenBMB/MiniCPM-o

#gpt


Tabby
Tabby is a self-hosted AI coding assistant, offering an open-source and on-premises alternative to GitHub Copilot. It boasts several key features:

* Self-contained, with no need for a DBMS or cloud service.
* OpenAPI interface, easy to integrate with existing infrastructure (e.g., Cloud IDE).
* Supports consumer-grade GPUs.

Creator: TabbyML
Stars ⭐️: 28.3k
Forked by: 1.3k
https://github.com/TabbyML/tabby


#AI


Genesis
Genesis is a physics platform designed for general-purpose Robotics/Embodied AI/Physical AI applications. It is several things at once:

* A universal physics engine re-built from the ground up, capable of simulating a wide range of materials and physical phenomena.
* A lightweight, ultra-fast, pythonic, and user-friendly robotics simulation platform.
* A powerful and fast photo-realistic rendering system.
* A generative data engine that transforms user-prompted natural language description into various modalities of data.


Creator: Genesis-Embodied-AI
Stars ⭐️: 22.9k
Forked by: 1.9k
https://github.com/Genesis-Embodied-AI/Genesis

#AI
