AI with Papers - Artificial Intelligence & Deep Learning @ai_deeplearning Channel on Telegram

Every day fresh updates on Deep Learning, Machine Learning, and Computer Vision (with Papers).

Curated by Alessandro Ferrari | https://www.linkedin.com/in/visionarynet/

Direct contact: @ARGOVISION

Are you passionate about Artificial Intelligence, Deep Learning, and Machine Learning? Look no further! Join the AI with Papers Telegram channel for daily fresh updates on the latest trends and research in the field. Curated by Alessandro Ferrari, a visionary in the world of AI, this channel provides valuable insights and knowledge for enthusiasts and professionals alike. Through curated papers and articles, you will stay informed about the cutting-edge advancements in Computer Vision, Deep Learning, and Machine Learning.

Alessandro Ferrari, the curator of the channel, is a renowned expert in the field of AI and is dedicated to sharing his expertise with the community. With a background in Computer Science and a passion for innovation, he provides valuable content that will keep you at the forefront of the AI revolution.

Whether you are a seasoned professional or a newcomer to AI, the AI with Papers channel has something for everyone. Stay updated on the latest research, trends, and technologies in Artificial Intelligence and Deep Learning, expand your knowledge, and network with like-minded individuals. Join the AI with Papers Telegram channel today and be a part of the future of AI!

Join now and connect with Alessandro Ferrari directly: @ARGOVISION

23 Nov, 09:09


🦖Dino-X: Unified Obj-Centric LVM🦖

👉Unified vision model for Open-World Detection, Segmentation, Phrase Grounding, Visual Counting, Pose, Prompt-Free Detection/Recognition, Dense Caption, & more. Demo & API announced 💙

👉Review https://t.ly/CSQon
👉Paper https://lnkd.in/dc44ZM8v
👉Project https://lnkd.in/dehKJVvC
👉Repo https://lnkd.in/df8Kb6iz

22 Nov, 07:40


⚔️SAMurai: SAM for Tracking⚔️

👉UW (University of Washington) unveils SAMURAI, an enhanced adaptation of SAM 2 specifically designed for visual object tracking. New SOTA! Code under Apache 2.0 💙

👉Review https://t.ly/yGU0P
👉Paper https://arxiv.org/pdf/2411.11922
👉Repo https://github.com/yangchris11/samurai
👉Project https://yangchris11.github.io/samurai/

18 Nov, 10:31


🧰 EchoMimicV2: Semi-body Human 🧰

👉Alipay (ANT Group) unveils EchoMimicV2, the novel SOTA half-body human animation via APD-Harmonization. See clip with audio (ZH/ENG). Code & Demo announced 💙

👉Review https://t.ly/enLxJ
👉Paper arxiv.org/pdf/2411.10061
👉Project antgroup.github.io/ai/echomimic_v2/
👉Repo-v2 github.com/antgroup/echomimic_v2
👉Repo-v1 https://github.com/antgroup/echomimic

15 Nov, 13:47


🧶 MagicQuill: super-easy Diffusion Editing 🧶

👉MagicQuill is a novel system designed to support users in smart editing of images. It pairs a robust UI/UX (e.g., inserting/erasing objects, changing colors, etc.) with a multimodal LLM that anticipates user intentions in real time. Code & Demos released 💙

👉Review https://t.ly/hJyLa
👉Paper https://arxiv.org/pdf/2411.09703
👉Project https://magicquill.art/demo/
👉Repo https://github.com/magic-quill/magicquill
👉Demo https://huggingface.co/spaces/AI4Editing/MagicQuill

15 Nov, 07:32


🛥️ Global Tracklet Association MOT 🛥️

👉A novel universal, model-agnostic method designed to refine and enhance tracklet association for single-camera MOT. Suitable for datasets such as SportsMOT, SoccerNet & similar. Source code released 💙

👉Review https://t.ly/gk-yh
👉Paper https://lnkd.in/dvXQVKFw
👉Repo https://lnkd.in/dEJqiyWs

14 Nov, 07:54


🔥 4 NanoSeconds inference 🔥

👉LogicTreeNet: a convolutional differentiable logic gate network (LGN) with logic gate tree kernels, bringing Computer Vision to differentiable LGNs. On CIFAR-10 it uses only 61M logic gates, 29x smaller than SOTA, with inference in 4 NANOsecs!

👉Review https://t.ly/GflOW
👉Paper https://lnkd.in/dAZQr3dW
👉Full clip https://lnkd.in/dvDJ3j-u

13 Nov, 07:51


πŸ”SeedEdit: foundational T2IπŸ”

πŸ‘‰ByteDance unveils a novel T2I foundational model capable of delivering stable, high-aesthetic image edits which maintain image quality through unlimited rounds of editing instructions. No code announced but a Demo is onlineπŸ’™

πŸ‘‰Review https://t.ly/hPlnN
πŸ‘‰Paper https://arxiv.org/pdf/2411.06686
πŸ‘‰Project team.doubao.com/en/special/seededit
πŸ€—Demo https://huggingface.co/spaces/ByteDance/SeedEdit-APP

11 Nov, 13:47


❄️Don't Look Twice: ViT by RLT❄️

👉CMU unveils RLT, a method for speeding up video transformers inspired by run-length encoding for data compression. It speeds up training and reduces the token count by up to 80%! Source Code announced 💙

👉Review https://t.ly/ccSwN
👉Paper https://lnkd.in/d6VXur_q
👉Project https://lnkd.in/d4tXwM5T
👉Repo TBA

10 Nov, 10:43


🫠 X-Portrait 2: SOTA(?) Portrait Animation 🫠

👉ByteDance unveils a preview of X-Portrait2, the new SOTA expression encoder model that implicitly encodes every minuscule expression from the input, thanks to training on large-scale datasets. Impressive results, but no paper or code announced.

👉Review https://t.ly/8Owh9 [UPDATE]
👉Paper ?
👉Project byteaigc.github.io/X-Portrait2/
👉Repo ?

08 Nov, 09:30


🧠 Single Neuron Reconstruction 🧠

👉SIAT unveils NeuroFly, a framework for large-scale single neuron reconstruction. It formulates the neuron reconstruction task as a streamlined 3-stage workflow: automatic segmentation, connection, and manual proofreading. Bridging computer vision and neuroscience 💙

👉Review https://t.ly/Y5Xu0
👉Paper https://arxiv.org/pdf/2411.04715
👉Repo github.com/beanli161514/neurofly

07 Nov, 08:24


💪 Muscles in Time Dataset 💪

👉Muscles in Time (MinT) is a large-scale synthetic muscle activation dataset. MinT contains 9+ hours of simulation data covering 227 subjects and 402 simulated muscle strands. Code & Dataset available soon 💙

👉Review https://t.ly/108g6
👉Paper arxiv.org/pdf/2411.00128
👉Project davidschneider.ai/mint
👉Code github.com/simplexsigil/MusclesInTime

05 Nov, 07:22


CityGaussianV2: Large-Scale City

👉A novel approach for large-scale scene reconstruction that addresses critical challenges related to geometric accuracy and efficiency: 10x compression, 25% faster & 50% less memory! Source code released 💙

👉Review https://t.ly/Xgn59
👉Paper arxiv.org/pdf/2411.00771
👉Project dekuliutesla.github.io/CityGaussianV2/
👉Code github.com/DekuLiuTesla/CityGaussian

04 Nov, 07:39


☀️ Universal Relightable Avatars ☀️

👉#Meta unveils URAvatar: photorealistic & relightable avatars from a phone scan with unknown illumination. Stunning results!

👉Review https://t.ly/U-ESX
👉Paper arxiv.org/pdf/2410.24223
👉Project junxuan-li.github.io/urgca-website

01 Nov, 07:30


REM: Segment What You Describe

👉REM is a framework for segmenting concepts in video that can be described through natural language. Suitable for rare & non-object dynamic concepts, such as waves, smoke, etc. Code & Data announced 💙

👉Review https://t.ly/OyVtV
👉Paper arxiv.org/pdf/2410.23287
👉Project https://miccooper9.github.io/projects/ReferEverything/

31 Oct, 08:13


🔥🔥 The code is out 🔥🔥

👉Code https://github.com/HaixinShi/fmov_pose

31 Oct, 08:00


🔥 D-FINE: new SOTA Detector 🔥

👉D-FINE is a powerful real-time object detector that achieves outstanding localization precision by redefining the bounding box regression task in DETR models. New SOTA on MS COCO with additional data. Code & models available 💙

👉Review https://t.ly/aw9fN
👉Paper https://arxiv.org/pdf/2410.13842
👉Code https://github.com/Peterande/D-FINE

29 Oct, 07:53


Blendify: #Python + Blender

👉Lightweight Python framework that provides a high-level API for creating & rendering scenes with #Blender. It simplifies data augmentation & synthesis. Source Code released 💙

👉Review https://t.ly/l0crA
👉Paper https://arxiv.org/pdf/2410.17858
👉Code https://virtualhumans.mpi-inf.mpg.de/blendify/

25 Oct, 10:49


⛈️ SMITE: SEGMENT IN TIME ⛈️

👉SFU unveils SMITE: a novel AI that, given only one or a few fine-grained segmentation references, can segment different unseen videos consistently with those references. Dataset & Code (under Apache 2.0) announced 💙

👉Review https://t.ly/w6aWJ
👉Paper arxiv.org/pdf/2410.18538
👉Project segment-me-in-time.github.io/
👉Repo github.com/alimohammadiamirhossein/smite

24 Oct, 09:05


🌻 Plant Camouflage Detection 🌻

👉PlantCamo Dataset is the first dataset for plant camouflage detection: 1,250 images with camouflage characteristics. Source Code released 💙

👉Review https://t.ly/pYFX4
👉Paper arxiv.org/pdf/2410.17598
👉Code github.com/yjybuaa/PlantCamo