-
Red Hat
-
06:49
- 1h ahead - in/nicolo-lucchesi-834958184
Stars
A high-throughput and memory-efficient inference and serving engine for LLMs
ODH Tools & Extensions Companion
PyTorch extensions for high performance and large scale training.
Muggled DPT: Depth estimation without the magic
[CVPR 2024] Depth Anything: Unleashing the Power of Large-Scale Unlabeled Data. Foundation Model for Monocular Depth Estimation
Accessible large language models via k-bit quantization for PyTorch.
Code examples and resources for DBRX, a large language model developed by Databricks
PyTorch code and models for the DINOv2 self-supervised learning method.
Swift app demonstrating Core ML Stable Diffusion
Provides an interface layer to convert between n-dimensional types in different Rust crates
DeepSeek-VL: Towards Real-World Vision-Language Understanding
On-device AI across mobile, embedded and edge for PyTorch
Generative Models by Stability AI
Everything we actually know about the Apple Neural Engine (ANE)
Stable Diffusion with Core ML on Apple Silicon
Train transformer language models with reinforcement learning.
ONNX Runtime: cross-platform, high performance ML inferencing and training accelerator
The C++ Core Guidelines are a set of tried-and-true guidelines, rules, and best practices about coding in C++
OpenVINO™ is an open source toolkit for optimizing and deploying AI inference
Neural Network Compression Framework for enhanced OpenVINO™ inference
Deep learning in Rust, with shape checked tensors and neural networks
A tool to modify ONNX models in a visualization fashion, based on Netron and Flask.
A list of papers, docs, codes about model quantization. This repo is aimed to provide the info for model quantization research, we are continuously improving the project. Welcome to PR the works (p…
🔎 Search for YouTube videos, channels & playlists. Get 🎞 video & 📑 playlist info using link. Get search suggestions. WITHOUT YouTube Data API v3.
LLM training code for Databricks foundation models