Stars
[MalayMMLU] This is the first-ever Bahasa Melayu multitask benchmark designed to elevate the performance of Large Language Models (LLMs) and Large Vision Language Models (LVLMs).
✨✨Latest Advances on Multimodal Large Language Models
分享 GitHub 上有趣、入门级的开源项目。Share interesting, entry-level open source projects on GitHub.
Agent Framework / shim to use Pydantic with LLMs
🧑🚀 全世界最好的LLM资料总结 | Summary of the world's best LLM resources.
Educational framework exploring ergonomic, lightweight multi-agent orchestration. Managed by OpenAI Solution team.
🦜🔗 Build context-aware reasoning applications
code for “RGB-D Flow: Dense 3-D Motion Estimation Using Color and Depth”, ICRA2013
Get up and running with Llama 3.3, Phi 4, Gemma 2, and other large language models.
A playbook for systematically maximizing the performance of deep learning models.
a reimplementation of PWC-Net in PyTorch that matches the official Caffe version
Advanced AI Explainability for computer vision. Support for CNNs, Vision Transformers, Classification, Object detection, Segmentation, Image similarity and more.
An Identity-free Video Dataset for Micro-Gesture Understanding and Emotion Analysis (CVPR'21)
Helper package with multiple U-Net implementations in Keras as well as useful utility tools helpful when working with image semantic segmentation tasks. This library and underlying tools come from …
Keras Implementation of Vision Transformer (An Image is Worth 16x16 Words: Transformers for Image Recognition at Scale)
Mean Average Precision for Object Detection