ShypdShypd.ai
💻

Coding & Development

Browsing page 139 of AI tools for Open Source & Models in Coding & Development. Sorted by confidence score — our independent quality rating.

BMW-YOLOv4-Training-Automation

BMW-YOLOv4-Training-Automation

58%

BMW-YOLOv4-Training-Automation is an open-source repository designed to simplify the training of state-of-the-art Deep Learning models, specifically YOLOv4 and YOLOv3. It aims to provide a no-code training experience, requiring little to no configuration. Users can supply their own labeled datasets or utilize the BMW-LabelTool-Lite for labeling. The tool supports comprehensive monitoring of the training process through various methods, including TensorBoard, a custom REST API with Swagger, and a graphical user interface (GUI). It is dockerized for flexible deployment on both GPU and CPU environments, making deep learning model training more accessible for developers and data scientists.

DirectML

DirectML

58%

DirectML is a high-performance, hardware-accelerated DirectX 12 library designed for machine learning tasks. It offers GPU acceleration for common machine learning operations across a wide array of supported hardware and drivers, including all DirectX 12-capable GPUs from major vendors. While DirectML is currently in maintenance mode, it remains supported on previous Windows releases and continues to ship with future Windows versions, receiving security and compliance fixes. It is distributed as a system component of Windows 10 and is also available as a standalone redistributable package for applications requiring a fixed version or running on older Windows 10 versions. DirectML exposes a native C++ DirectX 12 API and integrates as a backend for frameworks like Windows ML, ONNX Runtime, PyTorch, and TensorFlow, making it suitable for high-performance, low-latency applications such as frameworks, games, and other real-time applications.

ENet

ENet

58%

ENet is a deep neural network architecture specifically designed for real-time semantic segmentation, a crucial task in computer vision. It prioritizes efficient computation, enabling rapid processing of images and videos, which is essential for time-sensitive applications. The architecture is significantly more efficient than alternatives like SegNet, requiring fewer parameters and a smaller model size while achieving faster execution times. This makes ENet particularly well-suited for deployment in environments where speed and resource constraints are paramount, such as autonomous driving systems and robotics. The project provides resources including a tutorial, publication details, and pre-trained weights for various applications.

fastapi-ml-skeleton

fastapi-ml-skeleton

58%

fastapi-ml-skeleton is an open-source FastAPI skeleton application designed to streamline the deployment of machine learning models into production environments. It provides a robust, preconfigured, and fully tested codebase, enabling developers to quickly build and serve ML models as RESTful APIs. The project emphasizes speed, ease of use, and security, leveraging the FastAPI framework. It includes a sample regression model for house price prediction to help users understand its functionality and accelerate their own projects. The skeleton supports Python 3.11+, uses Poetry for package management, and incorporates comprehensive linting and testing with tools like isort, mypy, flake, black, and bandit, ensuring high code quality and maintainability.

Cywareness.io

Cywareness.io

58%

Cywareness.io is a comprehensive cybersecurity awareness training platform designed to minimize organizational risk by transforming employees into a human firewall. The platform offers automated and personalized cybersecurity awareness training programs, including unlimited realistic phishing simulations to test employee knowledge and response to various attack vectors. It provides instant reports and full insights through a dashboard to track progress and identify areas for improvement. Cywareness also features a vast video library for training, with the option to upload custom content, and offers micro-training after simulated attacks. With ISO27001 certification, Cywareness emphasizes secure and reliable solutions, integrating with Microsoft and Google workspaces for efficient management of training programs.

LFM2 ColBERT

LFM2 ColBERT

58%

LFM2 ColBERT is an AI model available on Hugging Face, designed for multilingual document retrieval. Users can enter a query to find relevant documents across various languages. The tool provides a list of matching documents, each accompanied by details such as its language, a relevance score, and the document's text. This makes it suitable for tasks requiring cross-lingual information retrieval or document search. As an open-source model, it offers flexibility for developers and researchers to integrate and adapt it for their specific applications, particularly those involving large-scale text data in diverse linguistic contexts.

MiniMax-M1

MiniMax-M1

58%

MiniMax-M1 is the world's first open-weight, large-scale hybrid-attention reasoning model, powered by a hybrid Mixture-of-Experts (MoE) architecture combined with a lightning attention mechanism. Developed based on the MiniMax-Text-01 model, it features 456 billion parameters with 45.9 billion parameters activated per token. A key differentiator is its native support for a context length of 1 million tokens, significantly larger than competitors. The lightning attention mechanism ensures efficient scaling of test-time compute, consuming 25% of the FLOPs compared to DeepSeek R1 at a generation length of 100K tokens. MiniMax-M1 is trained using large-scale reinforcement learning (RL) on diverse problems, including mathematical reasoning and software engineering. It offers two versions with 40K and 80K thinking budgets, outperforming other strong open-weight models on complex software engineering, tool-using, and long-context tasks. It also supports function calling capabilities and provides a chatbot and API for general use and evaluation.

Matterport

Matterport

58%

Matterport3D is a comprehensive open-source dataset designed for RGB-D machine learning tasks. It includes data captured from 90 properties using a Matterport Pro Camera, offering a rich resource for researchers and developers. The repository provides raw data, derived data, annotated data, and scripts/models for various scene understanding tasks such as image keypoint matching, view overlap prediction, surface normal estimation, region type classification, and semantic voxel labeling. It also includes tools for loading and viewing the data, making it a valuable asset for advancing research in indoor environment understanding.

Metrics

Metrics

58%

Metrics is an open-source toolbox offering implementations of various supervised machine learning evaluation metrics across multiple programming languages. Developers and researchers can utilize this tool to assess model performance in Python, R, Haskell, and MATLAB/Octave environments. It includes a wide array of metrics such as Absolute Error, Area Under the ROC, F1 Score, Log Loss, Mean Absolute Error, Mean Squared Error, and Root Mean Squared Error. The project is currently in a beta release, focusing on ensuring compatibility and functionality across its supported language repositories. It aims to provide a comprehensive suite for evaluating machine learning models.

MIDI-3D

MIDI-3D

58%

MIDI-3D is a cutting-edge 3D generative model designed for creating complex 3D scenes from a single input image. Unlike traditional methods that rely on reconstruction or multi-stage object generation, MIDI-3D leverages multi-instance diffusion models to simultaneously generate multiple high-quality 3D instances. This approach ensures accurate spatial relationships between objects and offers high generalizability, even with real and stylized image inputs, despite being trained on synthetic data. The tool is highly efficient, generating 3D scenes from segmented instance images without lengthy optimization steps. It also supports textured 3D scene generation, making it a powerful solution for various 3D content creation needs.

mmrazor

mmrazor

58%

mmrazor is a comprehensive model compression toolkit and benchmark developed as part of the OpenMMLab project. It offers four mainstream technologies: Neural Architecture Search (NAS), Pruning, Knowledge Distillation (KD), and Quantization. Designed for flexibility and compatibility, mmrazor can be easily integrated with various OpenMMLab projects and allows for plug-n-play incorporation of different algorithms. Its modular design enables developers to implement new model compression algorithms with minimal code or by modifying configuration files. The toolbox supports a wide range of algorithms within each category, including DARTS, DetNAS, SPOS for NAS; AutoSlim, L1-norm, Group Fisher, DMCP for pruning; and various methods like CWD, WSLD, ABLoss for KD. It also includes PTQ, QAT, and LSQ for quantization, making it a versatile tool for optimizing deep learning models.

motion_imitation

motion_imitation

58%

motion_imitation is a code repository accompanying the paper "Learning Agile Robotic Locomotion Skills by Imitating Animals." It provides a Gym environment for training a simulated quadruped robot to imitate various reference motions, offering example training code for learning policies. The tool supports Python 3.7 or 3.8 on Ubuntu, MacOS, and Windows, and can be installed as a pip package. It includes features for training and testing imitation models, working with motion capture data, and implementing locomotion using Model Predictive Control (MPC). The repository also details how to run MPC on real A1 robots, making it a comprehensive resource for researchers and developers in robotic locomotion.

MotioNet

MotioNet

58%

MotioNet is a deep neural network designed to reconstruct 3D human skeletal motion directly from monocular video. This library provides the source code for the network, which is based on a common motion representation. A key feature is its ability to output BVH files directly, eliminating the need for additional post-processing steps. The tool supports evaluation on both Human3.6m and wild videos, with integration for 2D pose detection tools like Openpose. Users can train models from scratch with customizable parameters or utilize provided pre-trained models for quick starts. It offers visualization through TensorBoardX for tracking training progress and includes detailed instructions for data preparation and testing. While powerful, it has limitations regarding moving cameras and dependence on 2D detection accuracy, which users should consider.

neurodiffeq

neurodiffeq

58%

neurodiffeq is an open-source Python library built on PyTorch, designed for solving ordinary and partial differential equations (ODEs and PDEs) using neural networks. It provides a flexible framework for implementing existing techniques of using artificial neural networks (ANNs) to approximate solutions. Unlike traditional numerical methods, neurodiffeq aims to compute continuous and differentiable solutions. The library supports various features including solving systems of ODEs and PDEs, handling initial and boundary conditions, and customizing network architectures. It also offers tools for monitoring training progress, implementing transfer learning, and defining custom sampling strategies for training points. Additionally, neurodiffeq supports solving solution bundles and inverse problems, making it suitable for complex scientific and engineering applications.

Neuton TinyML

Neuton TinyML

58%

Neuton TinyML, part of the Nordic Edge AI Lab, is a platform designed for building and deploying ultra-compact AI models specifically optimized for Nordic System-on-Chips (SoCs). It caters to both CPU-run edge AI with Neuton's self-growing models and NPU-enabled devices with LiteRT models, requiring no-code for wake word models and LiteRT configuration. The platform simplifies the AI development process into three steps: data upload, automated or configured model training, and deployment. It supports various intelligent applications like gesture recognition, anomaly detection, and health monitoring, focusing on low-power consumption, balanced memory and performance, and extended battery life for always-on sensing. It also includes data preprocessing tools like windowing, feature extraction, and selection, alongside model analysis features such as quality diagrams and confusion matrices.

openai-proxy

openai-proxy

58%

openai-proxy is an open-source solution designed to provide a proxy for OpenAI/ChatGPT APIs, primarily to circumvent geographical access restrictions. Many users encounter issues where OpenAI detects API access from certain regions, leading to account suspensions. This tool offers a workaround by allowing developers to deploy a proxy on a server in an unrestricted location. It supports deployment via Deno Deploy or CloudFlare Workers, making it flexible for various setups. The project provides clear instructions for integration with official OpenAI/ChatGPT npm and Python packages, simplifying the process for developers to continue using OpenAI's services seamlessly.

object-detection-opencv

object-detection-opencv

58%

object-detection-opencv provides a Python-based solution for object detection using the YOLO (You Only Look Once) framework, integrated with OpenCV's dnn module. This tool allows developers to perform inference on pre-trained deep learning models from popular frameworks like Caffe, Torch, and TensorFlow. Specifically, it leverages YOLOv3 weights for efficient object detection in images. The project is open-source and available on GitHub, offering a practical example for computer vision tasks. It's particularly useful for those looking to implement object recognition capabilities in their applications using Python and OpenCV, providing a foundation for further development in areas like real-time video analysis or image processing.

OpenChem

OpenChem

58%

OpenChem is a deep learning toolkit specifically designed for computational chemistry and drug design research, built with a PyTorch backend. Its primary goal is to simplify the application of deep learning models for researchers in these fields. Key features include a modular design with a unified API, allowing for easy combination of different modules, and the ability to build new models using only a configuration file. The toolkit supports fast training with multi-GPU capabilities and provides utilities for data preprocessing. It also integrates with Tensorboard for visualization. OpenChem handles various tasks such as classification, regression, multi-task learning, and generative models, supporting data types like character sequences (SMILES, amino acids) and molecular graphs, with automatic conversion of SMILES to graphs.

Open Persian ASR Leaderboard

Open Persian ASR Leaderboard

58%

The Open Persian ASR Leaderboard is a platform designed for evaluating and ranking Automatic Speech Recognition (ASR) models specifically for the Persian language. It enables users to submit their own ASR models by providing the model name in the format "user_name/model_name" and have them assessed against a standardized benchmark. This tool facilitates comparison of different models, helping researchers and developers identify top-performing ASR systems for Persian. The leaderboard provides a transparent and accessible way to track advancements and performance metrics in Persian ASR, fostering competition and innovation within the field.

RLzoo

RLzoo

58%

RLzoo is a comprehensive open-source reinforcement learning zoo designed for simple usage, implemented with TensorFlow 2.0 and leveraging the neural network layer APIs of TensorLayer2.0+. It offers a hands-on approach for reinforcement learning practices and benchmarks, supporting basic toy-tests like OpenAI Gym and DeepMind Control Suite with minimal configuration. Additionally, RLzoo supports robot learning environments such as RLBench. The platform provides both implicit and explicit configuration interfaces for running learning algorithms, making it flexible and convenient for users. It also supports distributed training across multiple computational nodes using the Kungfu package, catering to more realistic and large-scale scenarios.

roboflow-python

roboflow-python

58%

Roboflow-python is an open-source Python package designed to streamline the development of computer vision applications. It provides a comprehensive set of tools for managing datasets, training models, and deploying them efficiently. The package supports a wide range of computer vision tasks, making it a versatile choice for developers working on object detection, image classification, and other related projects. Its open-source nature fosters community collaboration and allows for flexible integration into existing workflows, providing a robust foundation for building and experimenting with AI-powered vision systems.

relational-networks

relational-networks

58%

relational-networks is an open-source Pytorch implementation of the "A simple neural network module for relational reasoning" paper, also known as Relational Networks. This tool is designed for researchers and developers working on visual reasoning and relational AI tasks. It has been thoroughly tested on the Sort-of-CLEVR task, a simplified version of CLEVR, which involves processing images with various colored shapes and answering both relational and non-relational questions. The implementation demonstrates superior performance compared to traditional CNN + MLP models, particularly in relational reasoning tasks, and includes modifications for improved computational efficiency.

rl-agents

rl-agents

58%

rl-agents is an open-source project providing a comprehensive collection of Reinforcement Learning agent implementations. This tool is designed for researchers and developers working in the field of AI, offering a variety of planning and learning algorithms. It serves as a valuable resource for experimentation and building new RL applications. The project's open-source nature fosters community contributions and allows for flexible integration into diverse research and development environments, making it suitable for both academic and practical applications in reinforcement learning.

slime

slime

58%

slime is an advanced post-training framework designed for Reinforcement Learning (RL) scaling, specifically tailored for large language models. It achieves high-performance training by seamlessly integrating Megatron with SGLang, enabling efficient and scalable operations. The framework supports flexible data generation through custom data workflows, allowing users to adapt to various training requirements. slime facilitates efficient training across different modes, making it a versatile solution for developers and researchers working with large language models and RL applications. Its focus on performance and flexibility makes it suitable for complex AI development tasks.