Coding & Development
Browsing page 140 of AI tools for Open Source & Models in Coding & Development. Sorted by confidence score — our independent quality rating.
StateSmith
StateSmith is a cross-platform, free/open-source tool designed for generating state machines in various programming languages. It creates human-readable code with zero dependencies, making it highly suitable for diverse applications ranging from tiny bare metal microcontrollers to video games, apps, and web platforms. A key differentiator is its avoidance of dynamic memory allocations, which is crucial for safety and performance-critical embedded systems. StateSmith ensures the diagram is always accurate by generating fully working code directly from the documentation, eliminating the common problem of manual synchronization between code and diagrams. The tool boasts a strong suite of over 730 integration tests that verify behavior across all supported languages, providing confidence in its reliability and allowing for refactoring without breaking specified behavior.
stable-diffusion-webui-model-toolkit
stable-diffusion-webui-model-toolkit is a comprehensive toolkit designed for managing, editing, and creating models within the Stable Diffusion WebUI environment. It offers essential features such as cleaning and pruning models to reduce bloat, converting models to and from safetensors format, and extracting or replacing individual model components like VAE, UNET, and CLIP. The toolkit also assists in identifying and debugging model architectures, providing detailed reports on matched and rejected architectures. A unique metric system helps identify model weights, even for renamed components. This tool is invaluable for developers looking to optimize, customize, and troubleshoot their Stable Diffusion models.
TencentPretrain
TencentPretrain is a powerful PyTorch-based framework designed for pre-training and fine-tuning AI models, supporting various data modalities including text and vision. Its modular architecture facilitates the use of existing pre-training models and provides clear interfaces for users to further develop and customize their own models. This makes it an ideal solution for researchers and developers looking to experiment with or deploy advanced AI models. The framework emphasizes flexibility and extensibility, allowing for adaptation to diverse research and application needs in the AI domain.
UniDetector
UniDetector is an open-source computer vision tool designed for universal object detection, providing the code release for the CVPR 2023 paper "Detecting Everything in the Open World: Towards Universal Object Detection." Built upon mmdetection v2.18.0 and requiring CLIP, this tool facilitates both single-dataset and multi-dataset training, as well as open-world inference. It supports end-to-end and decoupled training/inference workflows, including probability calibration. UniDetector is ideal for researchers and developers working on advanced object detection tasks, offering robust capabilities for preparing datasets, language CLIP embeddings, and pre-trained RegionCLIP parameters.
SECUIRA - Artificial Intelligence Services UAE
SECUIRA is a rapidly expanding homeland security company based in the UAE, dedicated to providing comprehensive, end-to-end solutions. Leveraging internal research and strategic mergers with security technology firms, SECUIRA focuses on critical city surveillance projects and other technology-driven initiatives across the Gulf and Asia. Their services encompass consultancy, project management, design, and integration of various physical security technologies, including IP CCTV, License Plate Recognition (LPR), Face Recognition, and IoT. With a team boasting over 20 years of experience in government projects, SECUIRA emphasizes innovation and technology benchmarking to deliver tailored solutions that meet evolving client requirements and adapt to the latest market trends.
VideoLLaMA3
VideoLLaMA3 is an open-source project offering a series of multimodal foundation models designed for advanced image and video understanding. It provides models like VideoLLaMA3-7B and VideoLLaMA3-2B, which are capable of tasks ranging from general image and video comprehension to more specialized applications such as multi-image comparison, visual referring, and grounding. The project includes detailed instructions for inference, training, and evaluation, making it suitable for researchers and developers. It supports various benchmarks for performance assessment and offers a flexible framework for preparing custom training data. The models are available on Hugging Face, facilitating easy access and integration into AI development workflows.
Themis AI
Themis AI offers Capsa, a model-agnostic uncertainty quantification platform designed to make any AI model safe and reliable. Capsa seamlessly integrates with existing ML models, such as those built with PyTorch and TensorFlow, allowing developers to quantify model uncertainty and de-risk outputs in seconds. This technology helps detect and correct unreliable outputs, ensuring consistent high-quality results across various applications. Key use cases include reducing costs in drug discovery through uncertainty-guided predictions, enabling risk-aware human intervention for autonomous vehicles, and detecting hallucinations in generative models. Themis AI focuses on providing robust AI quality assurance and compliance.
illuminate tech
Illuminate Tech is a bespoke advisory firm founded by former online safety regulators, dedicated to breaking down barriers to a safer, more trusted internet. They specialize in making online safety compliance smarter through their product, OSCAR. This platform empowers services to manage risk, adapt to fast-changing regulations, and implement pre-built compliance workflows with big-firm precision at a fraction of the cost. Beyond OSCAR, Illuminate Tech offers research and advisory services that contribute to the global conversation on online safety. Their vision is to provide every tech company with the tools needed to anticipate and address harm proactively, ensuring sustainable growth and effective safety tech implementation.
HarmonAI
HarmonAI is an open-source initiative from a Stability AI Lab, dedicated to releasing generative audio tools that enhance music production accessibility and enjoyment. The platform empowers musicians and creators to generate their own custom, infinite sound libraries, fostering boundless creativity. Developed by musicians for musicians, HarmonAI aims to return creative control to artists by providing powerful AI-driven tools. It focuses on making advanced audio generation technology available to everyone, promoting a more inclusive and innovative music production landscape.
sample-factory
Sample Factory is a high-throughput reinforcement learning codebase, recognized as one of the fastest RL libraries for efficient synchronous and asynchronous implementations of policy gradients (PPO). It is thoroughly tested and utilized by numerous researchers and practitioners, consistently achieving state-of-the-art performance across diverse domains like ViZDoom, IsaacGym, and Mujoco, while optimizing training time and hardware usage. Key features include highly optimized algorithm architecture, support for single- and multi-agent training, population-based training (PBT), and various action and observation spaces. The library automatically creates model architectures and supports custom designs, offering detailed WandB and Tensorboard summaries, HuggingFace integration, and multiple environment examples with tuned parameters.
Veria Labs
Veria Labs offers automated offensive security solutions, designed to help high-stakes industries identify and remediate vulnerabilities across their entire attack surface. The platform integrates with Git repositories and cloud environments to deeply analyze applications for security flaws, from code paths to cloud infrastructure. It generates proof-of-concept exploits that run directly against staging environments, ensuring the detection of real vulnerabilities. For each identified vulnerability, Veria Labs provides actionable reports and suggested patches, enabling rapid application security. Backed by Y Combinator and founded by a top US hacking team, Veria Labs aims to make getting hacked a thing of the past by matching the speed and scale of modern development.
AIRS
AIRS, or Artificial Intelligence Research for Science, is an open-source initiative offering a comprehensive collection of software tools, datasets, and benchmarks. It is specifically designed to support research in AI for quantum mechanics, density functional theory, small molecules, protein science, materials science, molecular interactions, biological science, partial differential equations, and ordinary differential equations. The project's goal is to foster an integrated, open, reproducible, and sustainable set of resources to advance the emerging field of AI for Science. It includes various methods and resources, with the list continuously expanding as research progresses, making it a valuable resource for academic and scientific communities.
nib
nib is an open-source Stylus library designed to streamline frontend development by offering a comprehensive collection of mixins, utilities, and components. It also includes advanced features like gradient image generation, which can be enabled by installing node-canvas. Developers can integrate nib into their projects using npm and leverage its functionalities either by importing the entire library or by selectively choosing specific modules like gradients or normalize. This tool is particularly useful for those working with Stylus to create efficient and maintainable stylesheets.
TinyLLaVA_Factory
TinyLLaVA_Factory is an open-source modular codebase designed for building small-scale large multimodal models (LMMs). Implemented in PyTorch and HuggingFace, it emphasizes simplicity, extensibility, and reproducibility. The framework allows users to customize their own LMMs with reduced coding effort and fewer mistakes. It integrates a suite of cutting-edge models and methods, supporting LLMs like OpenELM, TinyLlama, StableLM, Qwen, Gemma, and Phi. For vision towers, it includes CLIP, SigLIP, Dino, and combinations thereof. Connectors such as MLP, Qformer, and Resampler are also supported, alongside various training recipes including Frozen/Fully/Partially tuning and LoRA/QLoRA tuning. The project provides trained models, performance benchmarks, and local demo options for Gradio web and CLI inference.
ant-design-x-vue
Ant Design X Vue is an open-source Vue UI library designed to accelerate the development of AI-powered interactive pages. It offers a rich set of components based on the RICH interaction paradigm, covering most AI conversation scenarios. The library facilitates quick integration with OpenAI-standard model inference services and provides robust data flow management features for efficient development. Built with TypeScript, it ensures full type support, enhancing developer experience and reliability. Additionally, it offers fine-grained style adjustments to meet diverse customization needs and includes various templates to jumpstart LUI application development.
word2vec-api
word2vec-api is a straightforward web service designed to expose word embedding models through a simple API. Built upon the Gensim Word2Vec implementation, it supports models in both Word2Vec text and binary formats. The service is easy to launch and configure, requiring users to specify the model path, host, and port. It provides various endpoints for common word embedding tasks such as calculating similarity between words, finding most similar words, and retrieving word vectors. This tool is particularly useful for developers and data scientists who need to integrate word embeddings into their applications or research projects without building the serving infrastructure from scratch.
aima-javascript
aima-javascript is an open-source project dedicated to visualizing algorithms and concepts from Russell and Norvig's influential book, "Artificial Intelligence — A Modern Approach." Unlike its sibling projects that prioritize code, aima-javascript primarily focuses on interactive visualizations to help users grasp complex AI principles. It also includes the corresponding Javascript code for these algorithms. This tool is ideal for students, teachers, and researchers who want to explore and understand AI algorithms through practical, visual examples, making abstract concepts more tangible and accessible. The project aims to provide a rich learning experience for anyone studying artificial intelligence.
awesome-mlops
awesome-mlops is a comprehensive, open-source curated list of tools specifically designed for Machine Learning Operations (MLOps). This GitHub repository serves as a valuable resource for developers and data scientists looking to streamline their ML workflows. It categorizes tools across numerous MLOps stages, including AutoML, CI/CD for Machine Learning, Data Cataloging, Data Enrichment, Data Exploration, Data Management, Data Processing, Data Validation, Data Visualization, Drift Detection, Feature Engineering, Feature Stores, Hyperparameter Tuning, Knowledge Sharing, Machine Learning Platforms, Model Fairness and Privacy, Model Interpretability, Model Lifecycle, Model Serving, Model Testing & Validation, Optimization Tools, Simplification Tools, Visual Analysis and Debugging, and Workflow Tools. The list is inspired by awesome-python, making it a well-structured and easy-to-navigate collection.
Alink
Alink is an open-source machine learning algorithm platform built on Apache Flink, developed by the PAI team of Alibaba computing platform. It offers a wide array of machine learning algorithms for various tasks, including classification, regression, clustering, and recommendation systems. Alink supports both batch and stream processing, making it suitable for real-time AI applications. The platform provides Python (PyAlink) and Java APIs, allowing developers to integrate it into their existing workflows. It is designed for scalability and efficiency, leveraging Flink's distributed processing capabilities to handle large datasets and complex machine learning models. Alink also includes tools for feature engineering, model training, and deployment, making it a comprehensive solution for data scientists and developers working on AI projects.
AttentionDeepMIL
AttentionDeepMIL offers a PyTorch implementation of Attention-based Deep Multiple Instance Learning, a technique detailed in the paper "Attention-based Deep Multiple Instance Learning" by Ilse, Tomczak, and Welling. This open-source tool is designed for researchers and developers to explore and apply attention mechanisms within deep learning models, particularly in the context of multiple instance learning. It includes code for running MNIST-BAGS experiments and provides guidance for adapting the model to histopathology datasets like Breast Cancer and Colon Cancer. The implementation features a modified LeNet-5 model with Attention-based MIL pooling and uses the negative log-likelihood of the Bernoulli distribution as its objective function. It's a valuable resource for those looking to replicate or extend research in this specialized area of deep learning.
approachingalmost
approachingalmost is a GitHub repository that serves as a companion to the book "Approaching (Almost) Any Machine Learning Problem." This resource offers valuable insights and practical code examples for individuals looking to tackle various machine learning challenges. While the repository does not share the complete code from the book, it provides environment files and references to datasets, encouraging users to code along and understand the concepts. It is designed to support machine learning practitioners and data scientists in their learning journey, offering a structured approach to solving complex problems. Users can find links to purchase the book from various regional Amazon stores and Pothi, as well as instructions for setting up the development environment.
ATM
ATM (Auto Tune Models) is an open-source AutoML system developed by the Data to AI Lab at MIT, focusing on simplifying machine learning model selection and tuning. It operates as a multi-tenant, multi-data system, allowing users to provide a classification problem and a dataset in CSV format. ATM then automatically searches for and builds the best predictive model. The system supports various data input methods, including local CSV files, AWS S3 buckets, and URLs. It offers a Python API for creating ATM instances, running model searches, and exploring results, including summaries of dataruns, best classifiers, and detailed scores. Users can export and load trained models for making predictions on new data. ATM is built on Python and is designed for both ease of use and scalability.
Bytez
Bytez is a comprehensive platform dedicated to open source AI, enabling users to stay informed about the latest developments, explore various models, and seamlessly integrate them into their projects. The platform facilitates the discovery of open source AI models, offering capabilities to demo their functionalities before deployment. Bytez aims to bridge the gap between AI research and practical application, providing a unified environment for interacting with a wide array of open source AI technologies. It serves as a valuable resource for individuals and organizations looking to leverage the power of open source AI for innovation and development.
giga-brain-0
GigaBrain-0 is an innovative vision-language-action (VLA) foundation model designed for generalist robots, addressing the high cost and time consumption of collecting large-scale real-world robot data. By utilizing world models to generate diverse data at scale, GigaBrain-0 substantially decreases the need for real robot data while enhancing cross-task generalization. The model also improves policy robustness through RGBD input modeling and embodied Chain-of-Thought (CoT) supervision, enabling it to reason about spatial geometry, object states, and long-horizon dependencies. This leads to significant performance gains in dexterous, long-horizon, and mobile manipulation tasks, demonstrating superior generalization across variations in appearances, object placements, and camera viewpoints. The project also includes GigaBrain-0.1, which scales training data to 10k hours and achieves first place on the RoboChallenge leaderboard.