ShypdShypd.ai
💻

Coding & Development

Browsing page 133 of AI tools for Open Source & Models in Coding & Development. Sorted by confidence score — our independent quality rating.

safety-gymnasium

safety-gymnasium

59%

Safety-Gymnasium is an open-source library designed for Safe Reinforcement Learning (SafeRL), offering a highly scalable and customizable platform for benchmarking and developing safe AI algorithms. It provides a standardized set of environments, including safe navigation, velocity, vision, multi-agent, and Isaac Gym tasks, all compatible with the new Gym API. The library explicitly expresses costs, returning six items including cost in its step method, and offers convenience wrappers for converting between Safety-Gymnasium and standard Gymnasium APIs. It aims to facilitate research by providing an elegant code framework and well-designed environments, allowing users to explore new insights in SafeRL. The platform is built on MuJoCo 2.3.0+ and addresses previous limitations of similar libraries, such as dependency issues and lack of vectorized environments.

StabilityMatrix

StabilityMatrix

59%

StabilityMatrix is a multi-platform package manager and inference UI designed to simplify the use of Stable Diffusion. It provides one-click installation and updates for popular Stable Diffusion Web UIs like Automatic1111, ComfyUI, and Fooocus. The tool features an embedded Git and Python, eliminating the need for global installations, and is fully portable. StabilityMatrix includes a powerful inference UI with auto-completion and syntax highlighting, a checkpoint manager for shared models, and a model browser to import from CivitAI and HuggingFace with pause/resume download capabilities. It also supports managing plugins/extensions and offers a configurable launcher with a syntax-highlighted terminal.

Mistral AI

Mistral AI

59%

Mistral AI provides a powerful AI platform designed for enterprises to build and deploy advanced AI systems. Users can customize, fine-tune, and deploy AI assistants, autonomous agents, and multimodal AI using state-of-the-art open models. The platform supports various deployment options, including on-premises, cloud, edge, and devices, ensuring full data control. Key offerings include Le Chat for autonomous work, Vibe for autonomous coding, Studio for AI application development, Forge for custom model development, and Applied AI for advanced R&D. Mistral AI emphasizes deep engagement and hands-on assistance from AI scientists for deployment, solutioning, and safety.

speech-to-text-wavenet

speech-to-text-wavenet

59%

Speech-to-Text-WaveNet is an open-source project offering an end-to-end sentence-level English speech recognition system. Built upon DeepMind's WaveNet architecture and implemented with TensorFlow, this tool provides a robust foundation for researchers and developers in the field of audio processing. It allows users to train and test speech recognition models using datasets like VCTK, LibriSpeech, and TEDLIUM. Key features include pre-processing audio data into MFCC features, training with CTC loss, and transforming speech wave files into English text. The project also highlights areas for future development, such as integrating language models and supporting polyglot recognition, making it a valuable resource for advancing speech AI.

text-to-image

text-to-image

59%

text-to-image is an experimental TensorFlow implementation for synthesizing images from captions. This open-source tool leverages Skip Thought Vectors to understand the textual input and the GAN-CLS (Generative Adversarial Network - Conditional Latent Space) algorithm to generate corresponding images. It allows users to create images based on descriptive text, with the model currently trained on the flowers dataset. The implementation provides options for data processing, training with customizable parameters like noise dimension, batch size, and learning rate, and generating images from user-provided captions. It's a valuable resource for researchers and developers interested in the mechanics of text-to-image synthesis.

draw

draw

59%

draw is an open-source TensorFlow implementation of the "DRAW: A Recurrent Neural Network For Image Generation" model, specifically applied to the MNIST dataset. This project prioritizes simplicity and clarity, aiming to closely mirror the raw mathematical equations from the original paper. It allows users to train the DRAW model with or without attention mechanisms for both reading and writing, and provides pre-trained weights for quick experimentation. The implementation includes scripts for data visualization and offers a gentle walkthrough for understanding the paper and its code.

TextAttack

TextAttack

59%

TextAttack is an open-source Python framework designed for adversarial attacks, data augmentation, and model training in Natural Language Processing (NLP). It provides a comprehensive library of components and pre-implemented attack recipes, allowing users to generate adversarial examples to test the robustness of NLP models. The framework supports various attack types, including word-level substitutions, character-level perturbations, and attacks on sequence-to-sequence models. Beyond attacks, TextAttack facilitates data augmentation to enhance model generalization and robustness, and offers capabilities for training NLP models with a single command. It is ideal for researchers and developers looking to explore model vulnerabilities and improve model resilience.

tinyengine

tinyengine

59%

TinyEngine is the official implementation of a memory-efficient and high-performance neural network library specifically designed for Microcontrollers. As a core component of MCUNet, a system-algorithm co-design framework, TinyEngine works in conjunction with TinyNAS to facilitate tiny deep learning on IoT devices with extremely tight memory budgets. It significantly outperforms existing inference libraries like TF-Lite Micro, CMSIS-NN, and X-CUBE-AI by improving inference speed by 1.1-18.6x and reducing peak memory by 1.3-3.6x. Key optimization techniques include in-place depth-wise convolution, patch-based inference, operator fusion, SIMD programming, and various loop optimizations to enhance performance and minimize memory footprint.

uis-rnn

uis-rnn

59%

uis-rnn is a Python library for the Unbounded Interleaved-State Recurrent Neural Network (UIS-RNN) algorithm, primarily used for fully supervised speaker diarization. This algorithm excels at segmenting and clustering sequential data by learning from examples. The library provides core APIs for model construction, training, and prediction, allowing users to fit models with observation sequences and ground truth cluster IDs. It supports both list-based and concatenated sequence inputs, with careful handling of cluster ID uniqueness. The tool is particularly useful for tasks like identifying who spoke when in audio recordings, leveraging d-vector embeddings as observations. It also offers guidelines for training on large datasets by calling the fit() function multiple times with appropriately sized inputs.

Upscale-A-Video

Upscale-A-Video

59%

Upscale-A-Video is an open-source, diffusion-based model designed for real-world video super-resolution, enhancing video quality with temporal consistency. Developed by S-Lab at Nanyang Technological University and presented at CVPR 2024, this tool takes low-resolution videos and text prompts as inputs to generate high-resolution outputs. It provides inference code for various video types, including AIGC videos, old movies, and animations. Users can also address color discrepancies with options like "AdaIn" or "Wavelet" color fixes. The project includes the YouHQ dataset, comprising over 38,000 videos for training and 40 for evaluation, making it a valuable resource for researchers and developers in the video processing domain.

Protect AI

Protect AI

59%

Protect AI offers a comprehensive platform designed to secure AI applications across their entire lifecycle, from model selection and testing to deployment and runtime. Their suite of products, including Guardian for AI model security, Recon for scalable red teaming, and Layer for runtime security, operate on a unified platform. The company emphasizes a "Secure by Design" approach, shifting from reactive to proactive security measures against evolving AI threats. Protect AI also leverages extensive threat research, backed by a community of over 17,000 security researchers and partnerships with industry leaders like Hugging Face, to continuously feed its products with the latest threat intelligence. This ensures robust protection and helps organizations stay ahead of attackers.

wasi-nn

wasi-nn

59%

wasi-nn is a proposed WebAssembly System Interface (WASI) API designed for performing machine learning (ML) inference within WebAssembly environments. It aims to simplify the integration of existing ML models, such as those from TensorFlow, ONNX, and OpenVINO, into WASI applications. The API focuses on ease of use by allowing users to load models as opaque byte sequences and achieve high performance through hardware acceleration like GPUs, TPUs, and FPGAs. While primarily focused on inference, the project acknowledges future potential for ML training. It is currently in Phase 2 of development, emphasizing a framework- and model-agnostic approach to support diverse ML frameworks and formats.

DVC AI

DVC AI

59%

DVC AI, or Data Version Control, is an open-source system designed to bring software engineering best practices to data science and machine learning workflows. It offers a Git-like experience for managing data, models, and experiments, enabling teams to version and track large datasets and model artifacts. DVC AI integrates with Git for local workflows and also offers an enterprise solution, lakeFS, for highly scalable data version control in complex AI operations and big data environments. It supports various remote storage options like Amazon S3, Azure Blob Storage, Google Cloud Storage, and more. DVC AI helps users manage data pipelines, track metrics and parameters, and conduct experiment management, ensuring reproducibility and collaboration in AI/ML projects.

AI Repository Search

AI Repository Search

59%

AI Repository Search is a community-curated directory designed to help users discover open-source GitHub repositories. It leverages AI to allow users to search for repositories by describing their ideas in plain English, with the AI ranking results by relevance. Users can also browse repositories using various filters such as language, license, topics, activity, release cadence, issue health, documentation, built with, form factor, issue load, category, maintainer, model maturity, persona, platform, release pattern, security posture, use case, ecosystem, stack, status, features, and license. The platform also offers curated searches and the ability to save findings, making it a valuable resource for developers and researchers looking for specific open-source projects.

Privacy AI App

Privacy AI App

59%

Privacy AI App is a unique application designed for iPhone, iPad, and Mac users seeking ultimate privacy and control over their AI interactions. It functions as an offline AI chatbot hub, ensuring all data processing occurs directly on the device, eliminating the need for an internet connection and safeguarding user information. The app supports powerful open-source AI models like LLaMA, Mistral, Phi3, StableLM, and Gemma2, which are optimized for superior performance on Apple devices. Users can enhance their AI experience through extensive customization options, including adjusting sampling temperature, system prompts, and Top-p values, allowing for a truly personalized interaction. Privacy AI App offers a consistent experience across all compatible devices and prioritizes user-centered development with a focus on data privacy and security.

Open CoWork

Open CoWork

59%

Open CoWork is a free, open-source AI agent designed to empower users with advanced automation capabilities. This versatile tool allows for seamless control over web browsers and local applications, making it an ideal solution for a wide range of automation tasks. Its open-source nature means it can be extended with custom skills, providing developers and technical users with the flexibility to tailor its functionality to specific needs. Available for macOS, Windows, and Linux, Open CoWork offers a robust platform for creating and customizing AI agents, enabling efficient automation and enhanced productivity across various operating environments.

DoubleCloud

DoubleCloud

59%

DoubleCloud offers a comprehensive platform for building data analytics infrastructure, leveraging fully managed open-source solutions like ClickHouse, Apache Kafka, and Apache Airflow. The platform streamlines data pipelines from ingestion to visualization, providing integrated, reliable, and zero-maintenance services. Key offerings include a no-code ELT tool for real-time data syncing, and a managed open-source Data Visualization tool for creating dashboards and charts. Designed for engineers, DoubleCloud focuses on exceptional performance, security with ISO 27001, SOC 2, GDPR compliance, and cost-effectiveness through a pay-as-you-go model and hybrid storage options. It aims to simplify day-to-day operations by handling routine maintenance, allowing engineers to focus on product development.

BentoML

BentoML

59%

BentoML offers a comprehensive inference platform designed for speed and control, enabling users to deploy any AI model anywhere. It provides tailored inference optimization, efficient scaling, and streamlined operations for various models, including LLMs and custom architectures. The platform simplifies inference infrastructure while offering full control over deployment, supporting open-source models and custom models alike. Key features include deployment automation, CI/CD, comprehensive observability, and fine-grained access control. BentoML also offers intelligent resource management with cross-region scaling, elastic auto-scaling, and cold-start acceleration, ensuring optimal compute utilization across various cloud environments or on-premises Kubernetes.

OpenHGNN

OpenHGNN

59%

OpenHGNN is an open-source toolkit designed for Heterogeneous Graph Neural Networks (HGNNs), built upon the Deep Graph Library (DGL) and PyTorch. It aims to facilitate research and development in heterogeneous graph-based machine learning by integrating state-of-the-art HGNN models. The toolkit offers easy-to-use interfaces for conducting experiments and supports various tasks including node classification, link prediction, and recommendation. Key features include extensibility for user-defined tasks, models, and datasets, efficiency through DGL's backend, and tools for hyperparameter optimization and visualization. It also supports mini-batch training and distributed training for large-scale graphs.

plock

plock

59%

Plock is an open-source, local-first development tool designed to streamline interaction with large language models (LLMs) and other scripts directly from any text input area. Users can write a prompt, select it, and trigger a command (e.g., Cmd+Shift+.) to replace the selected text with the streaming output of an LLM or custom script. It also supports using clipboard content as context for prompts. Plock is highly customizable via a `settings.json` file, allowing users to define shortcuts, integrate various models (like Ollama, GPT, Perplexity), and chain multiple actions. It emphasizes local execution by default but can be configured to use remote APIs through shell scripts. The tool is cross-platform, supporting Mac, Linux, and Windows (with some caveats).

talking-head-anime-3-demo

talking-head-anime-3-demo

59%

talking-head-anime-3-demo provides demo programs for animating anime characters using a single image. This open-source project allows users to manipulate a character's facial expression, head rotation, body rotation, and chest expansion through a graphical user interface. Additionally, it supports transferring real-time facial motion from an iOS device to an anime character. The tool requires a powerful Nvidia GPU and specific software environments (Python, PyTorch, etc.) to run. It's designed for users interested in AI-driven animation, offering different neural network variants that balance size, speed, and accuracy. The project is released under an MIT license for the code and Creative Commons Attribution 4.0 International License for the models.

codespaces-jupyter

codespaces-jupyter

59%

codespaces-jupyter offers a ready-to-use development environment within GitHub Codespaces, specifically tailored for machine learning and data science projects. It comes pre-configured with Python and Jupyter notebooks, allowing users to immediately dive into their work without extensive setup. This tool provides a blank canvas for new projects, enabling users to explore and experiment with data science concepts. The environment is self-contained within a single codespace, offering flexibility for development. Users can choose to publish their work to a GitHub repository when ready or simply delete the codespace if it was for exploration, making it ideal for quick prototyping and learning.

clip-as-service

clip-as-service

59%

clip-as-service is an open-source tool designed for scalable embedding, reasoning, and ranking of images and text using the CLIP model. It can be easily integrated as a low-latency, high-scalability microservice into neural search solutions. Key features include fast serving of CLIP models with TensorRT, ONNX runtime, and PyTorch, offering up to 800QPS. It supports elastic scaling of multiple CLIP models on a single GPU with automatic load balancing. The tool provides an easy-to-use, minimalist API for both image and sentence embedding, supporting async clients and various protocols like gRPC, HTTP, and WebSocket. It also integrates smoothly with the Jina and DocArray neural search ecosystem, enabling the rapid building of cross-modal and multi-modal solutions.

ogb

ogb

59%

OGB (Open Graph Benchmark) offers a comprehensive suite of benchmark datasets, data loaders, and evaluators specifically designed for graph machine learning. It supports a wide array of graph ML tasks, including predictions at the node, link, and graph levels, and covers diverse real-world applications. The platform provides datasets of varying scales, from those processable on a single GPU to large-scale graphs requiring advanced techniques. OGB's data loaders are fully compatible with leading graph deep learning frameworks like PyTorch Geometric and Deep Graph Library (DGL), offering automatic dataset downloading, standardized splits, and unified performance evaluation. This ensures reliable comparison of different methods and facilitates research in graph machine learning.