💻

Coding & Development

Browsing page 91 of AI tools for Open Source & Models in Coding & Development. Sorted by confidence score — our independent quality rating.

All Backend & APIs Code Assistants Coding Agents Database & SQL DevOps & Infrastructure Documentation Frontend & UI Game Development Mobile Development No-Code / Low-Code Open Source & Models Prompt Engineering Testing & QA Vibe Coding Web Scraping & Automation

Drift Detector

60%

Drift Detector is an AI chatbot application hosted on Hugging Face Spaces, designed to facilitate the generation of chat responses using various AI models. Users can interact with the tool by inputting a message and then selecting a preferred AI model from a dropdown menu to receive a response. This functionality makes it suitable for experimenting with different AI agents and observing their conversational outputs. The tool is built using Gradio and is licensed under MIT, making it free to use and accessible for educational purposes and general experimentation within the AI community. The application also supports searching, though specific details on this feature are not provided.

pytorch-widedeep

60%

Pytorch-widedeep is a flexible open-source package designed for multimodal deep learning within the PyTorch framework. It enables users to effectively combine diverse data types, specifically tabular data with text and images, using Google's Wide and Deep Algorithm. The library supports various architectures, including those with single components like wide, deeptabular, deeptext, or deepimage, as well as complex combinations. It allows for the integration of custom models, provided they expose an `output_dim` property. The package includes preprocessors for different data types and a `Trainer` class to streamline model training, making it a comprehensive solution for advanced data scientists and developers working with complex, multi-modal datasets.

Danbooru Tags Transformer V2 with WD Tagger & Florence 2 Flux Captioner

60%

Danbooru Tags Transformer V2 with WD Tagger & Florence 2 Flux Captioner is an AI tool designed to assist users in creating detailed prompts for AI art generation. By uploading an image, users can leverage the power of WD Tagger and Florence 2 Flux Captioner models to automatically generate relevant tags and captions. The tool offers customization options for these generated prompts, allowing users to fine-tune them to their specific needs. Once satisfied, the prompts can be easily copied to the clipboard for use in various AI art generation platforms. This tool is hosted on Hugging Face Spaces, making it accessible for those looking to enhance their AI art creation workflow.

text-classification-surveys

60%

text-classification-surveys is an open-source GitHub repository dedicated to compiling extensive resources for text classification within Natural Language Processing (NLP). It offers a detailed overview of various models, ranging from deep learning approaches like SpanBERT, ALBERT, and BERT, to shallow learning techniques such as LightGBM, SVM, and Random Forest. The repository also covers a wide array of text classification datasets, including MR, SST, IMDB, and Yelp, alongside common evaluation metrics like accuracy, Precision, Recall, and F1. Furthermore, it addresses technical challenges, including multi-label text classification. The content is primarily derived from the paper "A Survey on Text Classification: From Shallow to Deep Learning," making it a valuable resource for researchers and students in the field.

Talk To Qwen Webrtc

60%

Talk To Qwen Webrtc is an AI tool designed for real-time voice interaction with the Qwen2Audio model, leveraging Gradio and WebRTC technologies. Users can speak into a microphone, and the application will transcribe their speech into text. Following transcription, the tool processes the audio input and generates a text-based response, enabling dynamic communication with an AI. This platform is hosted on Hugging Face Spaces, making it accessible for experimentation with AI-driven audio processing and voice agents. It offers a straightforward interface for those looking to explore speech-to-text and AI response generation capabilities.

DeepSeek OCR Demo

60%

DeepSeek OCR Demo is an interactive application built on Hugging Face Spaces, showcasing the capabilities of the DeepSeek-OCR model for optical character recognition. Users can upload various image types, including documents, charts, and scenes, and select from several processing tasks. These tasks include standard plain OCR for text extraction, conversion of document content into Markdown format, and specialized figure parsing. The tool also offers the ability to locate specific items within the uploaded content, making it versatile for different analysis needs. This demo provides a practical way to experience advanced OCR functionalities, catering to those interested in document analysis and data extraction from images.

Emu2

60%

Emu2 is a generative multimodal model developed by BAAI, designed for in-context learning and capable of processing both image and text inputs. This application, hosted on Hugging Face Spaces, enables users to generate various forms of content and engage in interactive chat experiences. By providing a combination of text and images, users can receive generated responses or participate in conversations, making it a versatile tool for multimodal AI research and experimentation. The model aims to push the boundaries of AI's ability to understand and create content across different modalities.

HF LLM API

60%

HF LLM API provides a straightforward interface for exploring and interacting with the HuggingFace Large Language Model API. Users can easily input text prompts and receive generated text responses, facilitating the testing and utilization of various large language models. This application is designed to simplify the process of working with LLMs, offering a practical way to experiment with different models and their outputs. It is hosted on Hugging Face Spaces, indicating its accessibility and potential for community-driven development and sharing. The tool's focus on direct interaction with the API makes it valuable for developers and researchers looking to integrate or test LLM capabilities.

Groq-LLaMA3/4

60%

Groq-LLaMA3/4 is a chat application built using Streamlit and the Groq API, hosted as a Hugging Face Space. It enables users to engage with various Llama models and other AI models available through a Groq account. The tool supports multimodal interactions, allowing users to upload and reference .txt, .md, or .pdf files within their conversations. Additionally, if the selected model has vision capabilities, users can upload images for analysis. This makes it a versatile platform for exploring and interacting with advanced AI models in a conversational format.

Enhance Ai Training Data

60%

Enhance Ai Training Data is a Hugging Face Space by Gretel.ai designed to generate high-quality synthetic training data. This tool helps users improve or evaluate their AI models by providing seed data in various formats and configuring generation options. While the direct application is currently experiencing a runtime error on its Hugging Face Space, the underlying concept focuses on creating synthetic datasets from existing text or data. This capability is crucial for AI developers and machine learning engineers looking to expand their training data without relying solely on real-world data, which can be scarce or sensitive.

Security-Copilot

60%

Microsoft Security Copilot is a generative AI-powered security solution designed to enhance the efficiency and capabilities of security and IT defenders. This open-source project aims to improve security outcomes at machine speed and scale, while adhering to responsible AI principles. The repository provides resources for contributing to the Security Copilot community, including guides for setting up a development environment, forking the repository, creating branches, and submitting pull requests. It includes sample prompts, plugins, and technical workshops to help users get started and integrate the solution into their security operations.

Pipeshift (YC S24)

60%

Pipeshift delivers the production infrastructure, tooling, and expertise needed to take AI products and agents to market quickly. It focuses on optimizing model runtimes to meet inference performance SLAs, with orchestration to scale real-time production workloads across various clouds and regions. The platform offers low latency, high throughput, fast cold-starts, and 99.99% uptime. Pipeshift allows users to serve open-source, custom, and fine-tuned AI models on infrastructure purpose-built for high-performance inference at massive scale. Key features include a Model API Sandbox, infrastructure observability, custom SLA-based auto-scaling, and increased GPU utilization through scheduling and bin-packing pipelines. Their proprietary framework, Modular Architecture for GPU Inference Clusters (MAGIC), adapts the inference stack in real-time for unique GenAI application needs.

Gemma-3-R1984-27B ChatBot

60%

Gemma-3-R1984-27B ChatBot is an AI-powered application designed to provide answers by analyzing various document types, including text, PDF, CSV, and TXT files. Users can upload their documents and then ask questions, receiving detailed responses derived directly from the content. This tool is built for reasoning and deep research, leveraging the Gemma-3 family of models. It is hosted on Hugging Face Spaces and benefits from the processing power of NVIDIA H100 GPUs, indicating a focus on robust performance for complex analytical tasks. The application aims to streamline information extraction and question-answering from diverse data sources.

DeepFake-Detection

60%

DeepFake-Detection is an open-source project by dessa-oss focused on advancing deepfake detection capabilities. It highlights the limitations of current state-of-the-art models, such as those based on FaceForensics++, in generalizing to real-life videos from platforms like YouTube. The tool offers a PyTorch implementation built on a pre-trained ResNet18 model, fine-tuned for deepfake detection. It conducts extensive experiments to demonstrate that existing datasets are often insufficient for robust real-world detection and proposes solutions involving the integration of more diverse data. The project emphasizes the need for detectors to be continuously updated with real-world data to effectively identify various manipulation techniques.

selfcheckgpt

60%

SelfCheckGPT is an open-source project designed for zero-resource, black-box hallucination detection in generative large language models. It provides several variants of the self-check approach, including BERTScore, Question-Answering (MQAG), n-gram, NLI, and LLM-Prompting. The tool allows developers and researchers to evaluate the factual consistency of AI-generated content by comparing it against sampled passages. It offers Python packages for easy installation and usage, with detailed examples provided for each self-check method. SelfCheckGPT also includes a dataset for evaluating hallucination detection, making it a valuable resource for improving the reliability of LLM outputs.

StableNormal

60%

StableNormal is an open-source AI tool designed to enhance monocular normal estimation by reducing the inherent stochasticity of diffusion models. This approach leads to "Stable-and-Sharp" normal maps, outperforming various baselines in terms of accuracy and stability. The tool is presented as a research project from SIGGRAPH Asia 2024 and provides a Python-based pipeline for installation and usage. It includes a faster inference option, StableNormal-turbo, which is 10 times quicker. Users can compute metrics on datasets like DIODE, IBims-1, Scannet, and NYUv2 to evaluate performance, making it suitable for researchers and developers in computer vision and generative AI.

truss

60%

Truss is a command-line interface (CLI) tool designed to streamline the deployment and serving of AI/ML models on Baseten. It allows developers to package their model's serving logic in Python, manage dependencies, and configure GPUs with ease. Truss handles containerization automatically, eliminating the need for manual Docker and Kubernetes setup. It supports a wide range of open-source frameworks, including vLLM, SGLang, TensorRT-LLM, transformers, diffusers, PyTorch, and TensorFlow. Key features include a fast developer loop with live reload, production-ready capabilities like built-in GPU support, secrets management, caching, and autoscaling, whether deployed to Baseten or custom infrastructure. Truss also provides a JSON schema for `config.yaml` to enable autocompletion and validation in popular IDEs.

T5unami Small V1

60%

T5unami Small V1 is an AI tool available as a Hugging Face Space, developed during the Somos NLP Hackathon 2023. While the tool's intended functionality is task automation and content generation, the live application currently displays a runtime error, indicating issues with its environment setup, specifically related to GPU support and PeftConfig initialization. This suggests it's a project in development or requires specific configurations to run correctly. The tool is likely intended for developers, researchers, or students interested in experimenting with AI models, particularly those involved in the Somos NLP community. Its current state prevents direct use, but it represents an effort in leveraging T5 models for various applications.

SRFBN_CVPR19

60%

SRFBN_CVPR19 offers a PyTorch implementation of the 'Feedback Network for Image Super-Resolution' paper (CVPR2019). This open-source tool allows users to apply advanced deep learning techniques to enhance the resolution of images. It includes pre-trained models for various scaling factors and degradation models, enabling quick evaluation on standard benchmarks like Set5, Set14, and Urban100. Users can also train their own models using provided scripts and datasets like DIV2K or DF2K. The repository details the architecture of the SRFBN, including feedback connections, and provides comprehensive instructions for testing and training, making it a valuable resource for researchers and developers in computer vision.

stable-diffusion-videos

60%

stable-diffusion-videos is an open-source tool designed for creating dynamic videos using Stable Diffusion. It enables users to generate videos by exploring the latent space and smoothly morphing between different text prompts. The tool supports various functionalities, including specifying prompts, seeds, interpolation steps, and output dimensions. A notable feature is the ability to create music videos by integrating audio files, where the audio informs the interpolation rate to synchronize video movement with the beat. It offers flexibility in controlling parameters like guidance scale and the number of inference steps. The project is available on GitHub and provides examples and installation instructions for easy setup and experimentation.

stable-diffusion-webui-chinese

60%

stable-diffusion-webui-chinese is an open-source extension designed to provide a Simplified Chinese localization for the Stable Diffusion WebUI. This project aims to make the popular AI image generation interface more accessible to Chinese-speaking users by translating its core components and several widely used extensions. The current version, 0313, is based on the latest official webui and self-used plugin templates as of March 13, 2024. It includes translations for extensions like ControlNet, openpose-editor, multidiffusion-upscaler, and more. Users can install it either through the WebUI's Extensions tab or by directly copying the localization files, making it a valuable resource for those who prefer a Chinese interface for their AI art generation workflow.

SpikeGPT

60%

SpikeGPT is an implementation of a generative pre-trained language model that utilizes pure binary, event-driven spiking neural networks. This lightweight model is inspired by RWKV-LM and allows for experimentation with spiking neural networks in language modeling tasks. It supports training on datasets like Enwik8 and pre-training on large corpora such as The Pile. Users can fine-tune the model on datasets like WikiText-103 and perform inference with custom prompts or a pre-trained model. The repository also includes resources for fine-tuning with Natural Language Understanding (NLU) tasks, making it a valuable tool for researchers and developers exploring alternative neural network architectures.

SuperGlue-pytorch

60%

SuperGlue-pytorch offers a PyTorch implementation of the SuperGlue matching network, designed for learning feature matching with Graph Neural Networks. This repository specifically includes code for training the SuperGlue network using SIFT keypoints and descriptors. It is intended for applications leveraging the Physarum Dynamics LP solver, which can potentially replace the original Sinkhorn Algorithm in SuperGlue. The architecture involves an Attentional Graph Neural Network and an Optimal Matching Layer, facilitating the identification of correspondences between image features, even in cases of occlusion or detector failure. The tool provides scripts for training the model and loading data, including generating keypoints, descriptors, and ground truth matches.

symbolic_deep_learning

60%

symbolic_deep_learning is an open-source project providing the official implementation for the research paper "Discovering Symbolic Models from Deep Learning with Inductive Biases." This tool enables researchers and developers to explore the integration of symbolic reasoning with deep learning techniques. It supports the development of models that combine neural networks with symbolic structures, offering a novel approach to understanding and interpreting complex deep learning models. The repository includes code for training example models, generating data, and analyzing results, making it a valuable resource for academic research in AI and machine learning.

EXPLORE OTHER CATEGORIES

🎨 Content & Design 📊 Productivity & Business 🤖 AI Agents & Automation 📚 Research & Education 🧘 Wellness & Lifestyle 💼 Career Development 📈 Marketing & Growth 📉 Data & Analytics 💬 Customer Support & CX 💰 Finance 🛒 E-commerce