ShypdShypd.ai
💻

Coding & Development

Browsing page 197 of AI tools for Open Source & Models in Coding & Development. Sorted by confidence score — our independent quality rating.

YOLO-World + EfficientSAM

YOLO-World + EfficientSAM

55%

YOLO-World + EfficientSAM is an AI tool available on Hugging Face that facilitates advanced object detection and image segmentation. Users can upload photos or videos and specify objects they wish to identify using comma-separated names. The tool then processes the media to highlight these objects with precise bounding boxes and masks, offering an optional confidence score display. This combination of YOLO-World for detection and EfficientSAM for segmentation provides a robust solution for visual analysis tasks. It is particularly suitable for AI research and prototyping, allowing developers and researchers to experiment with and build upon state-of-the-art computer vision models.

Zyphra-ZR1 WebGPU

Zyphra-ZR1 WebGPU

55%

Zyphra-ZR1 WebGPU is a compact AI reasoning model engineered to operate entirely within a web browser, leveraging WebGPU technology. This innovative approach enables users to perform complex reasoning tasks and interact with 3D models without the need for external servers or cloud infrastructure. Users can upload their own 3D models or utilize preloaded ones, exploring them in a detailed and immersive environment directly from their browser. This local execution capability makes it particularly useful for applications requiring offline functionality, enhanced privacy, or experimental AI development where server-side processing is not desired or feasible. The tool is hosted on Hugging Face Spaces, indicating its community-driven and accessible nature.

— Zero GPU Spaces —

— Zero GPU Spaces —

55%

— Zero GPU Spaces — is a directory designed to help users discover and explore AI applications that operate efficiently without the need for dedicated GPU hardware. This tool provides a searchable list of Hugging Face Spaces that leverage ZeroGPU technology, making AI accessible for a broader range of users and applications. Users can filter available demos by keywords, and view essential details such as the title, description, and usage statistics for each Space. It serves as a valuable resource for developers, researchers, and enthusiasts looking for cost-effective and hardware-agnostic AI solutions, promoting low-cost AI development and educational exploration.

LesQuicheurs

LesQuicheurs

54%

LesQuicheurs provides an AI chatbot experience focused on general conversations and entertainment. Users can interact with the chatbot for various purposes, including casual discussions and exploring the capabilities of language models. The tool is available on Hugging Face, making it accessible for those interested in engaging with AI-powered conversational agents and experimenting with their responses.

Kotaemon

Kotaemon

54%

Kotaemon is an AI chatbot designed to facilitate general conversation and AI interaction. Users can leverage this tool to test the capabilities of chatbots and explore various AI language models. It serves as a valuable resource for individuals and developers looking to prototype chatbot applications, offering a platform to experiment with and understand AI-driven conversational agents. The tool is available for free, making it accessible for a wide range of users interested in AI.

Keye VL 8B Preview

Keye VL 8B Preview

54%

Keye VL 8B Preview is an AI chatbot that specializes in visual question answering and image captioning. This tool is built to understand and process multimodal inputs, allowing users to interact with images by asking questions and generating descriptive captions. It leverages advanced AI models to interpret visual information and provide relevant textual responses. The tool is available for free, making it accessible for various applications requiring visual AI capabilities.

LCM-LoRA on SDXL

LCM-LoRA on SDXL

54%

LCM-LoRA on SDXL is an AI image generation tool designed to produce high-quality images. It integrates Latent Consistency Models (LCM) and LoRA (Low-Rank Adaptation) techniques, built upon the robust SDXL architecture. This combination allows for efficient and effective image generation. The tool is accessible to a broad audience, particularly AI enthusiasts and developers, and is offered for free on the Hugging Face platform.

Llama 3.1 70b Demo

Llama 3.1 70b Demo

54%

Llama 3.1 70b Demo is an AI chatbot specifically designed for engaging in conversational tasks. Its core capabilities include advanced language understanding and efficient text generation. This tool can serve as a valuable educational resource, providing a platform for users to interact with and learn from an AI. It is offered to users at no cost.

Llama 2 7B Chat

Llama 2 7B Chat

54%

Llama 2 7B Chat is an AI chatbot specifically developed for engaging in conversational tasks. Its core functionalities revolve around advanced language understanding and efficient text generation, making it suitable for various interactive applications. The tool is also positioned as a valuable educational resource, offering capabilities that can aid learning and exploration in AI and language processing. It is noted for being available at no cost.

MiniCPM-V-4 5-Demo

MiniCPM-V-4 5-Demo

54%

MiniCPM-V-4 5-Demo provides an interactive platform on Hugging Face for users to engage with the MiniCPM-V-4 5 AI model. This chatbot serves as a demonstration tool, enabling individuals to explore and evaluate the model's capabilities through direct interaction. It is offered without cost, making it an accessible resource for those involved in research, education, or simply curious about the performance of the MiniCPM-V-4 5 model.

DSensei

DSensei

54%

DSensei provides a serverless hosting solution specifically designed for fine-tuned Large Language Models (LLMs). The platform focuses on delivering high performance through dedicated instance deployments, aiming to reduce operational costs significantly. A key feature is its ability to achieve sub-second code start times for LLMs, which facilitates rapid deployments and ensures efficient, cost-effective serving based on a pay-per-use model. This makes it suitable for developers and businesses looking to deploy and manage their custom LLMs without the overhead of traditional infrastructure.

nerfplusplus

nerfplusplus

54%

nerfplusplus is an open-source codebase designed to enhance Neural Radiance Fields (NeRF) for capturing and rendering large-scale, unbounded 360-degree scenes. It offers significant improvements over traditional NeRF methods, particularly for complex environments. The codebase supports multi-GPU training and inference through PyTorch DistributedDataParallel, enabling efficient processing of demanding tasks. An experimental feature for optimizing per-image autoexposure is also included. It provides tools for data preparation, including generating camera parameters with COLMAP SfM, scene normalization, and visualizing cameras in 3D to ensure compatibility and correctness.

OpenSplat

OpenSplat

54%

OpenSplat is a free and open-source C++ implementation of 3D Gaussian splatting, designed for portability, efficiency, and speed. It can run on Windows, Mac, and Linux, with support for NVIDIA, AMD, and Apple (Metal) GPUs, as well as CPU-only operation (though significantly slower). The tool takes camera poses and sparse points from formats like COLMAP, OpenSfM, ODM, or nerfstudio projects to compute scene files (.ply or .splat). These generated files can then be imported into other software for viewing, editing, and rendering. OpenSplat is licensed under AGPLv3, allowing and encouraging commercial use.

WanX 2.1

WanX 2.1

54%

WanX 2.1 is an open-source AI video generator designed to create short MP4 videos directly from text prompts. It specializes in generating 5-second, 720p videos, making them ideal for various social media platforms. The tool leverages a 3D VAE architecture and an MLLM text encoder to enhance motion consistency within the generated videos. Its open-source nature means that both the code and model weights are publicly available on GitHub, allowing for transparency and community contributions.

rsrc

rsrc

54%

rsrc is an open-source tool designed for Go developers working on Windows applications, enabling them to embed .ico files and Windows manifest resources directly into their programs. It generates a .syso file containing these resources, which the Go linker then uses during the build process to create Win32 executables. This simplifies the inclusion of application icons and manifest details, ensuring they are properly integrated into the final binary. The tool supports various architectures including 386 and amd64, with experimental support for arm and arm64. While currently in low-priority maintenance, it remains a functional solution for its intended purpose, with an alternative, more feature-rich tool from @tc-hib also suggested for those needing advanced capabilities.

Complex-YOLOv4-Pytorch

Complex-YOLOv4-Pytorch

54%

Complex-YOLOv4-Pytorch offers a robust PyTorch implementation of the Complex-YOLOv4 paper, focusing on real-time 3D object detection using point clouds. This tool is designed for researchers and developers working with LiDAR data, providing features like distributed data parallel training for efficiency and Tensorboard integration for monitoring training progress. It incorporates advanced augmentation techniques such as Mosaic/Cutout for training and utilizes GIoU loss for optimizing rotated bounding boxes, enhancing detection accuracy. The project also highlights an anchor-free approach, faster training and inference, and eliminates the need for Non-Max-Suppression, making it a powerful solution for 3D object detection tasks.

cubdb

cubdb

54%

CubDB is an embedded key-value database specifically developed for the Elixir language, prioritizing robustness and efficient resource utilization. It offers essential operations like `get`, `put`, and `delete`, alongside advanced features such as atomic, consistent, isolated, and durable (ACID) transactions. The database employs multi-version concurrency control (MVCC), allowing concurrent read operations without blocking or being blocked by writes. CubDB ensures data integrity even after unexpected shutdowns through its append-only, immutable B-tree data structure, which prevents corruption and maintains transaction atomicity. It also supports manual or automatic compaction to reclaim disk space and provides read-only snapshots for consistent, isolated reads without blocking writes.

DAMO-YOLO

DAMO-YOLO

54%

DAMO-YOLO is a fast and accurate open-source object detection method developed by the TinyML Team from Alibaba DAMO Data Analytics and Intelligence Lab. It extends the YOLO series with new technologies including Neural Architecture Search (NAS) backbones, efficient Reparameterized Generalized-FPN (RepGFPN), a lightweight head with AlignedOTA label assignment, and distillation enhancement. The tool achieves higher performance than state-of-the-art YOLO series and provides not only powerful models but also highly efficient training strategies and complete tools from training to deployment. It supports various models, including general, light, and 701-category models, and offers tutorials for custom dataset finetuning and TensorRT Int8 Quantization.

upscaledb

upscaledb

54%

upscaledb is a very fast, lightweight embedded database engine written in C/C++ that includes a built-in query language. It is production-proven and designed for ease of use, offering features like a sorted B+Tree with variable length keys, basic schema support for POD types, and very fast analytical functions. The database can run as an in-memory solution, supports unlimited parallel transactions, and provides transparent AES encryption and CRC32 verification. It also includes various compression codecs, network access via TCP/Protocol Buffers, and wrappers for multiple programming languages including C++, Java, .NET, Erlang, and Python. upscaledb is open source under the Apache Public License 2.0.

DirectVoxGO

DirectVoxGO

54%

DirectVoxGO is an open-source tool designed for fast radiance field reconstruction, leveraging direct voxel grid optimization. It significantly speeds up NeRF (Neural Radiance Fields) by replacing traditional MLPs with a voxel grid for volume densities and a dense feature grid with a shallow MLP for view-dependent colors. The tool includes a PyTorch CUDA extension for additional 2-3x speedup and an O(N) realization for the distortion loss, improving both training time and quality. It supports various datasets including bounded and unbounded inward-facing scenes, as well as forward-facing scenes, making it versatile for researchers and engineers in computer vision.

embassy

embassy

54%

Embassy is a modern, open-source framework designed for embedded applications, utilizing the Rust programming language and its asynchronous facilities. It enables developers to write safe, correct, and energy-efficient embedded code more rapidly. The framework includes Hardware Abstraction Layers (HALs) for various microcontrollers like STM32, nRF, RP2040, and MSPM0, simplifying hardware interaction. Key features include globally available timekeeping, real-time task management with priority-based execution, and low-power readiness by putting the core to sleep when idle. Embassy also offers a comprehensive networking stack, Bluetooth Low Energy support, LoRa integration, a device-side USB stack, and a robust bootloader for power-fail-safe firmware upgrades.

HyperPose

HyperPose

54%

HyperPose is a powerful library designed for building high-performance custom human pose estimation applications. It stands out with its real-time capabilities, achieved through a sophisticated pose estimation engine that incorporates numerous system optimizations. These include pipeline parallelism, model inference with TensorRT, and CPU/GPU hybrid scheduling, leading to significantly higher FPS compared to other popular tools like OpenPose, TF-Pose, and OpenPifPaf. Beyond performance, HyperPose offers flexibility for developers, providing high-level Python APIs to customize training, evaluation, visualization, pre-processing, and post-processing. Users can also tailor model architectures and training datasets, and accelerate training with multiple GPUs, making it a versatile solution for advanced computer vision projects.

grayskull

grayskull

54%

Grayskull is a minimalist, dependency-free computer vision library written in C, specifically engineered for microcontrollers and other resource-constrained devices like drones and robotics. It focuses on grayscale image processing, providing a suite of modern and practical algorithms that fit within a few kilobytes of code. Key features include image operations such as copy, crop, resize (bilinear), and downsample, along with filtering capabilities like blur, Sobel edges, and various thresholding methods (global, Otsu, adaptive). The library also supports morphology operations (erosion, dilation), geometry functions like connected components and perspective warp, and advanced features like FAST/ORB keypoints for object tracking and LBP cascades for face and vehicle detection. Its single-header design, integer-based operations, and pure C99 implementation ensure no dynamic memory allocation or C++ dependencies, making it ideal for embedded vision projects.

kapao

kapao

54%

KAPAO (Keypoints and Poses as Objects) is an efficient single-stage multi-person human pose estimation method. It models keypoints and poses as objects within a dense anchor-based detection framework, simultaneously detecting both pose and keypoint objects and fusing them to predict human poses. This approach results in a model that is faster and more accurate than previous single-stage methods like DEKR and HigherHRNet, especially when not using test-time augmentation. The repository provides the official PyTorch implementation, including setup instructions, trained models, and various inference demos for static images, videos, and even depth video. It also details experiments on COCO and CrowdPose datasets, along with training commands for different model sizes (KAPAO-S, KAPAO-M, KAPAO-L).