ShypdShypd.ai
💻

Coding & Development

Browsing page 391 of AI tools for Coding & Development. Sorted by confidence score — our independent quality rating.

schnetpack

schnetpack

58%

schnetpack is an open-source toolbox designed for researchers and developers working with atomistic systems. It provides a robust framework for developing and applying deep neural networks to predict various properties of molecules and materials, such as potential energy surfaces and quantum-chemical characteristics. The tool includes fundamental building blocks for atomistic neural networks, simplifying the process of conducting simulations and making accurate property predictions. Its open-source nature, hosted on GitHub, encourages community contributions and provides transparent access to its codebase, making it a valuable resource for academic and industrial research in computational chemistry and materials science.

Satellite-Imagery-Datasets-Containing-Ships

Satellite-Imagery-Datasets-Containing-Ships

58%

Satellite-Imagery-Datasets-Containing-Ships is a comprehensive GitHub repository that curates radar and optical satellite datasets specifically designed for ship detection, classification, semantic segmentation, and instance segmentation tasks. These datasets are invaluable for researchers and developers working in computer vision, machine learning, remote sensing, and maritime analysis. The repository details various datasets, including SSDD, OpenSARship, SAR-Ship-Dataset, AIR-SARShip, HRSID, LS-SSDD, and FUSAR-Ship, providing information on their authors, year, tasks supported, and direct access links. Each dataset entry includes specifics like image dimensions, spatial resolutions, polarization types, and annotation formats, making it a crucial resource for developing and evaluating algorithms for maritime surveillance and naval operations.

SpatialLM

SpatialLM

58%

SpatialLM is a 3D large language model designed to process 3D point cloud data and generate structured 3D scene understanding outputs. It can identify architectural elements such as walls, doors, and windows, as well as oriented object bounding boxes with their semantic categories. A key differentiator is its ability to handle point clouds from diverse sources, including monocular video sequences, RGBD images, and LiDAR sensors, unlike previous methods that often required specialized equipment. This multimodal architecture bridges the gap between unstructured 3D geometric data and structured 3D representations, providing high-level semantic understanding. SpatialLM enhances spatial reasoning capabilities for applications in embodied robotics, autonomous navigation, and other complex 3D scene analysis tasks. It offers models like SpatialLM1.1-Llama-1B and SpatialLM1.1-Qwen-0.5B, available on Hugging Face, and supports detection with user-specified categories.

rl

rl

58%

TorchRL is an open-source Reinforcement Learning (RL) library built for PyTorch, emphasizing a modular, primitive-first, and Python-first design. It provides a comprehensive framework for developing and deploying RL agents, featuring a command-line training interface for state-of-the-art agents without extensive coding. The library also includes a revamped vLLM integration for scalable LLM inference and training, offering features like AsyncVLLM service, multiple load balancing strategies, and distributed data loading. Additionally, TorchRL offers an experimental PPOTrainer for configurable PPO training solutions and a complete LLM API for fine-tuning language models, supporting RLHF, supervised fine-tuning, and tool-augmented training. Its design principles align with the PyTorch ecosystem, ensuring efficiency, extensibility, and minimal dependencies.

shapash

shapash

58%

Shapash is a Python library designed to make machine learning models interpretable and comprehensible for everyone. It offers various visualizations with clear and explicit labels, simplifying the understanding of interactions between a model's features. A key feature is its ability to generate a Webapp, allowing users to easily navigate between local and global explainability. This Webapp helps Data Scientists understand their models and share results with non-data experts. Shapash also contributes to data science auditing by providing comprehensive reports about models and data. It supports Regression, Binary Classification, and Multiclass problems and is compatible with numerous models like Catboost, Xgboost, LightGBM, Sklearn Ensemble, Linear models, and SVM, with options to integrate other models.

dreamtalk

dreamtalk

58%

DreamTalk is an open-source framework designed for generating expressive talking head videos. It utilizes diffusion probabilistic models to create high-quality videos that capture diverse speaking styles. The tool is robust, handling a wide array of inputs including songs, speech in multiple languages, and even noisy audio, and can work with out-of-domain portraits. Users can specify audio paths, style clips, head poses, and input images to generate videos. While the primary focus is on accurate lip-sync and vivid expressions, the resolution can be improved using external solutions like CodeFormer or MetaPortrait's Temporal Super-Resolution Model. The project provides inference code and pretrained checkpoints, though access to checkpoints requires an email request for academic research purposes.

streamlit-fastapi-model-serving

streamlit-fastapi-model-serving

58%

streamlit-fastapi-model-serving is an open-source project designed to simplify the deployment of machine learning models. It leverages FastAPI for creating a robust backend with automatic API documentation and Streamlit for building an interactive, user-friendly frontend. This combination allows developers to quickly serve PyTorch models, providing both a programmatic interface for other applications and a visual interface for direct user experimentation. The project uses Docker Compose to orchestrate these two services, ensuring seamless communication and easy setup. It's an ideal solution for developers looking to deploy ML models with a complete web application stack.

SwiftUI-Agent-Skill

SwiftUI-Agent-Skill

58%

SwiftUI-Agent-Skill provides expert guidance for AI coding tools that support the Agent Skills open format. It focuses on practical SwiftUI best practices, covering essential aspects like state management, view composition, and performance optimization. This tool is designed for developers and teams who are adopting modern SwiftUI APIs and want to leverage AI assistance to improve their coding efficiency and code quality. It helps in understanding and implementing robust SwiftUI solutions, ensuring adherence to best practices for scalable and maintainable applications.

TextClassification-Keras

TextClassification-Keras

58%

TextClassification-Keras is a comprehensive code repository designed for implementing deep learning models for text classification tasks using the Keras framework. It offers ready-to-use implementations of popular models such as FastText, TextCNN, and TextRNN, making it a valuable resource for researchers and developers. The repository simplifies the application of these advanced models to text classification problems, supporting both English and Chinese documents. It serves as an excellent starting point for those looking to explore or integrate deep learning-based text classification into their projects, providing a foundational codebase for further development and experimentation.

techniques

techniques

58%

The 'techniques' GitHub repository serves as a comprehensive resource for deep learning methods specifically tailored for satellite and aerial imagery analysis. It provides an organized overview of various techniques designed to handle the unique challenges of processing large-scale image datasets. The repository focuses on methodologies for identifying diverse object classes within these images, making it a valuable asset for researchers and developers in the field. As an open-source project, it is freely accessible for both research and development purposes, fostering collaboration and advancement in the application of AI to geospatial data.

TFC-pretraining

TFC-pretraining

58%

TFC-pretraining is a specialized tool designed for self-supervised contrastive learning, specifically tailored for time series data. It leverages a novel approach called time-frequency consistency to significantly improve the learning process and the quality of representations derived from complex time series. The tool provides researchers and practitioners with not only the underlying methodology but also includes processed datasets and readily available code for implementing the technique. This makes it an invaluable resource for those working in time series analysis, enabling them to explore advanced predictive analytics and pattern recognition with greater efficiency and accuracy. Its focus on robust representation learning addresses key challenges in handling sequential data.

torchcv

torchcv

58%

TorchCV is a PyTorch-based framework designed for deep learning applications in computer vision. It offers a comprehensive collection of implementations for various models, primarily focusing on image classification and other common computer vision tasks. The framework is built with the goal of keeping pace with the latest advancements and research in the field, providing developers with up-to-date resources. While the provided content is a GitHub pricing page, the context indicates torchcv is a tool for developers working with computer vision models, likely open-source given its GitHub presence. It serves as a valuable resource for those looking to implement or experiment with state-of-the-art computer vision algorithms.

AnySplat

AnySplat

58%

AnySplat is an open-source tool designed for feed-forward 3D Gaussian Splatting from unconstrained views. It utilizes a transformer-based geometry encoder followed by three decoder heads to predict Gaussian parameters, depth maps, and camera poses. These outputs are then used to construct pixel-wise 3D Gaussians, which are voxelized and rendered into multi-view images and depth maps. The tool supports training and inference, with code available for installation and quick start. It also includes a Gradio-based demo for visualizing reconstructed 3D Gaussian Splats from uploaded images or videos, making it a valuable resource for researchers and developers in computer vision and graphics.

awesome-seml

awesome-seml

58%

Awesome-seml is a comprehensive, curated list of articles dedicated to software engineering best practices for developing machine learning applications. This resource goes beyond core ML algorithms, focusing instead on the crucial surrounding activities such as data ingestion, coding standards, rigorous testing, version control, seamless deployment, quality assurance, and effective team collaboration. It serves as an invaluable guide for ML engineers and software engineers aiming to build robust, reliable, and production-ready machine learning systems. The list is categorized into broad overviews, data management, model training, deployment and operation, social aspects, governance, and tooling, offering a structured approach to understanding and implementing best practices.

awesome-gpt4

awesome-gpt4

58%

awesome-gpt4 is an open-source GitHub repository offering a comprehensive, curated list of resources centered around the GPT-4 language model. It serves as a valuable hub for researchers, developers, and enthusiasts looking to delve deeper into GPT-4's applications and advancements. The repository categorizes resources into several key areas, including impactful scientific papers, a diverse collection of open-source projects leveraging GPT-4, community-contributed demos showcasing its capabilities, and various product integrations that utilize the model. Additionally, it features a section dedicated to GPT-4 news and announcements, keeping users updated on the latest developments. A significant part of awesome-gpt4 is its collection of impressive prompts, demonstrating effective ways to interact with GPT-4 for various tasks, from acting as a pharmacologist or lawyer to a debugger or mobile app developer. This makes it an indispensable resource for understanding, experimenting with, and developing applications based on GPT-4.

use-stick-to-bottom

use-stick-to-bottom

58%

use-stick-to-bottom is a lightweight, zero-dependency React Hook and Component specifically designed for AI chat applications. It automatically sticks to the bottom of a container and smoothly animates content to maintain its visual position as new messages are added. This tool does not rely on `overflow-anchor` CSS support, making it compatible with browsers like Safari. It uses the `ResizeObserver` API to detect content resizing, supporting both content growth and shrinking without losing stickiness. The hook also correctly handles scroll anchoring, preventing content jumps when elements above the viewport resize. Users can cancel stickiness by scrolling up, with clever logic distinguishing user scrolls from animation events. It features a custom smooth scrolling algorithm with velocity-based spring animations, ideal for streaming content with variable sizing common in AI chatbots.

AViD

AViD

58%

AViD is a streamlined toolkit designed for fine-tuning state-of-the-art vision-language detection models with parameter-efficient adaptation. Built upon the powerful Grounding DINO framework, AViD introduces capabilities for fine-tuning image-to-text grounding on custom datasets, which is crucial for applications requiring precise alignment between textual descriptions and image regions. Key features include a complete fine-tuning pipeline, parameter-efficient training with LoRA (allowing training of only ~2% of parameters), and EMA stabilization to retain pre-trained knowledge. It also offers optional phrase-based NMS for removing redundant boxes and includes a sample fashion dataset for immediate experimentation. The framework provides a comprehensive evaluation suite with metrics like mAP, Precision, Recall, and F1 Score, along with visualizations and detailed reporting.

Voqal

Voqal

58%

Voqal offers a native voice control SDK designed for mobile developers to integrate Arabic and English voice commands into their iOS and Android applications. The SDK supports over 10 Arabic dialects, including Egyptian, Gulf, Levantine, Maghrebi, and Iraqi, ensuring broad user understanding. It boasts a response time of less than 5 seconds and an accuracy rate exceeding 95%. Voqal handles voice recognition, intent parsing, and response handling, allowing developers to add voice control without modifying their backend. The integration process is streamlined, taking minutes rather than days, and supports popular frameworks like React Native and Flutter. Built-in analytics provide insights into usage patterns and recognition accuracy, making it a comprehensive solution for voice-enabling mobile apps in the MENA region.

brian2

brian2

58%

Brian2 is a free, open-source simulator for spiking neural networks, primarily written in Python. It provides a user-friendly and efficient platform for researchers to model and simulate complex neural circuits. The simulator is designed with ease of learning and use in mind, aiming to save scientists' time in addition to processing power. Brian2 is highly flexible and easily extensible, making it suitable for a wide range of neuroscience research applications. It is available on almost all platforms and offers comprehensive documentation. Users are encouraged to report issues via GitHub or the Brian forum and to cite the provided article if used for published research.

bullet3

bullet3

58%

bullet3 is the official C++ source code repository for the Bullet Physics SDK, offering real-time collision detection and multi-physics simulation capabilities. It is widely used across various domains including virtual reality, game development, visual effects, robotics, and machine learning. The SDK supports a range of platforms like Windows, Linux, Mac OSX, iOS, and Android, and includes experimental OpenCL GPGPU support for accelerating collision detection and rigid body dynamics. Users can also leverage PyBullet, Python bindings for enhanced support in robotics, reinforcement learning, and VR, with simple installation via pip. The project is licensed under the permissive zlib license.

aiTouch

aiTouch

58%

aiTouch is an advanced technologies software services startup recognized by the Government of India, specializing in AI, ML, and data science. They offer a comprehensive suite of services including custom software development for web and mobile applications, SaaS solutions, and full-stack development. A core offering is their data annotation and labeling services, covering image, video, text, and audio annotation, supported by an in-house annotation tool. aiTouch focuses on creating high-quality data sets essential for AI/ML model training and development. They serve various verticals such as Retail & CPG, Sports, Automotive, and Healthcare, assisting clients globally from early ventures to large-scale enterprises in building top-performing AI models and software solutions.

wenet

wenet

58%

wenet is an open-source, production-first, end-to-end speech recognition toolkit designed to offer comprehensive solutions for automatic speech recognition (ASR). The project emphasizes production readiness and ease of use, making it suitable for developers and organizations looking to integrate robust speech recognition capabilities into their applications. It provides the foundational components necessary for building and deploying ASR systems, focusing on practical implementation rather than just research. The toolkit is hosted on GitHub, indicating a collaborative development model and accessibility for the developer community.

accelerated-text

accelerated-text

58%

Accelerated Text is a no-code natural language generation platform designed to transform data into varied textual descriptions. It provides a web-based Document Plan builder where users can define the logical structure of a document, express communication goals, and specify data usage within the text. The platform's Natural Language Generation engine then uses these Document Plans and connected data to produce multiple text variations, ensuring precision and adherence to the provided data. Key features include a document plan editor, support for CSV data samples, text structure variations, language and vocabulary control, a built-in rule engine, and a live preview of generated text. It focuses on data-bound text generation, ensuring descriptions are always and only about the data provided.

weight-loss

weight-loss

58%

weight-loss is an open-source GitHub repository that leverages machine learning to analyze personal weight and lifestyle data, helping users understand the factors contributing to weight changes. It provides scripts for data collection, conversion to a machine learning-friendly format (Vowpal Wabbit), and analysis to identify correlations between lifestyle choices (diet, sleep, exercise) and weight fluctuations. The project emphasizes a personal journey of experimentation and discovery, offering insights into effective weight loss strategies, particularly those related to ketosis and fasting. Users can apply the provided code to their own data to gain personalized insights, with a focus on identifying significant factors like sleep and carbohydrate intake.