Coding & Development
Browsing page 187 of AI tools for Open Source & Models in Coding & Development. Sorted by confidence score — our independent quality rating.
Gaussian-SLAM
Gaussian-SLAM is an open-source project available on GitHub, designed for photo-realistic dense Simultaneous Localization and Mapping (SLAM). It leverages Gaussian splatting to achieve high-quality 3D reconstruction, offering a robust solution for researchers and engineers in computer vision and robotics. The tool supports various datasets including Replica, TUM_RGBD, ScanNet, and ScanNet++, and provides scripts for easy setup and data downloading. Users can configure and run SLAM experiments, reproduce results, and even generate fly-through videos based on reconstructed scenes. It's tested on powerful GPUs like RTX3090 and RTX A6000, ensuring performance for demanding tasks.
github-widget
github-widget is an open-source tool designed to easily embed GitHub profile details into any website. Users can display their GitHub username, repository information, and other relevant details by simply copying and pasting a small code snippet into their HTML. The widget is highly customizable and can be integrated via direct script inclusion, npm, or bower, offering flexibility for different development workflows. This tool is ideal for developers, designers, or anyone who wants to showcase their GitHub activity and contributions directly on their personal website, portfolio, or project pages, providing a dynamic and up-to-date representation of their work.
FabricView
FabricView is an open-source Android library designed for canvas drawing, offering functionalities similar to Fabric.js for web development. It enables developers to integrate robust drawing capabilities into their Android applications, supporting various input types including text, images, and hand/stylus drawing. The library is currently under active development, with plans for refactoring and polishing. Key features include support for multiple input colors, different background modes like notebook and graph paper, and the ability to export the canvas as an image. Future enhancements are planned, such as layers, groups, transparency, and advanced transformations like rotations and scaling. FabricView provides an API for easy integration and offers comprehensive documentation.
TheBloke Quantized Models
TheBloke Quantized Models is a Hugging Face Space designed to help users find and explore quantized AI models. Quantization is a technique that reduces the size and computational cost of AI models, making them more efficient for deployment and use on various hardware. This tool provides a search interface where users can look for models based on the author or the model's specific name. The platform presents a table of available models, detailing their types and other relevant information. While the current status indicates a build error, the intent of the space is to serve as a repository and discovery tool for these optimized AI models, primarily hosted on Hugging Face.
OpenCV-Face-Recognition
OpenCV-Face-Recognition is an open-source project designed for real-time face recognition using OpenCV and Python. It serves as a foundational resource for developers and data scientists looking to implement face detection and recognition systems. The project includes comprehensive tutorials, making it accessible for those who want to build end-to-end face recognition applications. It leverages the power of OpenCV for image processing and Python for scripting, providing a robust framework for various computer vision tasks related to facial analysis. This tool is particularly useful for learning and developing custom solutions in areas such as security, attendance systems, or interactive applications requiring real-time facial identification.
PaddleDetection
PaddleDetection is an end-to-end object detection development toolkit built on PaddlePaddle, offering a rich set of model components and benchmarks. It focuses on industrial applications by providing specialized models and tools, along with practical application examples. This toolkit helps developers streamline the entire process from data preparation and model selection to training and deployment. It supports various tasks including 2D/3D object detection, instance segmentation, face detection, keypoint detection, multi-object tracking, and semi-supervised learning. PaddleDetection also features low-code full-process development capabilities and a modular design for easy model construction.
nerfies.github.io
Nerfies is an open-source project that hosts the source code for the Nerfies website, which is dedicated to Deformable Neural Radiance Fields. This repository serves as a valuable resource for researchers and developers working with neural radiance fields, particularly those interested in creating dynamic and deformable 3D scenes from 2D images. The project is licensed under a Creative Commons Attribution-ShareAlike 4.0 International License, encouraging collaboration and further development within the AI community. It provides the foundational code for understanding and implementing Nerfies, making it an essential reference for advancing research in computer vision and graphics.
MessageDisplayKit
MessageDisplayKit is an open-source framework designed to help developers build instant messaging (IM) applications with features akin to WeChat. It supports a wide range of messaging capabilities, including sending text, pictures, audio, video, and location messages. Beyond core messaging, the kit also includes functionalities for managing local address books, sharing moments with friends, and other interactive social features like 'drift bottle' and 'shake for new friends'. The project is highly customizable, supports arbitrary message sizes, and includes data detectors for recognizing phone numbers, links, and dates. It is compatible with iPhone and iPad, Xcode6 or later, and iOS 6.0+, making it a valuable learning resource and a foundation for IM app development.
pgmpy
pgmpy is an open-source Python library designed for causal and probabilistic reasoning through graphical models. It offers comprehensive implementations of data structures for various models including DAGs, PDAGs, MAGs, PAGs, Bayesian Networks, Dynamic Bayesian Networks, and Structural Equation Models. The toolkit includes algorithms for key tasks such as causal discovery, causal identification, causal and probabilistic inference, model validation, parameter estimation, and simulations. Its modular and extensible API ensures compatibility with scikit-learn, allowing direct use, integration into sklearn pipelines, or building higher-level tools. pgmpy supports both discrete and linear Gaussian data, as well as mixture data with arbitrary relationships.
sdk
Microlink SDK is an open-source tool designed to transform any URL into an embeddable, rich link preview. It leverages the Microlink API to fetch metadata, presenting it as a customizable card with a title, description, image, and more. The SDK supports various media types including images, videos, audio, screenshots, and embedded iframes, offering multiple card sizes (small, normal, large). It features lazy loading for performance, media controls for video/audio, and theming options via CSS variables or contrast mode. Available as both a React component and a vanilla JavaScript version, it also includes hover packages to display previews on mouse-over. Developers can customize data, disable API fetching for static content, and fine-tune media playback behavior.
Vista
Vista is an open-source project from OpenDriveLab, presented at NeurIPS 2024, offering a generalizable world model specifically designed for autonomous driving. This tool allows for the prediction of high-fidelity futures across a wide range of driving scenarios, extending these predictions to continuous and long horizons. A key feature is its ability to execute multi-modal actions, including steering angles, speeds, commands, trajectories, and goal points. Furthermore, Vista can provide rewards for different actions without requiring access to ground truth actions, making it a valuable resource for researchers and developers in the autonomous driving field. The implementation is based on generative-models from Stability AI, and the project includes installation, training, and sampling scripts, along with model weights available on Hugging Face and Google Drive.
Z3D E621 Convnext Space
Z3D E621 Convnext Space is a Hugging Face Space designed to analyze images and provide relevant tags. Users can either upload an image or capture one directly through the application. The tool then processes the image using a Convnext model and returns a comprehensive list of tags, each accompanied by a confidence score. This functionality is particularly useful for organizing image libraries, enhancing searchability, or understanding the content of an image through automated tagging. It offers a straightforward interface for quick image analysis.
batchgenerators
batchgenerators is a Python package designed for data augmentation, specifically tailored for 2D and 3D image classification and segmentation tasks. Developed jointly by the German Cancer Research Center (DKFZ) and the Helmholtz Imaging Platform, it offers a comprehensive suite of augmentations including mirroring, channel translation, elastic deformations, rotations, scaling, resampling, and multi-channel misalignments for spatial data. Color augmentations cover brightness, contrast, and gamma, while noise augmentations include Gaussian and Rician noise. The framework also provides cropping options like random and center crop, along with padding. A key differentiator is its compatibility with both 2D and 3D input data, addressing a common gap in other frameworks. It also features anatomy-informed and misalignment data augmentations for specialized applications. The package is designed for flexibility, using a simple Python dictionary structure for data handling, and supports multi-threaded augmentation for performance.
algorithmic_trading_book
algorithmic_trading_book is a GitHub repository offering comprehensive resources for individuals interested in algorithmic trading. It includes two distinct books: 'Successful Algorithmic Trading' and 'Advanced Algorithmic Trading'. Each book is provided in PDF format and is accompanied by its corresponding source code, allowing users to study the theoretical concepts and immediately apply them through practical examples. The repository is designed to support learning and implementation of various algorithmic trading strategies, catering to both beginners looking to understand the fundamentals and more experienced traders seeking advanced techniques. All materials are open source, making them freely accessible for educational and development purposes.
Find3D
Find3D is an open-world 3D part segmentation model designed to identify and segment specific components within 3D objects. Users can upload their own .pcd files or select from provided samples to analyze point cloud data. The tool allows for precise part queries, enabling the segmentation of complex 3D objects into their constituent parts. This capability is particularly useful for applications requiring detailed structural analysis, object recognition, and component isolation within 3D environments. Developed as a Hugging Face Space, Find3D offers an accessible platform for researchers, developers, and enthusiasts working with 3D data and AI applications.
FaceAISDK_Android
FaceAISDK_Android is a comprehensive SDK designed for Android devices, offering robust on-device face recognition, liveness detection, and 1:N & M:N face search capabilities. It supports offline operation, meaning no internet connection is required, and no sensitive facial information is uploaded or stored, enhancing user privacy and data security. The SDK includes silent liveness detection and action-based liveness detection (mouth opening, smiling, blinking, head shaking, nodding). It also supports UVC protocol USB cameras for clear imaging. This solution is ideal for applications requiring secure, local facial authentication and identification, such as mobile attendance, access control, smart locks, and smart home systems, significantly reducing cloud infrastructure costs.
FaceRecognition
FaceRecognition is an on-device, offline SDK developed by FaceAISDK, specializing in face detection, recognition, and liveness detection. It supports advanced functionalities like 1:N and M:N face search, making it suitable for various applications from access control to surveillance. A key differentiator is its complete offline operation, ensuring that no facial information or sensitive data is uploaded or saved, which significantly enhances user privacy and security. The SDK is compatible with Android devices (versions 7-16) and supports various liveness detection methods including mouth opening, smiling, blinking, head shaking, and nodding. It also offers features like improved accuracy for unclear images, enhanced recognition for distant and small faces, and optimized stability for low-spec devices running continuously for extended periods.
David-Silver-Reinforcement-learning
David-Silver-Reinforcement-learning is an open-source repository offering comprehensive notes and practical implementations for David Silver's renowned Reinforcement Learning course. It covers a wide range of topics from Week 1 (Introduction to RL) to Week 10 (Case Study: RL in Classic Games), with each week's content including slides and video links. The repository features algorithm implementations using Keras (with TensorFlow backend) and OpenAI's Gym framework, making it a valuable resource for students and researchers. It supports Python, TensorFlow, Keras, Gym, and Numpy, and encourages community contributions for expanding implementations to other frameworks like PyTorch or Caffe.
jaspy
Jaspy is a unique Python bytecode virtual machine (VM) implemented from scratch in JavaScript, designed to explore new ways of web programming on the client side. While speed is not its primary goal, it offers several distinctive features not found in other Python-to-JavaScript solutions. These include a suspendable interpreter with full support for threading and greenlets, an integrated debugger that provides interactive remote debugging capabilities (compatible with CLI and PyCharm), and a flexible preprocessor-based architecture for optimization. It also allows for easy extensibility with native JavaScript modules, full support for meta-classes, built-in subclassing, operator overloading, asynchronous imports, and arbitrary-length integers. Jaspy is an open-source project, welcoming contributions to expand its functionality and stability.
Lean
Lean is an event-driven, professional-caliber algorithmic trading platform built by QuantConnect, designed for elegant engineering and deep quantitative concept modeling. It supports both Python and C# for developing trading strategies. The platform offers out-of-the-box alternative data and live-trading capabilities, with a modular design that allows for pluggable and customizable components. The QuantConnect Lean CLI provides a command-line interface for managing projects, running backtests, deploying live algorithms, and performing various tasks directly from the terminal. It simplifies the workflow by automating tasks and integrating with cloud services, making it a powerful and flexible tool for quant developers.
geckoview
GeckoView is an open-source project by Mozilla, offering a robust set of components for embedding the Gecko browser engine into Android applications. This allows developers to seamlessly integrate web content rendering capabilities directly within their native Android apps, providing a consistent and powerful browsing experience. The project emphasizes customizability, enabling developers to tailor the web view to their specific application needs. It is a foundational technology for applications like Firefox for Android, providing a secure and performant way to display web content. The GitHub repository serves as the documentation hub, guiding contributors and users on how to get started and utilize its features.
OccNet-Course
OccNet-Course offers the first comprehensive course in China on Occupancy Network algorithms, covering everything from BEV (Bird's Eye View) to Occupancy Network principles and engineering practices, including edge-side deployment. This open-source course is designed for autonomous driving enthusiasts and professionals, providing in-depth knowledge on surrounding semantic occupancy perception. It includes detailed documentation, PowerPoint presentations, and source code, making it a valuable resource for both theoretical understanding and practical application. The curriculum covers various aspects such as BEV perception, different Occupancy Network approaches (pure vision, point cloud, multi-modal fusion), important datasets, benchmarks, and deployment strategies for NVIDIA and Horizon J5 chips. The course also features practical coding exercises and a final project to solidify learning.
Open-DiffusionGS
Open-DiffusionGS is an open-source project that implements a novel approach to single-stage image-to-3D generation and reconstruction by integrating Gaussian Splatting directly into a diffusion denoiser. This method allows for fast and scalable creation of 3D objects, including mesh exportation, and efficient scene reconstruction without the need for depth estimators. The tool is capable of generating 3D outputs in approximately 6 seconds, significantly faster than some state-of-the-art methods. It supports both object-centric image-to-3D generation and scene-level reconstruction, with evaluation capabilities for the latter using datasets like RealEstate10K. The project provides comprehensive scripts for environment setup, quick demonstrations, data preparation for both scene and object-level datasets (including G-Objaverse), evaluation, and multi-stage training of custom models.
open-im-server
OpenIM Server offers an open-source instant messaging solution tailored for developers, enabling them to integrate comprehensive chat functionalities into their applications. Unlike standalone chat apps, OpenIM provides both an SDK and a server, covering essential features like message sending and receiving, user management, and group management. Built with Golang, it supports cross-platform deployment and features a microservices architecture for scalability, handling massive user bases and billions of messages. It also includes REST APIs for business system integration and webhooks for expanding business forms through callbacks, making it a robust framework for implementing efficient instant messaging.