AI Agents & Automation
Browsing page 100 of AI Frameworks & Infra in AI Agents & Automation. Sorted by confidence score — our independent quality rating.
MAmmoTH2
MAmmoTH2 is presented as the strongest open-source large language model (LLM) specifically designed for reasoning tasks. Hosted on Hugging Face Spaces by TIGER-Lab, this tool allows users to interact with a chatbot that generates text responses based on conversation history and a provided system prompt. It is primarily intended for research and development, enabling users to explore advanced language understanding and complex problem-solving. As an open-source model, MAmmoTH2 offers flexibility for developers and researchers to integrate and adapt it for various applications.
MyScaleDB
MyScaleDB is a powerful SQL vector database built as a fork of ClickHouse, specifically optimized for AI applications. It provides high-performance vector search, filtered search, and full-text search capabilities, all accessible through standard SQL. This allows developers to leverage their existing SQL knowledge for building scalable and production-ready GenAI applications without needing to learn new complex tools. MyScaleDB unifies SQL database, vector database, and full-text search engine functionalities into a single, efficient system, leading to reduced infrastructure and maintenance costs. It supports various data types including structured, text, vector, JSON, geospatial, and time-series data, and offers millisecond search performance on billion-scale vectors.
mllm
mllm is a fast and lightweight multimodal LLM inference engine specifically designed for mobile and edge devices. It allows developers to deploy and run large language models (LLMs) on hardware with limited resources, supporting both text and image processing. The tool features a Pythonic eager execution API for rapid model development, unified hardware support across Arm CPU, OpenCL GPU, and QNN NPU, and advanced optimizations like quantization, pruning, and speculative execution. It integrates seamlessly with popular community frameworks' checkpoints, converting PyTorch and SafeTensors models into its optimized format. mllm also provides a deployment toolkit including an SDK and CLI inference tool, making it a central hub for AI inference on mobile platforms.
MORSE Corp
MORSE Corp is an employee-owned company specializing in algorithm and software development services. Their team comprises talented engineers, software developers, and scientists with diverse technical capabilities, rooted in Aerospace Engineering. They apply a user-centered development process, continuously engaging with users to understand needs and gather feedback. A key differentiator is their hands-on approach to field testing alongside users in operationally representative conditions, ensuring products are tailored to user needs and operational constraints. This approach avoids forcing users to adapt their tactics or procedures to suit the product, instead delivering solutions that seamlessly integrate into existing workflows.
nextpy
Nextpy is an open-source framework designed for building self-modifying AI software, currently in a 'just for friends' development stage. It emphasizes guardrails to define AI system boundaries and offers a powerful prompt engine for greater control over Large Language Models (LLMs) compared to traditional methods. Key capabilities include pre-compiling prompts, maintaining session state with LLMs for efficiency, and optimizing token usage. The framework is built for better AI generations, especially optimized for code generation, and can detect and fix syntax errors in LLM-generated code. It boasts modularity, multiplatform support, and a developer-first approach, aiming for 4-10x faster performance than alternatives like Streamlit.
Open CoT Leaderboard
Open CoT Leaderboard is a platform designed to track, rank, and evaluate the Chain-of-Thought (CoT) quality of open large language models (LLMs). Hosted as a Hugging Face Space, it provides a centralized location for researchers and developers to browse and filter a leaderboard of LLM benchmarks. Users can submit their own models for evaluation, allowing for comparison against existing models and contributing to the collective understanding of LLM performance. The platform offers transparency into the evaluation process and the status of submitted models, making it a valuable resource for identifying top-performing open-source LLMs and advancing AI research.
Open Japanese LLM Leaderboard
The Open Japanese LLM Leaderboard is a platform designed for exploring and comparing large language models (LLMs) tailored for the Japanese language. Hosted on Hugging Face Spaces, this tool allows users to search for models by name, and apply filters based on type, size, and precision. It provides performance metrics and visualizations to help researchers, developers, and enthusiasts assess the capabilities of various Japanese LLMs. The leaderboard aims to facilitate the identification of top-performing models, supporting advancements in Japanese natural language processing and AI development. While the current live website indicates a runtime error, the intended functionality is to offer a comprehensive resource for evaluating and understanding the landscape of open Japanese LLMs.
OpenCUA
OpenCUA is a comprehensive open-source framework designed for scaling Computer-Use Agent (CUA) data and foundation models. It features AgentNet, the first large-scale computer-use task dataset spanning multiple operating systems and applications, and AgentNetTool, an annotation infrastructure for capturing human computer-use demonstrations. The framework also includes AgentNetBench, an offline evaluator for benchmarking model-predicted actions, and OpenCUA Models, end-to-end computer-use foundation models with strong planning and grounding capabilities. Notably, OpenCUA-72B achieves state-of-the-art performance on OSWorld-Verified, making it ideal for developers and researchers working on advanced AI agents.
OxyGent
OxyGent is an open-source Python framework designed to empower developers in quickly building production-ready intelligent systems. It unifies various AI tools, models, and agents into modular components called 'Oxy', facilitating transparent, end-to-end pipelines. The framework emphasizes efficient development through standardized, hot-swappable components, allowing rapid assembly and reuse of agents. It supports intelligent collaboration with dynamic planning paradigms, enabling agents to decompose tasks, negotiate solutions, and adapt to changes in real-time. OxyGent features an elastic architecture that supports diverse agent topologies and includes automated dependency mapping and visual debugging tools. It also promotes continuous evolution through built-in evaluation engines that generate training data, ensuring agents continuously improve while maintaining transparency.
paddler
Paddler is an open-source load balancer and serving platform designed for self-hosting Large Language Models (LLMs) and Vision Language Models (VLMs) at scale. It offers a streamlined alternative to existing solutions like llm-d or Docker Model Runner, focusing on fewer moving parts and simpler deployments built around the ggml ecosystem. Key features include a built-in llama.cpp engine for inference, LLM-specific load balancing, dynamic model swapping, and request buffering for scaling from zero hosts. Paddler also provides a web admin panel for management, monitoring, and testing, along with observability metrics. It runs efficiently on both CPU and GPU, catering to product teams needing LLM inference, DevOps/LLMOps teams deploying models at scale, and organizations with high compliance and privacy requirements.
Avea Robotics
Avea Robotics is dedicated to advancing the physical AI revolution within the field of robotics. The company's core mission is to empower the development of next-generation, real-world robotic systems. By providing simplified, end-to-end pipelines, Avea Robotics seeks to unlock unprecedented capabilities and scalability for robotics development. This approach aims to streamline the complex process of integrating AI into physical robots, making advanced robotic systems more accessible and efficient to build and deploy. The focus is on creating robust infrastructure that supports the intricate demands of AI-driven robotics.
regl-cnn
regl-cnn is an open-source project designed for GPU-accelerated handwritten digit recognition, leveraging Convolutional Neural Networks (CNNs) within WebGL. This tool serves as a practical demonstration of how to implement a CNN directly on the GPU using WebGL, offering insights into high-performance computing for machine learning in web environments. The underlying network was initially trained using TensorFlow, and subsequently, its architecture and functionality were meticulously reimplemented in WebGL to showcase client-side inference capabilities. It is particularly useful for web developers interested in integrating machine learning models into web applications and machine learning enthusiasts looking to understand GPU-accelerated CNNs.
rig
rig is a powerful tool designed for developers focused on building modular and scalable LLM (Large Language Model) applications, primarily utilizing the Rust programming language. It enables the creation of custom workflows, allowing for flexible integration of LLMs into various development projects. The tool emphasizes facilitating the development of robust and efficient AI applications, providing the necessary infrastructure for managing and scaling these complex systems. While the provided context is a GitHub pricing page, the tool's core function revolves around enhancing AI application development with Rust, offering a structured approach to LLM integration and workflow customization.
RoleChain
RoleChain provides an orchestration layer for AI agents, enabling users to discover, deploy, and manage specialized AI agents across the Web3 ecosystem. The platform features a marketplace with agents designed for diverse blockchain applications, including DeFi yield optimization, cross-chain bridging, MEV protection, GameFi analytics, and smart contract auditing. It caters to individuals, teams, and large organizations with flexible pricing plans, offering features like custom agents, advanced analytics, API access, and team collaboration. RoleChain aims to empower Web3 communities with decentralized AI solutions, ensuring security and privacy through its decentralized node training network.
Sciform
Sciform is an AI consulting firm that specializes in helping companies, organizations, investors, and board members build responsible AI solutions. They leverage in-depth knowledge in applied mathematics, distributed computing, and interdisciplinary collaboration to provide profound support for projects. Sciform aims to create real value for clients and their customers by smoothly realizing complex solutions in Artificial Intelligence, Big Data, Numerics, High Performance Computing, and Quantum Computing. They offer consulting services tailored for both companies/organizations and investors/board members, providing insights and support even without a specific technical background.
smile
SMILE (Statistical Machine Intelligence and Learning Engine) is a robust and comprehensive machine learning framework implemented in Java, with convenient APIs available for Scala and Kotlin developers. It offers a wide array of algorithms and tools for statistical machine intelligence and learning applications, making it suitable for various data science tasks. The framework is designed for performance and flexibility, supporting Java 8 and newer versions. It empowers developers to build, train, and deploy machine learning models efficiently, catering to both research and production environments. SMILE's extensive feature set covers areas such as classification, regression, clustering, association rule mining, and more, providing a solid foundation for advanced analytical projects.
slimevolleygym
slimevolleygym is an OpenAI Gym environment designed for testing single and multi-agent reinforcement learning algorithms through a simple Slime Volleyball game. This environment is lightweight, requiring only gym and numpy as dependencies, making it less prone to breaking and easy to integrate. It features a baseline 120-parameter neural network opponent, which can be replaced for multi-agent or self-play scenarios. The environment runs efficiently, achieving around 12.5K timesteps per second on state-space observations, facilitating faster iteration in experiments. It supports both state-space and pixel observations, with the latter mimicking Atari Learning Environment setups, and includes a tutorial for various training methods. The environment is particularly useful for educational purposes and for exploring advanced RL methods like self-play and continual learning.
similarities
similarities is a comprehensive, open-source toolkit designed for advanced similarity calculation and semantic search. Built with Python 3, it offers out-of-the-box functionality for various tasks, including text-to-text, text-to-image, and image-to-image searches, capable of handling billion-level datasets. The toolkit features semantic matching models based on text2vec for text similarity and search, supporting multiple SentenceBERT-like pre-trained models across various languages. It also includes literal matching models like Word2Vec and BM25. For image and cross-modal similarity, similarities leverages CLIP models, enabling image-to-image, text-to-image, and vector-to-image searches with support for Chinese-CLIP models and GPU acceleration. It provides command-line tools for vector extraction, index building, batch retrieval, and service deployment, making it a versatile solution for developers and data scientists.
SimpleTuner
SimpleTuner is a comprehensive, open-source fine-tuning kit designed for image, video, and audio diffusion models. It prioritizes simplicity and code understandability, making it an ideal academic exercise and collaborative development platform. The tool features a user-friendly web UI, multi-modal and multi-GPU training capabilities, and advanced caching for faster training. It supports various model architectures, including Stable Diffusion XL, Stable Diffusion 3, and Flux, with integrations for DeepSpeed and FSDP2 for memory optimization. SimpleTuner also includes enterprise-grade features like worker orchestration, SSO integration, role-based access control, and a job queue with priorities, all available for free.
similarity-search-kit
SimilaritySearchKit is a Swift package designed for iOS and macOS applications, enabling on-device text embeddings and semantic search. It offers a robust solution for developers to integrate powerful NLP capabilities directly into their apps without relying on external cloud services, ensuring data privacy and functionality in low-connectivity environments. The kit supports various built-in state-of-the-art NLP models and similarity metrics, with options for extensibility through custom implementations. Use cases include privacy-focused document search engines, offline question-answering systems, and document clustering. Developers can easily add it as a Swift Package Manager dependency and choose specific models to optimize binary size.
SkyRL
SkyRL is a modular, open-source, full-stack reinforcement learning (RL) library specifically designed for large language models (LLMs). It aims to streamline research and development in the field of AI agents by offering a flexible framework for building and training intelligent agents. While the provided website content is a GitHub pricing page for GitHub itself, the tool's description indicates its core purpose is to support advanced AI development. Researchers and developers can leverage SkyRL to experiment with and implement various RL algorithms tailored for LLM applications, fostering innovation in AI agent capabilities and performance.
SqueezeLLM
SqueezeLLM is a post-training quantization framework designed to optimize the serving of large language models (LLMs) through a novel Dense-and-Sparse Quantization method. This approach addresses the significant memory requirements of LLMs by splitting weight matrices into a dense component, which can be heavily quantized without performance loss, and a sparse component that preserves sensitive outlier parts. This allows for serving larger models with a smaller memory footprint, maintaining the same latency, and achieving higher accuracy and quality compared to baseline models. For instance, SqueezeLLM's variant of Vicuna models can operate within 6 GB of memory, surpassing FP16 baseline models in MMLU accuracy despite the latter requiring twice the memory. The framework supports various LLMs including LLaMA, LLaMA-2, Mistral, Vicuna, XGen, and OPT, with options for 3-bit and 4-bit quantization and different sparsity levels.
Stable-Diffusion-WebUI-TensorRT
Stable-Diffusion-WebUI-TensorRT is a TensorRT extension designed to significantly boost the performance of Stable Diffusion Web UI on NVIDIA RTX GPUs. This tool is compatible with a range of Stable Diffusion models, including 1.5, 2.1, SDXL, SDXL Turbo, and LCM, ensuring broad applicability for users. To leverage its capabilities, users must install the extension and generate optimized engines, following the detailed instructions provided. This optimization is crucial for achieving faster inference times and a smoother workflow when generating images, making it an essential addition for developers and graphic designers working with Stable Diffusion on NVIDIA hardware.
tanuki.py
tanuki.py is a tool designed to assist developers in building and optimizing applications powered by Large Language Models (LLMs). Its core focus is on enhancing the efficiency and cost-effectiveness of LLM-based applications. The tool incorporates prompt engineering techniques to refine LLM interactions and improve output quality. Additionally, it supports test-driven alignment, a methodology that ensures the LLM's behavior aligns with desired outcomes through systematic testing. This approach helps developers to iteratively improve the performance and reliability of their LLM applications, making them more robust and production-ready.