AI Agents & Automation
Browsing page 198 of AI Agents & Automation. Sorted by confidence score — our independent quality rating.
IgnteQ
IgnteQ is a leading technology partner focused on delivering seamless, scalable, and innovative software solutions to drive business success. They offer a comprehensive suite of services including brand identity creation, UI/UX research and strategy, expert consultancy, and Odoo ERP development and customization. IgnteQ also specializes in advanced Generative AI & LLM Solutions and provides scalable on-premises & cloud infrastructure. Their expertise spans various business domains such as Ecommerce, Fintech and Banking, E-Learning, and Travel and Hospitality, allowing them to tailor solutions to specific industry needs. They transform ideas into reality, empowering businesses with cutting-edge technology.
Google SGE
Google SGE (Search Generative Experience) is an innovative AI-powered feature integrated into Google Search, designed to transform how users interact with search results. It utilizes advanced machine learning and natural language processing capabilities to provide contextual and comprehensive answers directly within the search interface. Instead of just listing links, SGE synthesizes information from multiple sources to present users with a concise summary, aiming to offer a more direct and efficient way to find information. This experimental feature, part of Google Labs, represents Google's ongoing efforts to integrate AI into its core products, offering a glimpse into the future of search.
Workato
Workato offers an Enterprise MCP (Management and Control Plane) designed to power agentic AI, connecting AI agents to over 1,400 business applications. Built on a leading iPaaS, it provides secure, scalable, and reliable AI workflows. Key features include an Agentic Agent Studio for designing and deploying enterprise-grade agents, Agent Orchestration for coordinating workflows, and Agent Insights for tracking performance. The platform emphasizes context, trust, and action, offering a knowledge base, enterprise search, process intelligence, and real-time signals. It also includes robust observability, governance, security, and compliance features, alongside a Skills Builder, MCP Composition, Registry, and Gateway for managing AI capabilities.
Mihup.ai
Mihup.ai offers a comprehensive Enterprise Voice AI platform designed for scalable human-machine interactions across various industries. The platform provides solutions for automotive, contact centers, IoT, and developers, enabling features like automated virtual agents, voice agents, agent assist, and interaction analytics. It boasts high accuracy in noisy and multilingual environments, supporting over 120 languages, accents, and dialects. Mihup's technology is built on proprietary G2P for unmatched accuracy and is optimized for edge deployment, ensuring low latency, high reliability, and privacy. The platform helps businesses automate customer calls, analyze conversations in real-time, and coach agents with AI, leading to improved efficiency and customer experience.
Enidia AI SARL-S
Enidia AI SARL-S offers the Enidia app, an AI backoffice specifically designed for lawyers and compliance professionals. It provides verification-driven AI, ensuring every answer is traced to its source and gaps are detected. The app features an Associate module for extraction and verification with visual proof of correctness, a Librarian module for internal knowledge search across document history, and a Researcher module for case law and precedent analysis. Enidia AI emphasizes uncompromised data sovereignty, offering on-premise or cloud deployment, and guarantees no public training of client data. It is also designed for compliance with EU AI Act Articles 9, 12, and 14.
vlm_arm
vlm_arm is a project focused on creating human-robot collaborative embodied intelligent agents by integrating robotic arms with large language models and multimodal AI. It enables robotic arms to understand human language, interpret visual input, locate coordinates, plan actions, and format responses. The project utilizes advanced models like Yi-Large, Claude 3 Opus, and GPT-4o for language understanding and multimodal vision. It is designed for robotics research and development, specifically using Elephant Robotics Mycobot 280 Pi with a Raspberry Pi 4B. The project provides tutorials for replication and setup, targeting developers and researchers interested in advanced robotics and AI applications.
Visualyze.AI
Visualyze.AI is an intelligent automation platform designed to build software robots for the workplace. It allows users to create intelligent robots, transfer human intelligence to them, and enable them to acquire understanding skills and precise decision-making abilities. The platform features Robot Studio for building robots, Robot Cloud for managing automated workforces, Robot AI for intelligence transfer, and Document AI for custom document understanding. It boasts a no-code, no-scripting approach with a visual workflow builder, allowing users to model complex workflows and integrate with business applications like SAP, Office 365, and AWS. Visualyze.AI aims to reduce process cycle time, increase capacity, and free up critical resources by automating repetitive tasks.
ProjectFitter.ai
ProjectFitter.ai is an AI-driven platform designed to streamline the recruitment process for tech companies. It leverages advanced AI and machine learning algorithms to precisely match tech specialists' CVs with specific job requirements. The platform offers features such as advanced filtering options, a candidate scoring system, and seamless ATS integration to help recruiters narrow down their search and prioritize interviews efficiently. ProjectFitter.ai also includes onboarding tools and success rate tracking to continuously refine its algorithms and improve hiring outcomes. By automating initial screening and ranking candidates, it aims to accelerate time-to-hire and reduce recruitment costs, ensuring a precise fit for every role while maintaining data protection compliance.
FineTuningLLMs
FineTuningLLMs is the official repository for the book "A Hands-On Guide to Fine-Tuning LLMs with PyTorch and Hugging Face." This resource offers comprehensive guidance and practical code examples for fine-tuning large language models. It covers essential concepts such as quantization, low-rank adapters (LoRA), and dataset formatting templates. The repository features Jupyter notebooks that can be easily run on Google Colab with GPU support, making it accessible for hands-on learning. It delves into topics like loading quantized models, fine-tuning with SFTTrainer, and deploying models locally using formats like GGUF with Ollama or llama.cpp. The guide is designed for an intermediate-level audience, assuming a foundational understanding of deep learning concepts.
nuwax
Nuwax is presented as the world's first universal agent operating system, designed to help users build private, vertical general-purpose AI agents. It serves as a platform for the design, development, and practical application of AI solutions, eliminating the need for coding. The system supports various deployment endpoints and APIs, offering comprehensive capabilities for workflow management, plugin integration, and application development. Nuwax also includes RAG (Retrieval-Augmented Generation) knowledge base and data table storage functionalities, making it suitable for a wide range of users. It supports multiple platforms and provides robust management features for users, audits, public models, content, and tasks.
Huatuo-Llama-Med-Chinese
Huatuo-Llama-Med-Chinese, also known as BenCao, is an open-source project focused on instruction-tuning large language models with Chinese medical knowledge. It leverages models such as LLaMA, Alpaca-Chinese, and Bloom, fine-tuning them with datasets built from medical knowledge graphs and literature using ChatGPT API. This process significantly improves the base models' performance in medical question-answering. The project provides LoRA weights for various base models, enabling efficient fine-tuning. It also introduces a knowledge-finetuning approach that allows models to explicitly utilize knowledge base information during inference, enhancing reliability in generating Chinese medical responses.
mnehmos.multi-agent.framework
mnehmos.multi-agent.framework is an open-source project designed to give Large Language Models (LLMs) a 'nervous system,' transforming them from stateless text predictors into more autonomous 'organisms.' It provides a biological architecture that organizes sensation, reflex, memory, and action into coherent loops. The framework features a multi-layered architecture including Central (Cognition), Somatic (Voluntary Action), Autonomic (Subconscious), and Reflex (Spinal Cord) components. It supports various modes for task decomposition, system design, planning, research, coding, debugging, and knowledge management. Key features include an OODA Loop for decision-making, a TDD Cycle for development, and a Boomerang Protocol for structured data returns, making it suitable for developers building advanced AI agents.
ollama-gui
ollama-gui is a modern web interface designed for interacting with local Large Language Models (LLMs) through the Ollama API. It boasts a clean and responsive user interface, ensuring a smooth chatting experience. Key features include local chat history management using IndexedDB, comprehensive Markdown support for messages, and a dark mode option for user comfort. The tool prioritizes privacy by processing all data locally, ensuring no information leaves your system. It also offers a development proxy for easy network access and supports Docker deployment for simplified setup, allowing users to run both Ollama and the GUI together without complex configurations.
TheBloke Wizard Vicuna 13B Uncensored HF
TheBloke Wizard Vicuna 13B Uncensored HF is an AI chatbot hosted as a Hugging Face Space. This tool offers an uncensored version of the Wizard Vicuna 13B model, allowing users to engage in conversational AI interactions without typical content restrictions. While the live website currently indicates a runtime error, suggesting it may not be fully operational at this moment, the intention is to provide a platform for direct interaction with this specific large language model. It is designed for those interested in exploring the capabilities of uncensored AI models within a readily accessible web environment.
Nested Knowledge, Inc.
Nested Knowledge, Inc. offers an AI-powered software platform designed to revolutionize systematic literature review and meta-analysis. The tool provides comprehensive capabilities for researchers, including advanced search functionalities, efficient screening processes, and robust data extraction tools. It also features powerful visualization insights to help users understand complex data more clearly. By automating and assisting in these critical research stages, Nested Knowledge aims to significantly accelerate the research workflow, enabling the creation of updatable syntheses of evidence and enhancing the overall efficiency and quality of academic research.
NeuralBox
NeuralBox by NeuralCam is an AI-powered visual second brain designed to help users remember anything through photos. Users can easily capture photos, screenshots, and documents, and the tool's advanced AI indexes both objects (semantic image search) and text content (OCR) within them. This allows for effortless retrieval using simple descriptions, eliminating the need for complex organization or tagging. NeuralBox also offers visually similar image browsing, mirroring how the human brain organizes information. It provides efficient on-device and cloud storage, helping to unclutter main photo galleries by housing 'utility photos' like receipts or inspirational designs. The tool supports multiple capture methods, including a lock screen widget and automatic screenshot import, ensuring users can quickly save anything that catches their eye.
PaddleFormers
PaddleFormers is an open-source library built on the PaddlePaddle deep learning framework, designed to offer model interfaces and functionalities comparable to Hugging Face Transformers. It supports the training of both large language models (LLM) and visual language models (VLM). The library leverages PaddlePaddle's inherent advantages in high-performance training, incorporating advanced distributed training strategies like tensor parallelism, pipeline parallelism, and expert parallelism, alongside automatic mixed precision for acceleration. PaddleFormers aims to provide a high-performance, low-resource-consumption training experience, enabling users to efficiently complete large model training without delving into complex optimization details. It supports a wide array of mainstream LLMs and VLMs, including DeepSeek-V3, GLM-4.5 series, Qwen2/3, and ERNIE models, and offers full-lifecycle training capabilities from pre-training to post-training, including CPT, SFT, SFT-LoRA, DPO, and DPO-LoRA.
Early Stage Co
Early Stage Co specializes in providing tailored AI solutions to drive innovation and growth for businesses. They offer a comprehensive suite of services including new product innovation, where they bring ideas to life with a team of engineers, designers, and product managers. Their AI consultancy helps businesses harness the power of artificial intelligence to achieve their goals, with a focus on improving bottom lines and reaching new customers. Additionally, Early Stage Co provides robust software development services, covering the full project lifecycle from conception to implementation, with flexible outsourcing models. They also excel in AI, ML, and data analytics, offering cloud AI services and bespoke strategies to unlock data's true potential.
visual-openllm
Visual-openLLM is an open-source project designed to interactively connect various visual models, functioning similarly to Visual ChatGPT. It is built upon established technologies like ChatGLM, Visual ChatGPT, and Stable Diffusion, positioning itself as an open-source version of '文心一言'. The tool supports ChatGLM3, adding features such as VQA (Visual Question Answering) and Pix2Pix capabilities. Its development roadmap includes support for multi-turn chat, integration with other visual tools, and compatibility with additional large language models, making it a versatile platform for visual AI experimentation and application.
FlashPaper
FlashPaper is an AI writing tool designed to support students and researchers in academic writing tasks. It offers features for generating graduation theses within ten minutes, creating outlines, and paraphrasing text. The tool also includes functionalities for plagiarism detection, text rewriting, and citation generation. FlashPaper aims to simplify the academic writing process by providing AI-powered assistance for various stages, from initial topic generation and literature review to final paper refinement and formatting. It supports tasks like generating opening reports and literature reviews, making it a comprehensive aid for academic work.
Video-LLaMA
Video-LLaMA is an instruction-tuned audio-visual language model designed for comprehensive video understanding. Built upon BLIP-2 and MiniGPT-4, it integrates both Vision-Language (VL) and Audio-Language (AL) branches. The VL branch, utilizing a ViT-G/14 visual encoder and BLIP-2 Q-Former, processes video representations and is trained on datasets like Webvid-2M and LLaVA image captions. The AL branch, powered by ImageBind-Huge, handles audio representations. The tool supports pre-training and fine-tuning stages, allowing for customization and enhanced instruction-following capabilities using datasets from MiniGPT-4, LLaVA, and VideoChat. It is an open-source project, making it accessible for AI researchers and developers to explore and build upon.
transformers_tasks
transformers_tasks is an open-source project on GitHub that integrates various NLP algorithms using the powerful Hugging Face transformers library. It offers implementations for a wide range of tasks, including text matching (PointWise, DSSM, Sentence Bert, SimCSE), information extraction (UIE), prompt tasks (PET, p-tuning), and text classification (BERT-CLS). The project also delves into advanced areas like Reinforcement Learning from Human Feedback (RLHF) for language models, text generation (T5-Based models), and large language model (LLM) applications and training. It provides a flexible framework for researchers and developers to train and fine-tune models using their own datasets.
aidea-server
AIdea Server is a fully open-source application server developed in Golang, designed to integrate a wide range of large language models (LLMs) and image generation models. It supports prominent LLMs such as GPT, Tongyi Qianwen, and Wenxin Yiyan, alongside image generation capabilities like Stable Diffusion (text-to-image, image-to-image, SDXL 1.0), super-resolution, and image coloring. This versatile backend facilitates AI chat, collaboration, and advanced image processing, making it suitable for developers looking to self-host AI services. The project offers a robust framework for modular application development with dependency injection and an in-house ORM for database operations, ensuring a scalable and maintainable architecture.
GLiNER HandyLab
GLiNER HandyLab is a versatile AI tool hosted on Hugging Face Spaces, designed to assist users with a range of Natural Language Processing (NLP) tasks. Users can input any text and select from functionalities such as Named Entity Recognition (NER), Question Answering, Open Information Extraction, Summarization, Relation Extraction, and Text Classification. This makes it a valuable resource for quickly processing and understanding textual data without needing to set up complex environments. The application is suitable for educational purposes, research, and anyone looking to experiment with advanced NLP models.