💻

Coding & Development

Browsing page 19 of AI tools for Testing & QA in Coding & Development. Sorted by confidence score — our independent quality rating.

All Backend & APIs Code Assistants Coding Agents Database & SQL DevOps & Infrastructure Documentation Frontend & UI Game Development Mobile Development No-Code / Low-Code Open Source & Models Prompt Engineering Testing & QA Vibe Coding Web Scraping & Automation

DryRun Security

60%

DryRun Security is an AI-native code security platform designed to enhance application security with high accuracy and minimal noise. It functions as a Static Application Security Testing (SAST) tool, providing rapid feedback on Pull Requests (PRs) and comprehensive Code Security Intelligence. The platform leverages Contextual Security Analysis to understand data flow, architecture, and change history, identifying logic flaws and broken authentication that traditional pattern-matching scanners often miss. DryRun Security eliminates the need for maintaining brittle rules, offering AI-driven custom policy checks in every PR. It supports a wide range of languages including Python, Ruby, JavaScript, Java, and C#, and integrates with popular tools like GitHub, GitLab, and Slack, acting as a dedicated secure code review agent.

TAMI Studio by UI-licious

60%

TAMI Studio by UI-licious is an AI-powered test automation platform designed to simplify UI testing. It leverages Vision AI to convert screenshots and recordings into readable and maintainable test cases and automation scripts. The platform allows users to plan, automate, and run UI tests effortlessly, offering features like AI-suggested test cases, automated test script generation, and bug report writing based on failed tests. It integrates with a low-code UI test framework and supports parallel test execution on a cloud testing grid across major browsers. TAMI Studio also provides test case management, scheduling, and notification functionalities, making it a comprehensive solution for UI testing.

Getgud.io

60%

Getgud.io offers a comprehensive platform for game developers to gain smarter insights into player behavior, leading to happier players and increased revenue. The tool provides total gameplay observability, allowing teams to replay every session, analyze player actions, debug issues faster, and detect cheats and toxicity. It features a unified dashboard to query player data, matches, and user reports, and define automated actions via webhooks. Getgud.io also includes AI-powered chat moderation to detect and act on toxic chat across multiple languages, and advanced game analytics to understand weapon dominance, map performance, and character balancing. It helps in crafting personalized campaigns for player engagement and retention.

Mechatronics Innovation Lab - MIL

60%

Mechatronics Innovation Lab (MIL) serves as Norway's leading technology hub, fostering collaboration between industry, academia, and the public sector to advance mechatronics and AI. MIL's core mission is to enhance the competitiveness of its clients by providing access to cutting-edge technology, specialized expertise, and an extensive international network. The lab offers services in industrial 3D printing, robotics, and artificial intelligence, enabling businesses to test, develop, and innovate solutions. MIL focuses on helping clients increase global competitiveness, develop sustainable solutions, bring production back to Norway, and create exciting job opportunities, acting as a crucial partner for technology adoption and innovation.

aideml

60%

aideml, or AIDE ML, is an open-source machine learning engineering agent designed for AI-driven exploration in the space of code. It functions as a tree-search agent that autonomously drafts, debugs, and benchmarks code to maximize or minimize user-defined metrics. The tool is provided as a research-friendly Python package, complete with utilities like a command-line interface (CLI), visualization tools, and configuration presets. This allows academics and engineer-researchers to replicate the underlying paper, test new ideas, and prototype ML pipelines efficiently. Key capabilities include natural-language task specification, iterative agentic tree search, and utility features like an HTML visualiser and Streamlit UI.

Span

60%

Span is an AI-native developer intelligence platform designed to enhance software engineering performance. It offers engineering leaders the ability to reduce coordination costs and gain valuable insights into developer productivity. A key feature of Span is its AI Code Detector, powered by span-detect-1, which is specifically built for identifying AI-generated code. The platform aims to deliver a comprehensive view of engineering impact and overall team health, enabling better decision-making and optimized development processes.

AILiveSim

60%

AILiveSim is a cloud-based simulation platform designed for validating AI and autonomous systems. It offers a robust environment for AI training and testing, allowing teams to conduct realistic, scenario-driven simulations to enhance AI performance. The platform supports comprehensive testing of perception, navigation, and control functionalities across diverse conditions, ensuring thorough validation before real-world deployment. By providing advanced simulation capabilities, AILiveSim helps organizations mitigate testing risks, accelerate the development cycle, and achieve autonomy readiness for applications in maritime, robotics, autonomous vehicles, and mobile machines. It is a critical tool for deep learning, dataset generation, and various AI testing methodologies including regression and functional testing.

ConeLabs

60%

ConeLabs offers an AI-powered platform designed to revolutionize inspections for various infrastructures, including buildings, utilities, and transport. The platform allows users to capture data using off-the-shelf hardware like smartphones or drones, then upload images for high-detail 3D reconstruction with photorealistic textures. It provides multiple inspection tools, including measurements, annotations, and labeling, all accessible through a web-based application. ConeLabs emphasizes seamless sharing of models, enabling remote teams to collaborate effectively. By automating manual and repetitive processes, the platform empowers engineers and technicians with centralized, comprehensive tools to increase quality, productivity, and minimize risks in inspection workflows.

Fulloop AI

60%

Fulloop AI leverages artificial intelligence to transform the technical interview process, offering AI interviewers that can conduct multi-round assessments. The platform is designed to streamline the hiring workflow for technical roles, supporting various assessment types including coding challenges and system design questions. By providing structured feedback, Fulloop AI helps companies gain deeper insights into candidate performance, ultimately improving the efficiency of their hiring process. It aims to help identify top tech talent faster and more effectively. The tool offers both autopilot and co-pilot modes, providing flexibility in how interviews are managed and assessed.

Fume (YC W24)

60%

Fume is an AI testing tool designed to automate end-to-end browser testing by acting as an autonomous QA team. Users record a Loom-style video describing what they want to test, and Fume generates and maintains Playwright tests. It extracts test cases in seconds, covering major user flows quickly. Fume offers self-healing tests with AI agent fallbacks for Playwright's fragility, ensuring tests continue even if scripts break. It provides free test runners in the cloud and allows full ownership of the generated Playwright code, which can be run locally or integrated into existing CI/CD pipelines. Fume also supports migrating existing Playwright, Selenium, or Cypress tests.

Agora Demo

60%

Agora Demo is a Hugging Face Space designed to simulate how different AI agents communicate using the Agora protocol. Users can select specific tasks and choose various AI models for the agents involved in the simulation. The application then visualizes the entire communication process, providing insights into how these agents interact and exchange information. This demo is particularly useful for researchers and developers interested in understanding and experimenting with multi-agent systems and communication protocols in an AI context. It offers a practical, hands-on way to observe complex AI interactions.

Auto Benchmark

60%

Auto Benchmark is a Hugging Face Space designed to facilitate performance benchmarking for machine learning models. Users can select a model, task, and various backends such as OpenVINO, PyTorch, and IPEX to evaluate their models' efficiency. This tool is particularly useful for AI developers and machine learning engineers who need to compare the performance of different models or optimize existing ones across various hardware and software configurations. By providing a streamlined interface for benchmarking, Auto Benchmark helps in making informed decisions about model deployment and optimization strategies, ensuring models run efficiently on target platforms.

Camera Settings As Tokens

60%

Camera Settings As Tokens is an AI tool hosted on Hugging Face Spaces designed for generating detailed images. Users can input a text description and then fine-tune various camera settings, such as focal length and ISO speed, to achieve specific visual outcomes. This functionality allows for greater control over the generated image's aesthetic, moving beyond simple text-to-image prompts. The tool is intended for AI enthusiasts, developers, and researchers who wish to explore the impact of photographic parameters on AI-generated visuals. While the concept is innovative, the current live website indicates a runtime error, suggesting the tool is not currently operational.

AI-inspect

60%

AI-Inspect specializes in AI-based vision systems to automate inspections of industrial products and company assets. The platform provides an end-to-end solution, encompassing high-speed, multi-camera vision capture tailored to specific workflows, a robust AI engine for real-time detection and analysis, and a dashboard for insights with continuous optimization. It serves industries such as Fresh Food & Produce, Logistics & Warehousing, Automotive & Mobility, Manufacturing & Assembly, and Infrastructure & Assets. The multidisciplinary engineering team, with expertise in aerospace engineering, AI, and machine vision, designs and integrates complete vision systems, including standalone indoor/outdoor inspection systems and modules for existing production lines. Key capabilities include anomaly detection, controlled environment engineering, virtual prototyping, object segmentation, multi-object tracking, and high-speed vision.

Cetvel

60%

Cetvel is a comprehensive tool designed to serve as a unified benchmark for evaluating Turkish Large Language Models (LLMs). Developed by KUIS-AI, this application enables researchers and developers to assess and compare the performance of different Turkish language models across a variety of linguistic tasks and datasets. Users can gain insights into how models perform, facilitating informed decisions for model selection and development. The tool is built with Streamlit, ensuring an interactive and user-friendly experience, and is licensed under the MIT license, promoting open access and collaboration within the AI community. It is hosted as a Hugging Face Space, making it easily accessible for anyone interested in Turkish LLM evaluation.

Code Llama - Playground

60%

Code Llama - Playground is an AI tool designed for code generation and text completion, leveraging the Code Llama model. It provides a user-friendly interface to input code or text prompts and receive generated content. The platform allows for customization through adjustable settings, such as temperature, giving users control over the creativity and randomness of the output. Built as a Hugging Face Space, it offers an accessible environment for developers and other technical users to experiment with AI-powered code assistance and explore various code-related functionalities. This tool is ideal for quick prototyping, learning, and testing different code snippets or text generations.

CyberSecEvalTest

60%

CyberSecEvalTest is a specialized tool designed for evaluating the cybersecurity posture of large language models (LLMs). Developed by AI at Meta, this application offers a comprehensive suite of tests to identify potential risks and assess the security capabilities of LLMs. It features a public leaderboard that ranks different models based on their performance in these evaluations, alongside visual analysis tools to help users understand the strengths and weaknesses of each LLM. The platform is hosted on Hugging Face Spaces, making it accessible for researchers and developers interested in enhancing the security of AI systems. It operates under the Apache-2.0 license, promoting open collaboration and development in the field of AI security.

DMOSpeech2 Demo

60%

DMOSpeech2 Demo is a Hugging Face Space that provides a demonstration of the DMOSpeech 2 model. This tool enables users to generate natural-sounding speech by uploading a reference audio and providing text input. It offers different modes to balance between generation speed and output quality, making it versatile for various applications. The demo is ideal for individuals interested in experimenting with advanced speech synthesis technology and understanding its capabilities in voice cloning and text-to-speech conversion.

Dots Demo

60%

Dots Demo is an AI demonstration tool hosted on Hugging Face Spaces, designed for testing and research purposes. This application allows users to interact with an AI assistant by typing any question or request and receiving immediate, helpful responses. The AI can provide suggestions, develop plans, or generate written content based on the user's input. It serves as a practical example of AI's capabilities in real-time interaction and content generation, making it suitable for those exploring AI applications or conducting academic research. The tool is available for free and operates under an Apache-2.0 license.

Teste.ai

60%

Teste.ai is a comprehensive AI-powered platform designed to enhance software quality assurance. It enables testers to quickly generate test cases, scenarios, step-by-step guides, and test data using artificial intelligence. The tool supports a wide range of testing types, including API, functional, security, and performance testing. By leveraging advanced AI models and specialized prompts, Teste.ai aims to significantly reduce the time and effort involved in test creation and specification, allowing QA professionals to cover more requirements and improve test coverage. It also offers features like SQL query generation for specific data mass and integration with OpenAI's language models for precise results. Teste.ai is ideal for QA professionals and software developers looking to boost productivity and efficiency in their testing processes.

Max Stack Labs

60%

Max Stack Labs is an IT service and solutions company established in 2016, specializing in a broad range of technology offerings. Their core services include application testing, artificial intelligence development, business intelligence, blockchain solutions, and web/mobile application development. They also provide CRM and high-end solutions for quality management products. Max Stack Labs emphasizes AI development, creating powerful, custom AI machines to help businesses save time, money, and resources, ultimately improving ROI. Additionally, they offer custom software development, quality assurance with test automation, Open ERP services (including Odoo development and implementation), and resource augmentation to enhance client teams.

Falcon H1 Playground

60%

The Falcon H1 Playground is a Hugging Face Space designed for interacting with Falcon-H1 AI models. Users can input messages and engage in conversations, with the flexibility to choose different model sizes. The platform also offers adjustable parameters such as creativity and response length, allowing for a tailored conversational experience. This playground serves as an experimental environment for exploring the capabilities and behaviors of the Falcon-H1 models, making it suitable for those interested in natural language processing and AI model interaction.

Falcon H1R Playground

60%

Falcon H1R Playground is a chat demo designed to showcase and allow interaction with Falcon-H1R reasoning models. Users can engage in real-time conversations with AI assistants by typing questions or prompts and receiving immediate responses. The platform supports multiple chat sessions, enabling users to switch between different interactions and test various aspects of the AI's reasoning capabilities. This tool is particularly useful for exploring advanced language models and understanding their conversational and reasoning prowess.

SWE-Chatbot-Arena

60%

SWE-Chatbot-Arena provides a platform for evaluating and comparing various chatbots specifically designed for software engineering tasks. Users can input a link to a GitHub, GitLab, or Hugging Face repository, or even a specific resource like an issue, pull request, commit, or file. The application then extracts pertinent data, such as README files, issue descriptions, or pull request diffs, to facilitate the chatbot evaluation process. This tool is ideal for developers and researchers interested in assessing the performance and capabilities of AI agents in a software development context.

EXPLORE OTHER CATEGORIES

🎨 Content & Design 📊 Productivity & Business 🤖 AI Agents & Automation 📚 Research & Education 🧘 Wellness & Lifestyle 💼 Career Development 📈 Marketing & Growth 📉 Data & Analytics 💬 Customer Support & CX 💰 Finance 🛒 E-commerce