Coding & Development
Browsing page 36 of AI tools for Open Source & Models in Coding & Development. Sorted by confidence score — our independent quality rating.
php-text-analysis
php-text-analysis is a comprehensive PHP library designed for developers to integrate Information Retrieval (IR) and Natural Language Processing (NLP) capabilities into their applications. It provides a wide array of functionalities, including document classification, sentiment analysis, and text summarization. Developers can also leverage its tools for frequency analysis, tokenization, stemming, collocations with Pointwise Mutual Information, lexical diversity, and corpus analysis. The library supports keyword extraction using the Rake algorithm and offers an easy-to-invoke PHP implementation of Vader for sentiment scoring. For document classification, it includes a Naive Bayes implementation, making it a versatile solution for various text analysis needs in PHP-based projects.
BERTIN GPT-J-6B
BERTIN GPT-J-6B is an open-source AI model specifically developed for Spanish language text generation. Built upon the GPT-J-6B architecture, this model is trained using the mc4-es-sampled dataset, making it highly proficient in understanding and generating Spanish text. It is hosted on Hugging Face Spaces, providing a platform for developers and researchers to access and utilize it for various natural language processing tasks in Spanish. While the live website currently indicates a runtime error due to hardware capacity, the project aims to offer a robust solution for Spanish language AI applications.
punica
Punica is an open-source project designed to efficiently serve multiple LoRA (Low-Rank Adaptation) finetuned Large Language Models (LLMs) simultaneously. While pretrained LLMs can be massive, LoRA finetuned models add only about 1% storage and memory overhead. Punica leverages a custom CUDA kernel, called Segmented Gather Matrix-Vector multiplication (SGMV), to efficiently compute the LoRA addon, preserving the strong batching effect of the pretrained model. This approach allows Punica to achieve significantly higher throughput, up to 12 times, compared to other systems like HuggingFace Transformers, DeepSpeed, FasterTransformer, and vLLM, across various LoRA model popularity settings. It can be installed from binary packages or built from source, supporting different CUDA, Python, and PyTorch versions.
CogVideoX Fun 5b
CogVideoX Fun 5b is an AI video generation tool hosted on Hugging Face Spaces by alibaba-pai. This application allows users to generate short videos based on textual descriptions of a scene. Additionally, it offers a unique feature where users can upload an existing video with empty or incomplete areas, and the system will intelligently fill them in according to user input. This makes it a versatile tool for experimenting with video generation models and creative video editing. The tool is built with Gradio, indicating an accessible and user-friendly interface for interaction. It is licensed under an open-source license, promoting accessibility and community engagement.
ControlNet Uncanny Faces
ControlNet Uncanny Faces is an AI tool hosted on Hugging Face Spaces, designed for generating images with a specific focus on faces. While the live website currently indicates a build error, the tool's purpose is to allow users to explore AI capabilities in face generation. As a Hugging Face Space, it typically offers a platform for developers and users to experiment with machine learning models and applications. The tool is presented as a free application, making it accessible for individuals interested in AI-driven image creation without a financial barrier. Its primary function revolves around leveraging ControlNet for nuanced facial image synthesis.
DeepLab
DeepLab is a technology company focused on empowering businesses with deep intelligence through custom machine learning solutions. They offer expertise in designing and developing scalable ML infrastructure, employing cutting-edge technology to productize ideas and facilitate fast experimentation. DeepLab also provides production-grade algorithmic solutions for products serving billions of users, optimizing KPIs and delivering competitive advantages. The company continuously invests in core machine learning and deep learning research, pushing boundaries in areas like Transfer and Continual Learning, as well as applications such as Computer Vision and Language. Their services include building innovative AI-powered solutions for recommender systems, cybersecurity, fraud detection, pricing optimization, and automotive driver behavior modeling.
Dia2 2B
Dia2 2B is an advanced AI tool developed by Nari Labs, designed for real-time streaming conversational audio. Users can input a back-and-forth script and optionally add short voice prompt files for each speaker to condition the model. By adjusting a few sampling sliders, the tool generates a single audio file that voices the entire conversation. This capability makes it ideal for creating dynamic and natural-sounding dialogues without needing the complete text input upfront, offering a flexible solution for various audio generation needs.
react-native-executorch
React Native ExecuTorch provides a declarative and efficient way for developers to integrate and run AI models directly on mobile devices using React Native, leveraging Meta's ExecuTorch framework. This open-source library simplifies the process of deploying on-device AI, eliminating the need for extensive native programming or machine learning expertise. It offers out-of-the-box support for various AI tasks, including large language models (LLMs), computer vision, speech-to-text, text-to-speech, object detection, and image/text embeddings. The tool is designed for the New React Native architecture and includes demo applications to showcase its capabilities. Developers can also export and run their own AI models in the .pte format.
ruby-nlp
ruby-nlp is an extensive, open-source collection of links dedicated to Natural Language Processing (NLP) resources for Ruby developers. This repository categorizes and lists a wide array of libraries, tools, and software, covering diverse NLP functionalities. From APIs for third-party NLP services and instant messaging bots to advanced topics like machine learning, named entity recognition, and text summarization, ruby-nlp offers a curated guide. It includes resources for tasks such as language detection, text classification, sentiment analysis, and machine translation, making it an invaluable hub for anyone looking to implement NLP capabilities in Ruby applications.
Gemma Fine Tuning
Gemma Fine Tuning is a web-based application hosted on Hugging Face Spaces, designed to simplify the process of fine-tuning Google's Gemma models. Users can upload and preprocess their own datasets, configure various model parameters, and initiate the training of Gemma models. A key feature is the ability to export the fine-tuned models in multiple formats, making them versatile for different deployment scenarios. This tool provides an accessible interface for individuals and researchers looking to customize large language models for specific tasks or domains without extensive coding knowledge.
Veritone
Veritone is a human-centered AI technology leader providing innovative AI solutions across diverse industries such as media and entertainment, legal and compliance, and government. The platform, aiWARE, tokenizes unstructured data like video, audio, and text into AI-ready tokens, powering smarter models and automated workflows. Veritone helps businesses accelerate decision-making, improve efficiency, and unlock extraordinary potential by transforming media into actionable data. Its offerings include solutions for public safety productivity, retail talent acquisition, and custom AI development, enabling organizations to achieve measurable business outcomes and grow revenue.
GPT-OSS-120B on AMD MI300X
GPT-OSS-120B on AMD MI300X is an AI chatbot hosted on Hugging Face Spaces, designed to run on AMD MI300X GPUs. This tool offers a simple chat interface where users can input questions or requests and receive spoken-language responses from the GPT-OSS-120B model. It provides flexibility by allowing users to adjust the system prompt and temperature, enabling customization of the AI's behavior and output. This makes it suitable for experimentation and research with large language models, offering a platform to explore different conversational AI scenarios and model responses. The tool is open-source, licensed under Apache 2.0, promoting accessibility and collaborative development within the AI community.
FLUXllama gpt-oss
FLUXllama gpt-oss is an AI tool hosted on Hugging Face Spaces, designed for generating high-resolution images from text descriptions. It leverages FLUX 4-bit Quantization for efficient image model processing. Users can provide a short text prompt, and the application will create a corresponding image. For richer and more detailed results, the tool includes an AI that can first improve the user's initial prompt with additional artistic and descriptive elements. This makes it suitable for experimentation with advanced image generation techniques and for users looking to produce visually enhanced outputs from concise inputs.
Boltzbit
Boltzbit is an AI research and development lab specializing in customized generative AI models with live learning capabilities. Their core mission is to advance General Learning Intelligence (GLI) as the new path to AGI 2.0, aiming to empower individuals with unique AI that learns from private data. The platform offers features like custom LLMs, a family of office applications (Space), and an API for integration. Boltzbit also conducts research into areas such as gradient-based hyperparameter optimization for Hamiltonian Monte Carlo and novel MCMC samplers using quasi-Newton methods. Their work extends to practical applications in financial services, developing real-time, adaptive AI solutions.
Vulcan - Security for GenAI
Vulcan provides an enterprise AI trust layer, offering comprehensive security for GenAI models and applications. It features Vulcan Attack, a patented engine for testing and uncovering vulnerabilities with extensive jailbreak and risk libraries, and Vulcan Protect, which continuously monitors GenAI interactions to detect jailbreaks, prevent data leakage, and enforce moderation. The platform also offers expert-led AI Red Teaming, security penetration testing, vulnerability assessments, and External Attack Surface Management (EASM). Vulcan is designed for GenAI, ensuring safety, security, and operational integrity, and integrates seamlessly into CI/CD pipelines for continuous compliance and protection. It supports multilingual and culturally sensitive threat detection, adhering to global AI standards like OWASP Top 10 for LLMs and MITRE ATLAS™.
Speech-AI-Forge
Speech-AI-Forge is an open-source project designed for advanced Text-to-Speech (TTS) generation, offering both an API server and a user-friendly Gradio-based WebUI. It supports a wide array of TTS models, including ChatTTS, CosyVoice, FishSpeech, GPT-SoVITS, and F5-TTS, along with ASR capabilities using Whisper and SenseVoice. Key features include speaker switching, custom voice uploads, style control, long text inference, and audio adjustment options like speed, pitch, and volume. The platform also provides tools for SSML script editing, podcast creation, and voice management, making it a versatile solution for developers and content creators looking to integrate or experiment with cutting-edge speech AI.
Speech-Emotion-Analyzer
Speech-Emotion-Analyzer is an open-source project designed to build a machine learning model capable of detecting emotions from speech. The neural network model can identify five different male/female emotions from audio speeches, leveraging deep learning, natural language processing (NLP), and Python. The project utilizes datasets like RAVDESS and SAVEE for training, extracting features using the LibROSA library. While Multilayer Perceptrons and Long Short Term Memory models were explored, a Convolutional Neural Network proved most effective, achieving over 70% accuracy in emotion detection and 100% accuracy in distinguishing male/female voices. This tool has potential applications in various industries, such as marketing for personalized product recommendations or automotive for adjusting autonomous car behavior based on driver emotion.
AdminForth
AdminForth is an open-source framework designed to accelerate the development of back-office applications. Built with Vue3 and Node.js, it offers a comprehensive admin panel that is highly extendable and customizable using TailwindCSS. Key features include OWASP-compliant authentication and authorization, OAuth2/OpenID SSO integration, 2FA, and user management. Developers can create custom pages and dashboards with Vue3 components, integrate LLM-based translation, audit logging, and AI autocomplete plugins. It also supports file uploads to Amazon S3, advanced data filtering, rich text editing, and CSV import/export, making it a versatile solution for various administrative needs.
StreamSpeech
StreamSpeech is an innovative open-source project offering an "All in One" seamless model for comprehensive speech processing. It supports both offline and simultaneous speech recognition (ASR), speech-to-text translation (S2TT), and speech-to-speech translation (S2ST), alongside real-time speech synthesis (TTS). A key differentiator is its ability to present intermediate ASR or translation results during simultaneous translation, enhancing low-latency communication. The tool is designed for researchers and developers working with speech technologies, providing models for various language pairs like French-English, Spanish-English, and German-English, and includes a Web GUI demo for local browser experience.
text-to-lora
text-to-lora offers a reference implementation of Text-to-LoRA (T2L), a system designed for instant transformer adaptation. This tool leverages hypernetworks to efficiently adapt Large Language Models (LLMs) for various benchmark tasks. A key feature is its ability to perform these adaptations using only textual descriptions of the desired tasks as input, simplifying the process of fine-tuning LLMs. The project provides detailed instructions for installation, running demos, generating LoRAs from the command line, and evaluating generated LoRAs. It also includes comprehensive guides for both SFT (Supervised Fine-Tuning) and Reconstruction training, making it a valuable resource for researchers and developers working with LLM adaptation.
Sentient.io
Sentient.io offers a comprehensive AI & Data platform designed to empower enterprises with intelligent solutions. The platform provides ready-made AI models for easy adoption, allowing businesses to quickly integrate AI capabilities into their operations. Additionally, Sentient.io delivers turnkey AI solutions, catering to the specific needs of enterprises embarking on their digitalization journey. The platform emphasizes security and aims to simplify the process of leveraging artificial intelligence for various business applications, making advanced AI accessible for enhanced decision-making and operational efficiency.
AlphaMonarch-7B
AlphaMonarch-7B is an AI language model hosted on Hugging Face Spaces, designed to generate text responses based on user input. This application allows users to ask questions, request various writing tasks, and engage in conversational interactions. It is capable of providing detailed answers and can process a diverse range of prompts. The tool is presented as a web application, making it accessible through a browser. It is developed by Maxime Labonne and is licensed under Apache-2.0, indicating its open-source nature and potential for research and experimentation in natural language processing.
SkyChat-Chinese-Chatbot-GPT3
SkyChat-Chinese-Chatbot-GPT3 is an open-source chatbot project developed using the Chinese GPT-3 API. It offers a range of functionalities beyond basic conversation, including human-machine chat, question and answer capabilities, and Chinese-English translation. The tool can also perform creative tasks such as content continuation, generating couplets, writing Chinese ancient poems, creating recipes, third-person narration, and generating interview questions. It provides demo usage tutorials for both Python and Unity environments, detailing the setup of model services and Python environments, including dependency installation and API key integration. This makes it a versatile tool for developers and researchers interested in Chinese natural language processing applications.
Learning Machines
Learning Machines specializes in building production-ready GenAI systems and enterprise-grade AI solutions. Their services span from agentic architectures and multi-agent systems to classical machine learning, data pipelines, and cloud infrastructure, delivering full-stack solutions at enterprise scale. They assist organizations in ingesting data, building ML models for decision-making, and automating production, distribution, and service. Their expertise includes backend systems, APIs, microservices, enterprise integration, data engineering, GenAI & agentic systems, machine learning, and cloud architecture. They guide clients through discovery, strategy, build, integration, deployment, scaling, and continuous optimization of AI capabilities.