#Github

Logo of ComfyUI_IPAdapter_plus
ComfyUI_IPAdapter_plus
The IPAdapter Plus by ComfyUI provides sophisticated models for efficient image-to-image style and subject transfer. It features precise style transfer capabilities, compatibility with Kolors FaceIDv2, and optimized memory usage for long animations. Its experimental ClipVision Enhancer improves high-resolution visuals. The project remains open-source, powered by sponsorship, with comprehensive video guides and installation instructions available on GitHub to fully utilize its advanced image conditioning functionalities in creative applications.
Logo of gold-miner
gold-miner
An open translation project that converts top-notch tech articles from Juejin, including areas like blockchain, AI, and more. With 4000+ articles and input from 1500 translators, it is a vital resource for developers looking to explore new technologies.
Logo of IMS-Toucan
IMS-Toucan
IMS Toucan is a leading toolkit for multilingual Text-to-Speech Synthesis, supporting over 7000 languages. Created at the Institute for Natural Language Processing, University of Stuttgart, it provides a quick and adjustable solution, functioning efficiently with minimal computing power. Free access through Hugging Face allows exploration of demos and use of a comprehensive multilingual TTS dataset. Easy-to-follow installation instructions are available for Linux, Windows, and Mac, ensuring versatility in training and inference, with the option of using pretrained models for enhanced efficiency.
Logo of autoscraper
autoscraper
AutoScraper provides an efficient solution for automatic web scraping in Python, known for its user-friendly operation, speed, and minimal resource usage. It learns scraping patterns from provided data or URLs to gather similar content from additional pages. Compatible with Python 3, installation is possible through Git or PyPI. It's effective for retrieving data like StackOverflow question titles or Yahoo Finance stock prices. The tool supports custom requests with proxies or headers for greater flexibility. Model saving/loading enhances reusability, while tutorials offer guidance for advanced applications including API development with Flask.
Logo of python-weekly
python-weekly
Python Trending Weekly compiles top articles, tutorials, and projects from diverse sources to aid in Python proficiency and career advancement. The publication employs a subscription model from Issue 47, with earlier editions eventually becoming free. Access content via GitHub, Substack, or Telegram for convenient learning.
Logo of sonnet
sonnet
Sonnet, created by DeepMind researchers, provides a flexible programming structure for machine learning advancements using TensorFlow 2. It emphasizes modularity with `snt.Module`, aiding in the development of neural networks adaptable to various learning forms. Sonnet supports both predefined modules and custom-built ones, such as `snt.Linear`, `snt.Conv2D`, and `snt.nets.MLP`. While lacking an integrated training framework, it empowers users to leverage existing solutions or create new ones, supporting distributed learning. Simple installation and illustrative examples on Google Colab make Sonnet accessible for constructing complex machine learning models.
Logo of neural_prophet
neural_prophet
NeuralProphet integrates neural networks with traditional algorithms on PyTorch, providing an accessible platform for high-frequency time series prediction. Designed for collaborative development, it supports rapid model customization, emphasizing clarity and adaptability. While initial predictions may require tuning, iterative refinement results in precise models. Optimized for datasets covering a minimum of two years, it includes features like autoregression and seasonality. Participate in community discussions on GitHub and Slack, and utilize comprehensive tutorials. A suitable choice for developers exploring open-source forecasting solutions.
Logo of InternGPT
InternGPT
InternGPT introduces a unique visual interaction system using pointing devices, enhancing ChatGPT's efficiency in vision-centric tasks. Utilizing the Husky model, it specializes in multi-modal dialogues and visual scenarios, allowing for image editing and generation, and more. Community-driven, it continually evolves, promising improved functionalities and user experiences.
Logo of llm-action
llm-action
Explore an open-source project offering detailed guidance on LLM training and fine-tuning with NVIDIA GPUs and Ascend NPUs. The resources cover parameter-efficient methods like LoRA and QLoRA and introduce distributed training techniques. Access practical examples using frameworks like HuggingFace PEFT, DeepSpeed, and Megatron-LM to enhance large language models. Understand distributed AI framework complexities and learn effective LLM deployment strategies.
Logo of safeguards-shield
safeguards-shield
This toolkit offers a secure solution for managing LLM interactions, mitigating significant risks in GenAI applications. It includes over 20 detectors for thorough protection and enables the customization of LLM behaviors. Additionally, it tracks incidents, expenses, and responsible AI metrics while addressing risks such as bias, toxicity, and privacy through multi-layered defense.
Logo of StabilityMatrix
StabilityMatrix
StabilityMatrix provides a versatile solution for Stable Diffusion, featuring a multi-platform package manager and inference UI with seamless one-click installs for packages like Automatic 1111 and Fooocus. Its customizable interface supports syntax highlighting and project files, while integrated model browsing from CivitAI and HuggingFace enhances usability. Fully portable and efficient, it simplifies AI project management across diverse systems.
Logo of cortex
cortex
Cortex is a Local AI API Platform that allows for the running and customization of Language Learning Models (LLMs) using a user-friendly CLI. This platform, implemented in C++, integrates with Huggingface and Cortex's own models, and supports various engines like llama.cpp, ONNXRuntime, and TensorRT-LLM. Available with local and network installers, it provides cross-platform compatibility across Windows, MacOS, and Linux. Cortex enables flexible model management and can be deployed as a standalone API or integrated into other applications. The platform supports multiple model quantizations and is on track to include full OpenAI API features.
Logo of Pytorch-UNet
Pytorch-UNet
This PyTorch-based U-Net implementation enhances high-definition image segmentation, particularly for challenges like Kaggle's Carvana Image Masking. Featuring Docker for straightforward deployment and mixed precision optimization, the model boasts a Dice coefficient of 0.988423 across vast test sets. The project supports diverse segmentation applications, such as medical and portrait, and offers seamless training and inference with Weights & Biases for live training progress. Pretrained models are accessible for swift application.
Logo of reor
reor
Reor is an AI-powered app for managing notes, automatically linking and searching content, and generating flashcards, all stored locally. Leveraging local models, Reor integrates tools like Ollama and Transformers.js, with a markdown editor similar to Obsidian for easy editing and organization. Supports effective Q&A and cross-referencing, simplifying digital note management. Includes local model running, note importing, and supports open-source contributions.
Logo of mem0
mem0
Mem0 offers an innovative memory layer for AI, enhancing personalization by adapting to user needs. Ideal for AI assistants, chatbots, and autonomous systems, this open-source solution uses a hybrid database for efficient memory management. Key features include multi-level memory retention and a user-friendly API. Easily integrate Mem0 to optimize memory retrieval and ensure context-aware interactions. Choose from managed services or self-host with simple pip installation, supporting Graph Memory and various LLMs.
Logo of spaCy
spaCy
Explore spaCy's robust NLP platform supporting over 70 languages using state-of-the-art neural networks. Access pretrained pipelines for essential tasks like tokenization, named entity recognition, and text classification. Leverage multi-task learning with BERT transformers, ensuring easy deployment and production-readiness. Enhance projects with custom models in frameworks like PyTorch or TensorFlow, and utilize powerful visualizers for syntax and NER. This open-source software, under the MIT license, offers high accuracy and extensibility for all your NLP needs.
Logo of langchaingo
langchaingo
Explore how Go developers can integrate large language models (LLMs) seamlessly using LangChain. With detailed documentation and practical examples, this project supports efficient AI solution development and experimentation with OpenAI and others. Access valuable resources and community support for innovative AI applications in Go.
Logo of distilabel
distilabel
Distilabel is a framework for creating synthetic data and obtaining AI feedback, serving those developing NLP and LLM projects. It facilitates the creation of high-quality, varied datasets using established research techniques. The framework allows engineers to concentrate on enhancing data quality and controlling model tuning, integrating feedback across LLM providers with a single API. As an open-source, community-supported project, Distilabel ensures scalable and adaptable data generation pipelines to enhance the efficiency and quality of AI development.
Logo of mit-deep-learning
mit-deep-learning
Explore the MIT Deep Learning repository, which features a well-rounded set of tutorials focused on neural network basics, driving scene segmentation, and advanced techniques like generative adversarial networks. The DeepTraffic competition further enriches your learning experience by offering practical challenges in deep reinforcement learning. This evolving resource, aligned with MIT's ongoing courses, serves as a beneficial tool for newcomers and experienced practitioners in artificial intelligence.
Logo of lorax
lorax
LoRAX is a cost-effective framework for serving fine-tuned large language models efficiently on a single GPU, maintaining high throughput and low latency. It enables dynamic adapter loading and merging from various sources such as HuggingFace and Predibase, ensuring seamless concurrent processing. With support for heterogeneous batching, optimized inference, and ready-for-production tools like Docker images and Prometheus metrics, LoRAX is well-suited for diverse deployment scenarios. This platform supports models like Llama and Mistral and is free for commercial use under the Apache 2.0 License.
Logo of ChainForge
ChainForge
This open-source platform simplifies comparative prompt engineering and LLM response evaluation. It enables users to simultaneously query multiple LLMs, offering quick comparisons in response quality across various prompts and models. Supporting model providers like OpenAI and Google PaLM2, the platform provides robust tools for setting evaluation metrics and visualizing results. With features like prompt permutations, chat turns, and evaluation nodes, it facilitates a thorough analysis of prompt and model efficiency. Encouraging experimentation and sharing, it includes functionalities for exporting results and integrating evaluations into research projects, making it a practical tool for researchers.
Logo of WebGPT
WebGPT
WebGPT offers an educational platform with near-native GPU access in web applications. It leverages JavaScript and HTML to provide a transformer model ideal for educational use, with tests on up to 500M parameters demonstrating its adaptability. Compatible with Chrome v113 and Canary, WebGPT supports practical models like GPT-Shakespeare and GPT-2 117M. This tool provides insights into GPU optimization, simplifying the experimentation with web-based neural networks and focusing on efficiency and resource management.
Logo of Firefly
Firefly
Firefly is a versatile tool for training large models, offering pre-training, instruction fine-tuning, and DPO functionality for a broad range of popular models, including Llama3 and Vicuna. It employs methodologies such as full parameter tuning, LoRA, and QLoRA for efficient resource usage, catering to users with limited computing power. Its user-friendly approach allows for straightforward model training with optimized configurations to minimize memory and time consumption. Discover open-source model weights and benefit from proven methods, achieving notable improvements in the Open LLM Leaderboard.
Logo of ADeus
ADeus
Discover an innovative open-source AI wearable device designed to capture and store personal interactions, ensuring users retain complete data ownership. This device securely transcribes and stores spoken words and auditory interactions on personal servers. Users can enhance their communication experience with a context-aware personal AI through a dedicated app. Built with a user-friendly mobile/web interface, robust hardware, and backed by Supabase, it guarantees data privacy while seamlessly fitting into everyday life. Engage with the community for real-time development insights and support.
Logo of keras
keras
Keras 3 is an adaptable deep learning framework compatible with JAX, TensorFlow, and PyTorch. It enhances model building across fields like computer vision and NLP with superior flexibility and performance. This framework ensures effortless scaling from personal setups to datacenter environments. Users benefit from easy installation via pip and improved GPU performance through CUDA integration. Keras 3 supports existing tf.keras code and extends custom components, leveraging each backend's distinct capabilities for robust machine learning solutions.
Logo of vercel
vercel
Vercel provides a comprehensive Frontend Cloud solution that enhances developer experience and offers the infrastructure needed to effortlessly build, scale, and secure web applications. Enjoy seamless deployment through simple 'git push' commands, backed by a detailed guide on using the Vercel CLI. The platform supports local development with pnpm and encourages smooth contribution processes through GitHub discussions and pull requests. Developers can efficiently manage dependencies within a monorepo setup and thoroughly test their changes using unit and integration tests. Extensive documentation ensures successful deployment and troubleshooting while maintaining code integrity.
Logo of stable-diffusion-webui
stable-diffusion-webui
A web interface using Gradio offers features for image creation using Stable Diffusion, including txt2img, img2img, and more. Supports advanced neural networks and customization for creative tasks.
Logo of AutoGPT
AutoGPT
Explore a versatile platform designed for the creation, deployment, and management of AI agents to streamline complex workflows. Compatible with both self-hosted environments and a cloud-based beta system, this platform allows users to design bespoke agents or select from numerous pre-configured options. It features a user-friendly interface for agent interaction, comprehensive workflow management tools, and a reliable server infrastructure, all ensuring scalable AI automation. Discover unique features such as an agent builder, a marketplace for pre-configured agents, and use cases like video content production and social media management handled by AI-driven agents.
Logo of supabase
supabase
Discover a powerful open source alternative to Firebase, providing essential tools for developers. Features include a Hosted Postgres Database, authentication, real-time updates, and AI integration. Offers well-documented REST and GraphQL APIs and a modular library system for both hosted and self-hosted environments.
Logo of sunfish
sunfish
Sunfish is a Python-based chess engine recognized for simplicity and efficiency, with a Lichess rating above 2000. Its minimalist 131-line codebase makes it ideal for experimenting with chess algorithms. It supports terminal and GUI execution and includes advanced features like NNUE. Although compact and efficient, it doesn't support the 50 moves draw rule. Users can explore its potential by code modifications and using the PyPy JIT interpreter for enhanced performance.
Logo of CLIP
CLIP
CLIP employs contrastive language-image pre-training to achieve zero-shot prediction, matching performance with labeled data models. By integrating with PyTorch and TorchVision, CLIP facilitates diverse tasks like CIFAR-100 predictions and linear-probe evaluations through its image and text encoding capabilities.
Logo of trax
trax
Explore Trax, the deep learning library prioritizing code clarity and speed. Maintained by Google Brain, it features pre-trained models like Transformers and welcomes community contributions. Trax supports diverse environments from Python scripts to shell, operates on CPUs, GPUs, and TPUs, and integrates TensorFlow Datasets for data handling. It simplifies model training with functional pipelines, providing accessible high-performance deep learning solutions.
Logo of knowledge
knowledge
Knowledge enables effective data management and interaction, utilizing cutting-edge Large Language Models for interactive learning. Its integrated Chat feature transforms data engagement, while the built-in Chromium browser facilitates easy content summarization and note creation. This tool serves as a versatile resource for both complex topic exploration and everyday browsing needs.
Logo of yt-channels-DS-AI-ML-CS
yt-channels-DS-AI-ML-CS
Explore a comprehensive compilation of over 180 YouTube channels providing insights into data science, machine learning, AI, programming, and software engineering. This collection serves as a valuable resource for students, professionals, and hobbyists seeking to broaden their knowledge with content ranging from tutorials to podcast discussions. Regular updates and community input ensure the list's accuracy and relevance, appealing to those interested in data engineering, statistics, web development, and cybersecurity. Discover specialized content in programming languages such as Python, R, and C++ to enhance your learning experience.
Logo of PhoGPT
PhoGPT
PhoGPT is an open-source generative model designed for Vietnamese, comprising a base model and a chat-focused variant with 4 billion parameters. Trained on a vast dataset and fine-tuned with comprehensive conversational data, PhoGPT demonstrates improved performance for language tasks. Access technical details and guidelines for implementation. Note limitations in complex reasoning and safe question handling.
Logo of tidb
tidb
TiDB is an open-source SQL database designed for high availability and scalability, leveraging cloud-native solutions. It supports horizontal scaling through independent compute and storage operations and integrates smoothly with Kubernetes. The database ensures data integrity with ACID compliance via distributed transactions and MySQL compatibility for straightforward migrations without code overhauls. Its HTAP features use both row and columnar storage to maximize performance, accompanied by community-driven innovation.
Logo of alan-sdk-web
alan-sdk-web
Alan AI provides a comprehensive solution for integrating Generative AI Agents into web applications, minimizing UI changes and enhancing workflow efficiency. The platform includes tools like Alan AI Studio for dialog management and analytics, lightweight SDKs for easy integration, and a supportive cloud environment. Its serverless architecture optimizes infrastructure use, facilitating quick updates and management. Alan AI supports multiple frameworks, including React, Angular, and Vue, ensuring adaptable implementation for diverse app requirements.
Logo of deep-learning-for-image-processing
deep-learning-for-image-processing
Explore a tutorial focused on applying deep learning in image processing, without overstatements or promotional language. The course targets learners at all levels, offering video sessions on constructing and training networks using PyTorch and TensorFlow. Gain insights into models like LeNet, AlexNet, ResNet, and their application across tasks such as classification, detection, and segmentation. Detailed navigation includes network explanations and coding examples, with resources like downloadable PPTs for an efficient learning path.
Logo of ClickHouse
ClickHouse
Delve into a column-oriented database management system renowned for its real-time data processing. With effortless installation across Linux, macOS, and FreeBSD, access extensive resources like tutorials and documentation, backed by a robust community. Engage in monthly calls and global meetups to discuss updates and gain insights. Stay informed on the latest with expert talks, and contribute to a pioneering team in analytics.
Logo of gpt-pilot
gpt-pilot
Discover the capabilities of AI in code generation with GPT Pilot, where LLMs can produce nearly complete production-ready applications, complemented by developer adjustments. Seamlessly integrate with platforms like Docker and PostgreSQL. Engage with the Discord community and access insightful blogs. Compatible with Python 3.9+, utilizing advanced AI like OpenAI, this tool ensures efficient workflows via CLI and Docker. Streamline your development process with AI-driven solutions.
Logo of AppFlowy
AppFlowy
AppFlowy is an open-source Notion alternative focused on data security and AI tools. It supports macOS, Windows, Linux, iOS, and Android, offering customizable tools for individuals and enterprises to create secure, cross-platform environments. Join the community driving workspace innovation.
Logo of AIF360
AIF360
AI Fairness 360 is an open-source toolkit providing tools to detect, explain, and mitigate biases in machine learning models throughout their lifecycle. It includes metrics and algorithms available in Python and R, supporting fields like finance, healthcare, and education. The platform offers interactive experiences, tutorials, and an API for user guidance, and welcomes contributions to expand its capabilities. Detailed documentation ensures ease of use across various systems for effective bias management.
Logo of labelImg
labelImg
LabelImg, an open source image annotation tool, is designed in Python with a Qt interface. It supports PASCAL VOC, YOLO, and CreateML formats, aiding machine learning through efficient image labeling. Despite LabelImg's development integration with Label Studio, it remains a fundamental tool for image annotation, available on PyPI for Python 3+. It includes features for annotation visualization, class pre-definition, and hotkey usage, and supports Docker deployments. Compatible with multiple platforms, it requires minimal setup with Python and PyQt.
Logo of vosk-api
vosk-api
Vosk is an open source speech recognition toolkit offering offline capabilities in over 20 languages. It is suitable for applications like chatbots, smart devices, and transcription services. The toolkit features compact models for efficient, zero-latency performance and supports multiple programming languages and platforms, ranging from Raspberry Pi to large clusters, making it versatile for various speech-driven tasks.
Logo of pg_vectorize
pg_vectorize
The Postgres extension simplifies the text-to-embedding transformation process and integrates seamlessly with vector search and LLM applications. Utilizing popular LLM capabilities, it supports both vector similarity search and RAG workflows, allowing efficient embedding updates with minimal effort. The extension works with OpenAI and Hugging Face for embedding and text-to-vector transitions, making it a suitable choice for those wanting a streamlined solution to leverage vector databases in current Postgres setups.
Logo of DALLE-pytorch
DALLE-pytorch
This project offers an implementation of OpenAI's DALL-E in Pytorch, providing text-to-image transformation capabilities with options for scalability and customization, including the use of pretrained VAE models and adjustable attention mechanisms. It includes CLIP integration for image generation ranking and supports training protocols like reversible networks and sparse attention.
Logo of RWKV-LM
RWKV-LM
Leveraging a unique attention-free architecture, RWKV combines the strengths of RNNs and Transformers to deliver exceptional language model performance. It supports rapid inference, low VRAM usage, and efficient training. RWKV's parallelization capabilities facilitate GPT-style computation, making it adaptable for various AI applications such as text generation and image processing. This model is compatible with edge devices, ensuring resource efficiency and offering diverse training and fine-tuning options for tailored outputs across different data scales.
Logo of civitai
civitai
Explore a platform that drives AI model sharing and collaboration. Users can upload and browse diverse AI-generated models, fostering a community of learning and enhancement. Engage in community-driven improvements and gain insights from AI enthusiasts.
Logo of MLE-Flashcards
MLE-Flashcards
Access over 200 flashcards covering machine learning, computer vision, and deep learning essentials, designed to support interview preparation for leading tech firms. Created from academic exercises, these slides serve both seasoned professionals and newcomers. View the latest presentations for animated content. Perfect for reviewing foundational ML knowledge or gaining an overview with additional resources. Contribute feedback on GitHub to help improve this evolving tool.
Logo of Paddle
Paddle
PaddlePaddle is a pioneering deep learning platform from China, supporting over 10.7 million developers and 235,000 companies. It offers sophisticated features and support for industries such as manufacturing and agriculture, with a portfolio of over 860,000 models to drive AI commercialization. Known for its advances in large-scale training and high-performance inference, PaddlePaddle ensures third-party framework compatibility and adaptability across different deployment scenarios, including Cloud, Mobile, and IoT. The platform is well-suited for both beginners and expert developers, providing extensive documentation, courses, and community resources to facilitate AI integration and project execution.