#model training

Logo of djl
djl
Deep Java Library (DJL) is a high-level, open-source Java framework that facilitates deep learning integration without requiring machine learning expertise. It supports Java developers to seamlessly incorporate deep learning into applications using familiar Java IDEs. DJL's engine-agnostic feature offers flexibility in computational engine choice, and its ergonomic APIs promote best practices in model training and inference. It also provides automatic hardware optimization and extensive documentation for enhanced development and deployment.
Logo of pytorch-sentiment-neuron
pytorch-sentiment-neuron
This open-source project leverages PyTorch, CUDA, and Python 3.5 for sentiment analysis by generating and analyzing sentiments in reviews. It supports model implementation, visualization, and retraining with adjustable parameters including sequence length, batch size, and RNN setup, providing a flexible framework for developers to explore sentiment analysis.
Logo of ACG2vec
ACG2vec
Discover a robust suite of open-source deep learning solutions designed specifically for anime, comics, and games. Key models include acgvoc2vec for text representation, dclip for image analysis, and pix2score for evaluating anime artwork. Accessible online demos provide real-world applications like image retrieval and rating systems. Simplify deployment through Docker while benefiting from integrations with TensorFlow, Milvus, and other advanced technologies.
Logo of SummerTTS
SummerTTS
SummerTTS is a self-contained text-to-speech tool that performs offline voice synthesis for both Chinese and English. By leveraging Eigen for neural network tasks, it operates without relying on frameworks such as PyTorch or TensorFlow. While primarily tested on Ubuntu, it is expected to function on similar systems like Linux-based platforms. The VITS algorithm ensures effective speech processing, and latest updates have enhanced the speed for English single-speaker tasks. Downloadable models support various voice configurations, focusing on ease of use and maintaining good audio quality.
Logo of Digital-Life-DL-B
Digital-Life-DL-B
Discover the digital avatar creation framework utilizing ChatGLM, Wav2lip, and So-VITS-SVC technologies. This open-source initiative, finalized in March 2023, provides instructions for setting up Python environments, installing ffmpeg, and training models, including ChatGLM and So-VITS-svc. Detailed instructions on integrating Wav2lip are also provided. Anticipate updates for enhanced usability by AI学社 post the ongoing competition. The project abides by the GNU GPLv3 licensing terms, with specific compliance needed for each model's license.
Logo of repeng
repeng
Repeng is a Python library designed for swift control vector generation through representation engineering, offering vector training in less than a minute. The library allows for creating datasets with paired statements, utilizing models such as Mistral-7B for creative content development. Key features include rapid vector training, adjustable control strength during inference, and flexible integration. The project includes comprehensive examples in its notebooks and supports export for quantized usage with llama.cpp. Currently, it does not support MoE models. For more details, visit the project's blog post and CHANGELOG.
Logo of nitrain
nitrain
Nitrain is an adaptable AI framework focused on medical imaging, offering streamlined model training and data augmentation across leading platforms such as Torch, TensorFlow, and Keras. It features intuitive defaults and high-level abstractions for easier use. Access comprehensive tutorials for medical imaging AI model integration and explore advanced Python techniques for improved medical visualization, catering to AI researchers and healthcare professionals aiming to enhance diagnostic capabilities.
Logo of LyCORIS
LyCORIS
LyCORIS introduces efficient fine-tuning techniques for Stable Diffusion models, optimizing image generation without compromising model size or training speed. Incorporating methods like LoRA, LoHa, and DyLoRA, it ensures versatile and diverse output generation. It is compatible with platforms such as sd-webui, ComfyUI, and InvokeAI, facilitating easy integration. The project is backed by detailed documentation and dynamic community interaction. Recent updates include automatic detection of quantized layers and improved functional APIs, offering adaptable solutions for users.
Logo of recommenders
recommenders
TensorFlow Recommenders is an open-source library crafted for creating recommender system models using TensorFlow. This library guides users through the complete workflow, encompassing data preparation, model training, evaluation, and deployment. Integrated with Keras, it combines an intuitive learning curve with the ability to construct complex models. Through easy pip installation and abundant resources like tutorials and API references, it allows efficient model building with datasets such as Movielens 100K. The library's advanced embedding-based capabilities enhance both user and item representation, boosting recommendation precision.
Logo of TF-ID
TF-ID
Discover TF-ID's suite of models designed to identify tables and figures in academic papers, featuring MIT-licensed resources. Choose between models optimized for text captions or streamlined extraction, leveraging Microsoft's Florence-2 for seamless integration. Achieve accurate results with up to 98% success rate, supported by comprehensive training and implementation guides for scholarly contexts.