Open Source Today (2024-09-10): DeepSeek-Coder-V2 Released, Now Supports 338 Languages and 128K Context

Explore top AI open-source projects: DeepSeek-Coder-V2, CodeFuse-muAgent, Anything-LLM, Flux Gym, KubeAI, and Tomato. Boost your coding and AI solutions today!

Sep 10, 2024

Here are some interesting AI open-source models and frameworks I wanted to share today:

Project: DeepSeek-Coder-V2-Instruct-0724

DeepSeek released their DeepSeek-Coder-V2-Instruct-0724, which ranks 2nd on the Aider LLM leaderboard, surpassing DeepSeek V2.5.

DeepSeek-Coder-V2 is an open-source Mixture-of-Experts (MoE) code model. It performs on par with GPT4-Turbo on code-specific tasks.

This model is further pre-trained from DeepSeek-V2’s intermediate checkpoint with 6 trillion more tokens. It boosts coding and math reasoning skills while maintaining solid performance in general language tasks.

Supported programming languages expanded from 86 to 338, and the context length increased from 16K to 128K.

https://huggingface.co/deepseek-ai/DeepSeek-Coder-V2-Instruct-0724

Project: CodeFuse-muAgent

CodeFuse-muAgent is an innovative agent framework powered by a knowledge graph engine. It combines multi-agent systems, function calls, and code interpreters.

With drag-and-drop features and simple text, large language models can carry out complex procedures under human guidance.

This framework has been proven in various complex DevOps scenarios at Ant Group.

https://github.com/codefuse-ai/CodeFuse-muAgent

Project: Anything-LLM

AnythingLLM is a full-stack app that lets users turn any document, resource, or content into context for any large language model (LLM).

It supports multiple LLMs or vector databases, multi-user management, and permission settings.

Users can run it locally or host it remotely to have smart conversations with any provided document.

https://github.com/Mintplex-Labs/anything-llm

Project: Flux Gym

Flux Gym is a simple Web UI for training FLUX LoRA in low-memory environments (12GB/16GB/20GB).

It combines AI-Toolkit's Gradio UI for the frontend with Kohya Scripts on the backend, offering a flexible and easy-to-use interface for LoRA training.

https://github.com/cocktailpeanut/fluxgym

Project: KubeAI

KubeAI is a private OpenAI alternative running on Kubernetes. It supports various open-source models (like LLMs, Whisper) and offers an OpenAI-compatible API.

KubeAI supports multi-platform setups (CPU, GPU) and scales automatically based on the workload. It also includes a chat interface (OpenWebUI) and message integration (Kafka, PubSub).

https://github.com/substratusai/kubeai

Project: Tomato

Tomato is a proof-of-concept steganography tool that uses Minimum Entropy Coupling (MEC) to hide encrypted messages in natural language.

By aligning the probability distribution of hidden messages with the LLM-generated cover text, it makes the embedded messages statistically indistinguishable from normal language, making them hard to detect.

https://github.com/user1342/Tomato

Today's Open Source (2024-09-09): DeepSeek-V2.5 Combines General and Coding Capabilities in an Upgrade

Meng Li

Sep 9

Today's Open Source (2024-09-09): DeepSeek-V2.5 Combines General and Coding Capabilities in an Upgrade

Here are some interesting AI open-source models and frameworks I wanted to share today:

Read full story

AI Disruption

Today's Open Source (2024-09-09): DeepSeek-V2.5 Combines General and Coding Capabilities in an Upgrade

Discussion about this post