Effective Algorithm for Text Processing in Business Processes Developed by Sber

GigaEmbeddings is suitable for smart search in e-commerce, creating chatbots with advanced functions, analyzing customer requests, and generating recommendations

Sber researchers have developed GigaEmbeddings, a model that improves the handling of Russian-language texts. It is based on GigaChat-3B and uses a three-stage training process: preliminary preparation, fine-tuning, and multi-task learning. The architecture is optimized, which reduced the neural network parameters by 25% without compromising quality.

Until now, businesses have lacked effective tools for analyzing texts in Russian. Existing solutions either required significant computing power or struggled with search and classification. GigaEmbeddings solves these problems. The model is suitable for smart search in e-commerce, creating chatbots with advanced functions, analyzing customer requests, and generating recommendations.

Today, we are addressing a critical market need for high-quality NLP solutions for the Russian language. Our comprehensive platform allows businesses to radically optimize all text-related processes — from basic search and recommendation algorithms to advanced RAG systems in chatbots. [...] Companies are finally getting a unified solution — they no longer need to assemble functionality piecemeal from foreign products.
Fedor Minkin, Technical Director of GigaChat Sberbank

The model is available on GitVerse and HuggingFace. Developers expect it to become the standard for the financial sector, retail, and government services.

Read more on this topic:

Platform for the safe implementation of AI in government structures and corporations developed at St. Petersburg Electrotechnical University "LETI"

Operating system from Sber — SberOS now supports the open RISC-V architecture, which expands the possibilities for import substitution

Russian software build system from "STC IT ROSA" included in the register of the Ministry of Digital Development

Now on home