Nornickel has presented MetalGPT-1, a specialized language model for the metallurgical and mining sectors. The company calls the development the first open solution of this scale in the industry.
MetalGPT-1 contains 32 billion parameters and is trained on 10 GB of professional materials, which is comparable to half the volume of the English-language Wikipedia.
Nornickel calls the unique quality of the training corpus the key advantage of the model: it includes more than 1 million documents that are not publicly available. Among them are technological regulations, internal instructions of enterprises, design and construction documentation, patents, R&D reports, and specialized scientific and technical literature. All data has undergone multi-stage cleaning and mandatory anonymization.
In addition, the developers have created about 500 thousand "question-answer" pairs and instructive examples that reflect the real tasks of production and scientific research.
MetalGPT-1 opens a line of industry LLMs with open source code, on which Nornickel plans to develop its own ecosystem of industrial AI solutions.
Read more materials on the topic:
Now on home
Субмарины проекта 636.3 считают самыми тихими в мире
Liners received multi-level protection
The production operates on a full cycle — from «primary chemistry» to battery production
The new 59095С-15 provide the crew with protection against small arms and landmines under the wheels
Тяга двигателя — 1,36 кгс
If the order is successfully completed, the commercial transportation sector may also open up
Statistical reliability of units reached 99.9%