대한민국 인공지능 여기까지, 솔트룩스

Ver 1.x

Customized language model platform essential for various AI services

The LANGUAGE STUDIO makes it easy to build domain-specific language model thanks to its user-friendliness and reduced complex coding. Create custom language models for various fields such as finance or law, for both public and private institutions.

#Super-large language model
#Large-capacity language analysis
#Intent classification
#Emotion/sentiment analysis
#Learning data construction
#Similarity analysis

What Makes LANGUAGE STUDIO So Special?

LANGUAGE STUDIO lets anyone create large language models that can be easily optimized for each domain.

Point 01
High-quality natural language

on deep learning
Point 02
Seamless domain

application
Point 03
6 specialized transfer

learning models
Point 04
Direct creation and

GUI-based language

System Configuration

Pre-training

Supermassive language model that enables language understanding

Create your own model using specialized language templates pre-trained with jargon gathered from large domain-specific learning data. Our large-scale language template models can be used for a variety of natural language processing.

Features

01High-quality natural language processing based on deep learning

High-quality natural language processing is achievable with the use of the most recent machine learning and deep learning (artificial neural network) technologies, which provide faster and greater performance than the existing techniques.
02Seamless domain application

By using dictionaries and rules that are specific to each domain in addition to common dictionaries, it offers language models according to language characteristics of various fields and allows functions to create vast volumes of learning data individually.
03Latest super-large language models available

BERT, ELECTRA, and RoBERTa are just a few of the models Saltlux offers, along with a proven model with up to 350 million parameters. Saltlux also give an ultra-large language model that has been trained with over 100G of data in order to create a domain-specific language model.

Fine-tuning/transfer learning

A language model that addresses various domain-specific needs

Through pre-learning model-based transfer learning, our technology learns various tasks (text classification, sentence embedding, named entity recognition, morpheme analysis), and evaluates quality using our template models in order to provide the optimal, best performing model for your needs.

Features

01Text classification

It categorizes text entered into a predefined class, allowing you to arrange everything from simple sentences to complex documents.
0202 Sentence embedding

With the sentence embedding method that not only understands the meaning of a sentence, it also finds similar sentences.
03Named entity recognition

It provides a user-defined object recognition model from input text, thus enabling information retrieval and object name recognition for interactive systems that requires high accuracy.
04Morpheme analysis

It provides analytic results in 'morphemes', the smallest semantic unit of phrases, allowing the restoration of adjectives and verbs to their original forms, and directly modifying the outcomes using lexicographic functions.
05Emotion, sentiment analysis

It categorizes the emotions and sentiments of text, providing a model that categorizes positive/negative sentences and the emotional state of the counterpart.
06Intent analysis

It analyzes the meaning of the sentence, classify its intent and offers a specialized model that understands user intention in conversations.

LANGUAGE STUDIO TOOL

Specialized tools for the implementation of comprehensive custom language models by domain

LANGUAGE STUDIO provides key features for text service implementation, from the development of learning models to their deployment and management.

Learning data management
Model learning
Model arrangement
Model management
Labeling tool

Tool Introduction

01Learning data management
- Provides integrated management of large volumes of pre-training data and learned data by various domains
  1. Upload/download JSON files
  2. Data pre-processing
  3. Data statistics processing
  4. Data sampling
02Model learning
- GUI-based model learning
  1. Latest super-large language models available
  2. Learning with multi-GPU
  3. Merge learning data and provide statistics
  4. Check learning status of models and loss curves
03Model arrangement and management
- Integrated management from model creation, arrangement to application
  1. Fully trained domain models optimized for domain
  2. Use a variety of cloud arrangement techniques to launch models instantaneously
  3. Web-based model management
  4. GUI-based process for model creation, learning, and evaluation
04Labeling tool (option)
- Web-based annotation
  1. Customize data for a variety of issues
  2. Support simultaneous operation for collaboration among multiple domain experts
  3. Inspect data annotation
  4. Download final delivery as JSON file

Use Case

LANGUAGE STUDIO creates new value by converging with various solutions.

Success Story

Samsung Electronics
Hanwha Group
HAdvanced Korea Press Foundation

KMS analysis and knowledge-to-data
Samsung Electronics Mosaic, a collective intelligence platform

Established a comprehensive knowledge management system for the overall technology process, network, and technology analysis trends for KMS information and opinion that are registered internally.
- Improved knowledge productivity and work efficiency through analysis and knowledge transfer of registered Samsung Electronics' KMS documents.
- 5 times more users compared to the existing system

Integrated VOC system
Hanwha Group integrated VOC analysis system

Internal and external data collecting systems for each subsidiary were built using the VOC and big data analysis system, and via the analysis of VOC, a CS management support and customer support monitoring system were established.
- To improve Hanwha Group's management of its VOC resources, a methodical collecting procedure was established through the VOC analysis system, and the VOC analysis was carried out in accordance with user demands.

News analysis system
Advanced Korea Press Foundation media content analysis system

Project to enhance the system that makes data production relatively easy by refining the platform which has vast amount of news data
- Established a foundation for systematic analysis of global news data
- Enhanced quality of language analysis through improved management tools and language resource purification
- Improved user convenience by improving web service quality

Samsung Electronics
KMS analysis and knowledge-to-data
Samsung Electronics Mosaic, a collective intelligence platform
- Improved knowledge productivity and work efficiency through analysis and knowledge transfer of registered Samsung Electronics' KMS documents.
- 5 times more users compared to the existing system
Hanwha Group
Integrated VOC system
Hanwha Group integrated VOC analysis system
- To improve Hanwha Group's management of its VOC resources, a methodical collecting procedure was established through the VOC analysis system, and the VOC analysis was carried out in accordance with user demands.
HAdvanced Korea Press Foundation
News analysis system
Advanced Korea Press Foundation media content analysis system
- Established a foundation for systematic analysis of global news data
- Enhanced quality of language analysis through improved management tools and language resource purification
- Improved user convenience by improving web service quality

Reference

Intelligent integrated search
Constitutional Court

A simple semantic based precedent retrieval method using daily terminology
New technology sensing
Samsung Electronics

Knowledge management system with extensive R&D trend signal detection
Incomplete sales
Nonghyup Card

Monitoring of outbound incomplete sales and consultation analysis report
Analysis of call center consultation topics
Nonghyup Bank

Analysis of positive/negative feedback on call center consultation
KT VOC system
KT

Customer VOC analysis, reports and insights on KT telecommunication products
VOC analysis system
Korea Expressway Corporation

Continuous monitoring system for customer complaints through atypical VOC analysis
Top 2020 Digital New Deal practices
Korean dialect AI data
National Information society Agency

15,000 hours of learning data, transcription of 2.5 million sentences of regional dialect
2020 Daily Conversation Corpus
National Institute of Korean Language

Transcription corpus by recruiting more than 2,000 speakers from each region
AI data for in-depth interviews in specialized fields
National Information society Agency

Learning data from in-depth expertise interviews covering more than 2,000 hours
Korean corpus research
National Institute of Korean Language

5.5 million separate words of spoken language in the corpus of the foundational study on contemporary language.
Large-scale Korean corpus
National Information society Agency

Online colloquial learning data based on a total of 2 billion words of web data
Spoken language corpus
National Institute of Korean Language

15,000 hours of broadcast conversations, establishing about 15.4 million raw corpus

Customized language model platform essential for various AI services

What Makes LANGUAGE STUDIO So Special?

High-quality natural language

Seamless domain

6 specialized transfer

Direct creation and

System Configuration

Core Technology

Pre-training

Supermassive language model that enables language understanding

Features

01High-quality natural language processing based on deep learning

02Seamless domain application

03Latest super-large language models available

Fine-tuning/transfer learning

A language model that addresses various domain-specific needs

Features

01Text classification

0202 Sentence embedding

03Named entity recognition

04Morpheme analysis

05Emotion, sentiment analysis

06Intent analysis

LANGUAGE STUDIO TOOL

Tool Introduction

01Learning data management

02Model learning

03Model arrangement and management

04Labeling tool (option)

Use Case

Success Story

Samsung Electronics Mosaic, a collective intelligence platform

Hanwha Group integrated VOC analysis system

Advanced Korea Press Foundation media content analysis system

Samsung Electronics Mosaic, a collective intelligence platform

Hanwha Group integrated VOC analysis system

Advanced Korea Press Foundation media content analysis system

Reference

Constitutional Court

Samsung Electronics

Nonghyup Card

Nonghyup Bank

KT

Korea Expressway Corporation

National Information society Agency

National Institute of Korean Language

National Information society Agency

National Institute of Korean Language

National Information society Agency

National Institute of Korean Language

Do you have any questions?​ Listen to SALTLUX​

Do you have any questions?
Listen to SALTLUX