KoreaTechToday - Korea's Leading Tech and Startup Media Platform

Kakao Unveils Kanana-o: Korea’s First AI Model That Sees, Hears, and Speaks

by Siwoo Jung
PUBLISHED: May 1, 2025 UPDATED: May 5, 2025
in AI, Kakao

Image credits: Kakao

Kakao has introduced Kanana-o, South Korea’s first multimodal large language model (LLM), capable of understanding and processing text, voice, and images together. The company shared details of the model’s performance and development on its official tech blog on May 1, highlighting its ability to listen, speak, and interact in a human-like way by integrating advanced voice technology.

According to Kakao, Kanana-o performs on par with global AI systems developed by OpenAI and Google. The company also unveiled Kanana-a, an audio-focused language model.

Kanana-o understands and processes text, voice, and images simultaneously. Users can input queries in any combination of the three, and the model responds in text or natural-sounding voice suited to the context. Kakao achieved this integration by merging two of its existing models, Kanana-v, which specializes in image processing, and Kanana-a, which focuses on audio understanding and generation, using a technique it calls “Model Merge.”

The development of Kanana-o leverages a large-scale Korean dataset to accurately reflect the distinctive features of the Korean language, including its tense variations, intonation and unique speech structure. The model also has the ability to interpret regional dialects, such as those from Gyeongsang and Jeju provinces, converting them into standard Korean and producing natural, fluent speech. This ensures the model can effectively handle nuanced Korean communication and regional linguistic differences.

One of the standout features of Kanana-o is its speech emotion recognition technology, which enables the model to accurately analyze nonverbal signals such as intonation, voice trembling, and speech patterns. This helps the system interpret user emotions and generate responses that are contextually relevant and emotionally appropriate, bringing its interactions closer to natural, empathetic human communication.

In performance tests, Kanana-o showed a level of competence comparable to global models like OpenAI’s GPT-4 and Google’s Gemini 1.5 Pro, especially in tasks related to the Korean language. The model excelled in emotion recognition, outperforming its international counterparts in both Korean and English. This result points to the potential of Kanana-o to advance AI communication by recognizing and responding to human emotions, a significant step in the evolution of multimodal AI models.

Kakao aims to further enhance Kanana-o’s capabilities by refining its speech synthesis technology, with plans to develop a Korean voice tokenizer for continuous performance improvement. The company’s goal is to establish a strong competitive presence in the AI industry by building on its unique multimodal technology, sharing its research, and contributing to the development of Korea’s AI ecosystem.

Kakao plans to advance Kanana-o by focusing on three key areas: enabling multi-turn conversations, enhancing the model’s ability to handle simultaneous two-way data, and improving safety features to prevent inappropriate outputs. These efforts aim to deliver more natural and fluid interactions in voice-based conversation environments, bringing AI communication closer to real-life dialogue.

Kim Byung-hak, head of performance at Kakao Kanana, emphasized that the Kanana model is evolving into an AI that not only processes text but also sees, hears, speaks, and empathizes like a human. He added that Kakao will continue to build on its proprietary multimodal technology to strengthen its AI competitiveness while actively contributing to the growth of South Korea’s AI ecosystem through open research and development.

Tags: AI, Kakao, LLM

Copyright © 2024 KoreaTechToday | About Us | Terms of Use |Privacy Policy |Cookie Policy| Contact : [email protected] |
