KoreaTechToday - Korea's Leading Tech and Startup Media Platform

Kakao Unveils Kanana-o: Korea’s First AI Model That Sees, Hears, and Speaks

by Siwoo Jung
PUBLISHED: May 1, 2025 UPDATED: May 5, 2025
in AI, Kakao

Image credits: Kakao


Kakao has introduced Kanana-o, South Korea’s first multimodal large language model (LLM), capable of understanding and processing text, voice, and images together. The company shared details of the model’s performance and development on its official tech blog on May 1, highlighting its ability to listen, speak, and interact in a human-like way by integrating advanced voice technology.

According to Kakao, Kanana-o performs on par with global AI systems developed by OpenAI and Google. The company also unveiled Kanana-a, an audio-focused language model.

Kanana-o understands and processes text, voice, and images in an integrated way. Users can enter queries in any combination of the three modalities, and the model responds in text or natural-sounding voice suited to the context. Kakao achieved this integration by merging two existing models, Kanana-v, which specializes in image processing, and Kanana-a, which focuses on audio understanding and generation, using a technique it calls “Model Merge.”
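
The article does not say how Kakao's “Model Merge” works internally. As a rough illustration only, one widely used form of model merging is linear interpolation of the weights of two models that share the same architecture. The sketch below assumes that setup; the function name `merge_models`, the parameter names, and the toy values are hypothetical and are not taken from Kakao.

```python
from typing import Dict, List

# A toy "model": parameter name -> flat list of weights.
Weights = Dict[str, List[float]]


def merge_models(a: Weights, b: Weights, alpha: float = 0.5) -> Weights:
    """Linearly interpolate two parameter sets with identical layouts.

    alpha = 1.0 returns model a, alpha = 0.0 returns model b.
    This is a generic weight-averaging sketch, not Kakao's method.
    """
    if not 0.0 <= alpha <= 1.0:
        raise ValueError("alpha must be in [0, 1]")
    if a.keys() != b.keys():
        raise ValueError("models must have identical parameter names")

    merged: Weights = {}
    for name in a:
        if len(a[name]) != len(b[name]):
            raise ValueError(f"shape mismatch for {name!r}")
        merged[name] = [
            alpha * x + (1.0 - alpha) * y for x, y in zip(a[name], b[name])
        ]
    return merged


# Hypothetical stand-ins for an image-focused and an audio-focused model.
vision_like = {"layer0.weight": [1.0, 2.0], "layer0.bias": [0.0]}
audio_like = {"layer0.weight": [3.0, 4.0], "layer0.bias": [1.0]}

merged = merge_models(vision_like, audio_like, alpha=0.5)
print(merged)
```

In practice, merging only makes sense when both models derive from a common base so that their parameters line up; the equal weighting used here is an arbitrary choice for the example.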

Kanana-o was developed using a large-scale Korean dataset so that it accurately reflects distinctive features of the Korean language, including its tense variations, intonation, and unique speech structure. The model can also interpret regional dialects, such as those of the Gyeongsang and Jeju regions, converting them into standard Korean and producing natural, fluent speech. This allows the model to handle nuanced Korean communication and regional linguistic differences effectively.

One of the standout features of Kanana-o is its use of speech emotion recognition technology, which enables the model to analyze nonverbal signals such as intonation, voice trembling, and speech patterns accurately. This helps the system understand and interpret user emotions, allowing it to generate responses that are contextually relevant and emotionally appropriate. This capability brings the model closer to mimicking human communication, making interactions with the AI more natural and empathetic.

In performance tests, Kanana-o performed at a level comparable to global models such as OpenAI’s GPT-4 and Google’s Gemini 1.5 Pro, especially on Korean-language tasks. The model excelled in emotion recognition, outperforming its international counterparts in both Korean and English. This result points to the potential of Kanana-o to advance AI communication by recognizing and responding to human emotions, a significant step in the evolution of multimodal AI models.

Kakao aims to further enhance Kanana-o’s capabilities by refining its speech synthesis technology, with plans to develop a Korean voice tokenizer for continued performance improvement. The company’s goal is to establish a strong competitive presence in the AI industry, building on its unique multimodal technology while sharing its research and contributing to the development of Korea’s AI ecosystem.

Kakao plans to advance Kanana-o by focusing on three key areas: enabling multi-turn conversations, enhancing the model’s ability to handle simultaneous two-way data, and improving safety features to prevent inappropriate outputs. These efforts aim to deliver more natural and fluid interactions in voice-based conversation environments, bringing AI communication closer to real-life dialogue.

Kim Byung-hak, head of performance at Kakao Kanana, emphasized that the Kanana model is evolving into an AI that goes beyond processing text to see, hear, speak, and empathize like a human. He added that Kakao will continue to build on its proprietary multimodal technology to strengthen its AI competitiveness while actively contributing to the growth of South Korea’s AI ecosystem through open research and development.

 

Tags: AI, Kakao, LLM
