KoreaTechToday - Korea's Leading Tech and Startup Media Platform

Kakao Unveils Kanana-o: Korea’s First AI Model That Sees, Hears, and Speaks

by Siwoo Jung
PUBLISHED: May 1, 2025 UPDATED: May 5, 2025
in AI, Kakao

Image credits: Kakao


Kakao has introduced Kanana-o, South Korea’s first multimodal large language model (LLM), capable of understanding and processing text, voice, and images together. The company shared details of the model’s performance and development on its official tech blog on May 1, highlighting its ability to listen, speak, and interact in a human-like way by integrating advanced voice technology.

According to Kakao, Kanana-o performs on par with global AI systems developed by OpenAI and Google. The company also unveiled Kanana-a, an audio-focused language model.

Kanana-o understands and processes text, voice, and images simultaneously. Users can input queries in any combination of the three forms, and the model generates responses in text or natural voice tailored to the context. Kakao achieved this integration by merging two of its existing models, Kanana-v, which specializes in image processing, and Kanana-a, which focuses on audio understanding and generation, using a technique called “Model Merge.”

Kanana-o was developed on a large-scale Korean dataset so that it accurately reflects the distinctive features of the Korean language, including its tense variations, intonation, and unique speech structure. The model can also interpret regional dialects, such as those of the Gyeongsang and Jeju regions, convert them into standard Korean, and produce natural, fluent speech. As a result, it can handle nuanced Korean communication and regional linguistic differences.

One of the standout features of Kanana-o is its speech emotion recognition technology, which enables the model to accurately analyze nonverbal signals such as intonation, voice trembling, and speech patterns. This helps the system interpret user emotions and generate responses that are contextually relevant and emotionally appropriate, making interactions with the AI more natural and empathetic.

In performance tests, Kanana-o performed at a level comparable to global models such as OpenAI’s GPT-4 and Google’s Gemini 1.5 Pro, especially in tasks related to the Korean language. The model excelled in emotion recognition, outperforming its international counterparts in both Korean and English. The result points to Kanana-o’s potential to advance AI communication by recognizing and responding to human emotions, a significant step in the evolution of multimodal AI models.

Kakao aims to further enhance Kanana-o’s capabilities by refining its speech synthesis technology, with plans to develop a Korean voice tokenizer for continuous performance improvement. The company’s goal is to establish a strong competitive presence in the AI industry by building on its unique multimodal technology, while sharing its research and contributing to the development of Korea’s AI ecosystem.

Kakao plans to advance Kanana-o by focusing on three key areas: enabling multi-turn conversations, enhancing the model’s ability to handle simultaneous two-way data, and improving safety features to prevent inappropriate outputs. These efforts aim to deliver more natural and fluid interactions in voice-based conversation environments, bringing AI communication closer to real-life dialogue.

Kim Byung-hak, head of performance at Kakao Kanana, emphasized that the Kanana model is evolving beyond processing text into an AI that sees, hears, speaks, and empathizes like a human. He added that Kakao will continue to build on its proprietary multimodal technology to strengthen its AI competitiveness while actively contributing to the growth of South Korea’s AI ecosystem through open research and development.

 

Tags: AI, Kakao, LLM


Copyright © 2024 KoreaTechToday | About Us | Terms of Use |Privacy Policy |Cookie Policy| Contact : [email protected] |
