KoreaTechToday - Korea's Leading Tech and Startup Media Platform
  • Topics
    • Naver
    • Kakao
    • Nexon
    • Netmarble
    • NCsoft
    • Samsung
    • Hyundai
    • SKT
    • LG
    • KT
    • Retail
    • Startup
    • Blockchain
    • government
  • Lists

Kakao Unveils Kanana-o: Korea’s First AI Model That Sees, Hears, and Speaks

by Siwoo Jung
PUBLISHED: May 1, 2025 UPDATED: May 5, 2025
in AI, Kakao

Image credits: Kakao

Kakao has introduced Kanana-o, South Korea’s first multimodal large language model (LLM), capable of understanding and processing text, voice, and images together. The company shared details of the model’s performance and development on its official tech blog on May 1, highlighting the model’s ability to listen, speak, and interact in a human-like way by integrating advanced voice technology.

According to Kakao, Kanana-o performs on par with global AI systems developed by OpenAI and Google. The company also unveiled Kanana-a, an audio-focused language model.

Kanana-o understands and processes text, voice, and images in a single model. Users can input queries in any combination of the three modalities, and the model generates responses in text or natural-sounding voice suited to the context. Kakao achieved this integration by merging two of its existing models, Kanana-v, which specializes in image processing, and Kanana-a, which focuses on audio understanding and generation, using a technique it calls “Model Merge.”

Kanana-o was developed with a large-scale Korean dataset so that it accurately reflects the distinctive features of the Korean language, including its tense variations, intonation, and unique speech structure. The model can also interpret regional dialects, such as those of the Gyeongsang and Jeju regions, converting them into standard Korean and producing natural, fluent speech. This allows it to handle nuanced Korean communication and regional linguistic differences.

One of the standout features of Kanana-o is speech emotion recognition, which lets the model analyze nonverbal signals such as intonation, voice trembling, and speech patterns. This helps the system interpret user emotions and generate responses that are contextually relevant and emotionally appropriate, bringing the model closer to human communication and making interactions with the AI more natural and empathetic.

In performance tests, Kanana-o performed comparably to global models such as OpenAI’s GPT-4 and Google’s Gemini 1.5 Pro, especially on Korean-language tasks. The model excelled in emotion recognition, outperforming its international counterparts in both Korean and English. This result points to Kanana-o’s potential to advance AI communication by recognizing and responding to human emotions, a significant step in the evolution of multimodal AI models.

Kakao aims to further enhance Kanana-o’s capabilities by refining its speech synthesis technology, with plans to develop a Korean voice tokenizer for continued performance improvement. The company’s goal is to establish a strong competitive presence in the AI industry by building on its unique multimodal technology, while sharing its research and contributing to the development of Korea’s AI ecosystem.

Kakao plans to advance Kanana-o by focusing on three key areas: enabling multi-turn conversations, enhancing the model’s ability to handle simultaneous two-way data, and improving safety features to prevent inappropriate outputs. These efforts aim to deliver more natural and fluid interactions in voice-based conversation environments, bringing AI communication closer to real-life dialogue.

Kim Byung-hak, head of performance at Kakao Kanana, emphasized that the Kanana model is evolving beyond text processing into an AI that sees, hears, speaks, and empathizes like a human. He added that Kakao will continue to build on its proprietary multimodal technology to strengthen its AI competitiveness while actively contributing to the growth of South Korea’s AI ecosystem through open research and development.

 

Tags: AI, Kakao, LLM

Copyright © 2024 KoreaTechToday | About Us | Terms of Use |Privacy Policy |Cookie Policy| Contact : [email protected] |
