KoreaTechToday - Korea's Leading Tech and Startup Media Platform
  • Topics
    • Naver
    • Kakao
    • Nexon
    • Netmarble
    • NCsoft
    • Samsung
    • Hyundai
    • SKT
    • LG
    • KT
    • Retail
    • Startup
    • Blockchain
    • government
  • Lists
KoreaTechToday - Korea's Leading Tech and Startup Media Platform
  • Topics
    • Naver
    • Kakao
    • Nexon
    • Netmarble
    • NCsoft
    • Samsung
    • Hyundai
    • SKT
    • LG
    • KT
    • Retail
    • Startup
    • Blockchain
    • government
  • Lists
KoreaTechToday - Korea's Leading Tech and Startup Media Platform
No Result
View All Result
Home AI

Kakao Unveils Kanana-o: Korea’s First AI Model That Sees, Hears, and Speaks

Siwoo Jung by Siwoo Jung
PUBLISHED: May 1, 2025 UPDATED: May 5, 2025
in AI, Kakao
0
Kakao Unveils Kanana-o: Korea’s First AI Model That Sees, Hears, and Speaks

Image credits: Kakao

Kakao introduced Kanana-o, South Korea’s first multimodal large language model (LLM), capable of understanding and processing text, voice, and images together. The company shared details of the model’s performance and development through its official tech blog on the 1st, highlighting its ability to listen, speak, and interact human-likely by integrating advanced voice technology.

According to Kakao, Kanana-o demonstrates a competitive edge on par with global AI systems developed by OpenAI and Google. The company also unveiled Kanana-a, an audio-focused language model. 

Kanana-o, an artificial intelligence model, integrates the understanding and processing of text, voice, and images simultaneously. This multimodal system allows users to input queries in any combination of the three forms of communication, with the model generating responses in text or natural voice tailored to the context. Kakao achieved this integration by merging its existing models, Kana-v, which specializes in image processing, and Kana-a, which focuses on audio understanding and generation, using a technology called “Model Merge.”

The development of Kanana-o leverages a large-scale Korean dataset to accurately reflect the distinctive features of the Korean language, including its tense variations, intonation and unique speech structure. The model also has the ability to interpret regional dialects, such as those from Gyeongsang and Jeju provinces, converting them into standard Korean and producing natural, fluent speech. This ensures the model can effectively handle nuanced Korean communication and regional linguistic differences.

One of the standout features of Kanana-o is its use of speech emotion recognition technology, which enables the model to analyze nonverbal signals such as intonation, voice trembling, and speech patterns accurately. This helps the system understand and interpret user emotions, allowing it to generate responses that are contextually relevant and emotionally appropriate. This capability brings the model closer to mimicking human communication, making interactions with the AI more natural and empathetic.

In performance tests, Kanana-o showed a high level of competence, comparable to global models like OpenAI’s GPT-4 and Google’s Gemini 1.5 Pro, especially in tasks related to the Korean language. The model excelled in emotional recognition, outperforming its international counterparts in both Korean and English. This achievement indicates the potential of Kanana-o to revolutionize AI communication by recognizing and responding to human emotions, a significant step in the evolution of multimodal AI models.

Kakao aims to further enhance Kanana-o’s capabilities by refining its speech synthesis technology, with plans for the development of a Korean voice tokenizer for continuous performance improvement. The company’s goal is to establish a strong competitive presence in the AI industry, building on its unique multimodal technology. By sharing its research and contributing to the development of Korea’s AI ecosystem. 

Kakao plans to advance Kanana-o by focusing on three key areas: enabling multi-turn conversations, enhancing the model’s ability to handle simultaneous two-way data, and improving safety features to prevent inappropriate outputs. These efforts aim to deliver more natural and fluid interactions in voice-based conversation environments, bringing AI communication closer to real-life dialogue.

Kim Byung-hak, head of performance at Kakao Kanana, emphasized that the Kana model is evolving into an AI that processes text and sees, hears, speaks, and empathizes like a human. He added that Kakao will continue to build on its proprietary multimodal technology to strengthen its AI competitiveness while actively contributing to the growth of South Korea’s AI ecosystem through open research and development.

 

Tags: AIKakaoLLM

Related Posts

Kakao Integrates ChatGPT Into KakaoTalk, Redefining Everyday Messaging With AI
Kakao

Kakao Integrates ChatGPT Into KakaoTalk, Redefining Everyday Messaging With AI

October 29, 2025
Samsung and OpenAI Forge Strategic Partnership to Power Global AI Infrastructure
AI

Samsung and OpenAI Forge Strategic Partnership to Power Global AI Infrastructure

October 30, 2025
Samsung and SoftBank Join Forces to Advance AI-RAN and 6G Research
AI

Samsung and SoftBank Join Forces to Advance AI-RAN and 6G Research

October 28, 2025
Samsung’s XR Headset: A Strategic Leap Into Spatial Computing
AI

Samsung’s XR Headset: A Strategic Leap Into Spatial Computing

October 16, 2025
What SK Group’s ‘AI Now & Next’ Summit Reveals About the Future of Intelligent Korea
AI

What SK Group’s ‘AI Now & Next’ Summit Reveals About the Future of Intelligent Korea

October 14, 2025
Inside Stargate: How Samsung and SK Are Powering OpenAI’s Global AI Ambitions
AI

Inside Stargate: How Samsung and SK Are Powering OpenAI’s Global AI Ambitions

October 7, 2025
No Result
View All Result

Most Popular

  • Ride-Hailing Rivalry: Kakao and Uber Bet on Membership Services in Korea

    0 shares
    Share 0 Tweet 0
  • Kakao Mobility Faces $10.5 Million Fine for Limiting Competitors’ Access to Taxi Platform

    0 shares
    Share 0 Tweet 0
  • Korea’s Navigation Battle Heats Up: Naver and Kakao vs. Google maps

    0 shares
    Share 0 Tweet 0
  • 5 Best Korean to English Translation Apps

    0 shares
    Share 0 Tweet 0
  • Naver Maps Launches Guide in English, Chinese, and Japanese to Enhance Travel Experience for Tourists

    0 shares
    Share 0 Tweet 0
  • Naver Unveils Asia’s Largest Data Center, GAK Sejong, for Tech Innovation

    0 shares
    Share 0 Tweet 0
  • Naver Video Streaming Service V Live to Go Global

    0 shares
    Share 0 Tweet 0
  • What SK Group’s ‘AI Now & Next’ Summit Reveals About the Future of Intelligent Korea

    0 shares
    Share 0 Tweet 0
  • South Korea’s $2.26 Billion Vision: A Robotic Revolution by 2030

    0 shares
    Share 0 Tweet 0
  • Hanwha Aerospace to Develop Indigenous Turboprop Engine for South Korea’s Next-Gen UAVs

    0 shares
    Share 0 Tweet 0

PRODUCTS

[ads_amazon]

TOPICS

  • Naver
  • Kakao
  • Nexon
  • Netmarble
  • NCsoft
  • Samsung
  • Hyundai

FREE NEWSLETTER

FOLLOW US

  • About Us
  • Cookie policy
  • home
  • homepage
  • mainhome
  • Our Services
  • Privacy Policy
  • Terms of Use

Copyright © 2024 KoreaTechToday | About Us | Terms of Use |Privacy Policy |Cookie Policy| Contact : [email protected] |

No Result
View All Result
  • Topics
    • Naver
    • Kakao
    • Nexon
    • Netmarble
    • NCsoft
    • Samsung
    • Hyundai
    • SKT
    • LG
    • KT
    • Retail
    • Startup
    • Blockchain
    • government
  • Lists

Copyright © 2024 KoreaTechToday | About Us | Terms of Use |Privacy Policy |Cookie Policy| Contact : [email protected] |