
July 02, 2024

iFLYTRANS: Enabling Barrier-Free Communication at ITTC 2024

The 2024 Beijing International Television Technology Conference (ITTC 2024), hosted by the China Society of Motion Picture and Television Engineers, was held in Beijing. More than 500 representatives from the National Radio and Television Administration, China Media Group, radio and television stations across the country, academia, and innovative technology enterprises gathered to discuss upgrade paths for the audio-visual industry and to jointly promote the development and application of "ultra-high-definition, mobile and intelligent" technologies.

 


 

Jiang Wenbo, member of the China Media Group's editorial board, and Miao Bo, deputy director of the Science and Technology Department of the National Radio and Television Administration, attended the opening ceremony and delivered speeches. Ahmed Nadim, Secretary-General of the Asia-Pacific Broadcasting Union, delivered a speech via video. As a member of the China Society of Motion Picture and Television Engineers, Deepting provided real-time transcription and translation technology support for the conference, facilitating barrier-free communication at the event.

 

At the event, Xu Jin, Director of the China Media Group's Technology Bureau and Chairman of the China Society of Motion Picture and Television Engineers, announced the official establishment of the Artificial Intelligence Generated Content (AIGC) Media Application Standards Alliance and released the Artificial Intelligence Generated Content (AIGC) Media Application Standards Alliance Consensus. The alliance is led by the China Society of Motion Picture and Television Engineers and jointly formed by the China Media Group, radio and television stations from 32 provinces (municipalities) across the country, and other key radio and television institutions. It aims to promote the joint formulation of AIGC application standards in the media industry and to make AIGC technology an important driver of new quality productive forces in the audio-visual industry. In the future, the alliance will also cooperate with government agencies, technology enterprises, research institutes and other stakeholders to explore the formulation of widely agreed ethical guidelines, review mechanisms and industry standards for artificial intelligence in media, enhance the credibility of AIGC, and jointly build a safe, credible and controllable artificial intelligence media application ecosystem.

 


 

 

01 iFLYTRANS

Facilitating Barrier-Free Communication at Conferences

 

iFLYTRANS is a product for large-scale, high-end conferences, press conferences and exhibitions. It is available both as integrated hardware-and-software simultaneous interpretation equipment and as a SaaS client. Based on iFLYTEK's core technologies in speech transcription, machine translation and speech synthesis, it performs real-time speech recognition, including recognition of mixed Chinese and English speech, as well as translation of Chinese speech into English, French, German, Japanese, Korean, Spanish and Arabic.


In addition, iFLYTRANS can record meeting content and generate subtitles in real time, supporting real-time transcription of Chinese and other languages, as well as translation from Chinese into other languages. The product provides a subtitle strip mode and a multilingual full-screen mode to meet the needs of different types of meetings. Participants can scan a QR code or wear simultaneous interpretation headphones to read the multilingual subtitles and listen to the multilingual voice broadcast at any time and from any place.

 

To date, iFLYTRANS has been used in more than 50 countries and regions around the world, supporting more than 400,000 conferences and reaching more than 400 million viewers.



 

02 iFLYTRANS Powered by SPARK Large Model

Intelligent Language Differentiation and 20% Translation Improvement

 

iFLYTRANS will soon be equipped with the SPARK Large Model. With the SPARK Large Model, iFLYTRANS supports simultaneous interpretation between Chinese and 13 languages, as well as between English and a number of less widely spoken languages. It can also identify the language being spoken automatically, without manual selection. The accuracy rate for mainstream language recognition has improved by an average of 20%, greatly enhancing the efficiency of international communication.



 

03 iFLY A/V Translation

AIGC Empowers International Communication

 

As globalization advances, cross-border communication is becoming increasingly frequent, yet language barriers remain a major obstacle. To address this problem, iFLYTEK has launched iFLY A/V Translation, a product that not only removes language barriers but also provides a more vivid and natural interactive experience.

iFLY A/V Translation's multilingual subtitle creation function can generate subtitles in multiple languages in real time for videos and live broadcasts. Whether the content is a conference speech, education and training material or entertainment, viewers can choose the language that suits them and access information without barriers. Powered by machine translation technology, iFLY A/V Translation quickly and accurately translates the original language into subtitles in the target language. It supports not only mainstream international languages but also a range of less widely spoken languages, so that users around the world can communicate freely.

 


 

Multilingual Voice Dubbing

iFLY A/V Translation's multilingual voice synthesis technology converts text into fluent, natural and emotionally rich speech. Its use is not limited to voice broadcasting: it can also be applied in scenarios such as virtual assistants and intelligent customer service, providing a more human interactive experience.

 

Small-Sample Voice Cloning

With iFLY A/V Translation's small-sample voice cloning technology, users can clone their voice from just a few voice samples. This means that users can enjoy personalized voice assistants and can even imitate native speakers' pronunciation when learning a foreign language, improving learning efficiency.

iFLY A/V Translation also accurately separates the voices of different speakers in a multi-person conversation, enabling accurate speech recognition and transcription.

Using lip-driven technology, the synthesized voice and facial expressions can be adjusted in sync with the speaker's lip movements, making the virtual image more realistic. This technology is of great significance in fields such as virtual reality and animation production.

 


 

The two-day conference revolved around the theme of "Vibrant Vision; Smart Horizons" and explored upgrade paths for the audio-visual industry through speeches, exchanges, interactive experiences and business visits. It focused on the development and application of artificial intelligence and ultra-high-definition technology in full-media production and broadcasting, with the aim of promoting collaborative innovation across the industry and jointly building a new landscape of intelligent full-media. Deepting will continue to improve its technical capabilities, seize development opportunities, promote the deep integration of media, and innovate content production and dissemination methods to support high-quality development in the broadcasting and television industry.