Alibaba releases two new speech models
Shanghai Securities News, China Securities Network (reporter Yang Xiangfei): On March 2, Alibaba released two new speech models: Fun-CosyVoice3.5, a voice cloning model that works from reference audio, and Fun-AudioGen-VD, a voice design model that needs no reference audio. Both models support instruction following, allowing free control over emotion, speech rate, scene, and more, and both allow freestyle customization of character voices. They are suited to audiobooks, gaming, customer service, podcasts, education, live streaming, and other scenarios.
The two models achieved multiple state-of-the-art (SOTA) results on benchmarks among models of similar size. On the difficult Chinese cases of the Seed-TTS benchmark, Fun-CosyVoice3.5 led on both word error rate (WER) and speaker similarity (SSIM). In addition, pronunciation optimization for difficult cases cut the error rate on rare characters from 15.2% to 5.3%.
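For readers unfamiliar with the metric, the sketch below shows how a word error rate is commonly computed: the token-level edit distance between a reference transcript and the recognized output, divided by the reference length. It is a generic illustration, not the Seed-TTS evaluation pipeline; the function names and the sample sentences are assumptions made for this example. The last helper works out the relative size of the reported drop from 15.2% to 5.3%, which is roughly 65%.

```python
from typing import Sequence


def edit_distance(ref: Sequence[str], hyp: Sequence[str]) -> int:
    """Levenshtein distance between two token sequences."""
    prev = list(range(len(hyp) + 1))  # distance from empty ref to each hyp prefix
    for i, r in enumerate(ref, start=1):
        curr = [i]  # distance from ref[:i] to an empty hypothesis
        for j, h in enumerate(hyp, start=1):
            cost = 0 if r == h else 1
            curr.append(min(prev[j] + 1,          # deletion
                            curr[j - 1] + 1,      # insertion
                            prev[j - 1] + cost))  # substitution or match
        prev = curr
    return prev[-1]


def word_error_rate(ref: Sequence[str], hyp: Sequence[str]) -> float:
    """WER = (substitutions + deletions + insertions) / reference length."""
    if not ref:
        raise ValueError("reference must not be empty")
    return edit_distance(ref, hyp) / len(ref)


def relative_reduction(before: float, after: float) -> float:
    """Fraction by which an error rate fell, relative to its starting value."""
    return (before - after) / before


# Hypothetical transcripts: one substitution ("sat" -> "sit") and one deletion ("the").
reference = "the cat sat on the mat".split()
hypothesis = "the cat sit on mat".split()
print(f"WER: {word_error_rate(reference, hypothesis):.3f}")
print(f"Relative reduction: {relative_reduction(15.2, 5.3):.1%}")
```

For Chinese text the same calculation is usually applied per character rather than per whitespace-separated word, so the inputs would be lists of characters.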
The models show significant improvements in speech accuracy, speaker similarity, prosodic naturalness, and sound quality, mainly thanks to optimizations in the training process. In the reinforcement learning stage, DiffRO and GRPO are used, with multi-dimensional rewards added for aspects such as duration and prosody. DiffRO (Differentiable Reward Optimization), developed by Alibaba's Tongyi Laboratory, is designed specifically for optimizing TTS models. GRPO (Group Relative Policy Optimization) compares a group of different outputs against one another to judge which are better and assigns rewards accordingly. GRPO is also applied to reinforcement learning for Flow Matching (which transforms a noise distribution into the real data distribution), the first such application in a voice cloning model in the industry.
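The group comparison at the heart of GRPO can be sketched as follows: several candidate outputs for the same input are scored, and each score is normalized against the mean and spread of its own group, so an output is rewarded only for beating its siblings. This is a minimal sketch of the commonly described advantage calculation, not Alibaba's training code; the reward values, the function name, and the use of the population standard deviation are all assumptions for illustration.

```python
from statistics import mean, pstdev
from typing import List, Sequence


def group_relative_advantages(rewards: Sequence[float], eps: float = 1e-8) -> List[float]:
    """Normalize each reward against its own group's mean and standard deviation."""
    mu = mean(rewards)
    sigma = pstdev(rewards)  # population std; an assumption, implementations vary
    # eps keeps the division defined when every candidate gets the same reward
    return [(r - mu) / (sigma + eps) for r in rewards]


# Hypothetical reward scores for four synthesized candidates of one sentence.
candidate_rewards = [0.62, 0.71, 0.55, 0.80]
advantages = group_relative_advantages(candidate_rewards)
for reward, advantage in zip(candidate_rewards, advantages):
    print(f"reward={reward:.2f}  advantage={advantage:+.3f}")
```

Candidates above the group mean receive a positive advantage and are reinforced; those below receive a negative one. Because the baseline comes from the group itself, no separate value model is needed.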
In addition, the tokenizer used in Fun-CosyVoice3.5 halves the frame rate, which improves training efficiency, and first-packet latency is reduced by 35%, greatly improving the real-time interaction experience.
Starting today, users can access both models on Alibaba Cloud Bailian.