SlideShare a Scribd company logo
1 of 57
Virtual singer 徴音梅林 & LINNE
CC-BY-NC
Slide author
●
張正一 (Chou Shouichi)/ MGdesigner
●
Paul Liu and me organize 徴音梅林開発委員會
● Wikimedia.tw: member of board of directors
(and direct tech development )
● A programmer
● A musician (Jazz ukulele, DTM)
● Shoichi.chou@gmail.com
Everyone knows her
Powered by
Yamaha
Vocaloid2 engine
So
Why
a FOSS 'v'ocaloid?
If you buy an instrument
You can
play any song,
Do anything.
play
By teeth
Break !
burn
In any Vocaloid product EULA
You didn't get whole rights
●
no “anti-society” (反社会) works
(so,What works are “anti-society”?)
● Trademarks protection (images, keywords)
(ex: 'Vocaloid' ,' 初音ミク ',' 初音ミク ''s image)
No using Miku images=not popular
musicians are
controlled
No freedom
Be ruled
● Using a Gibson guitar,you are its master.
● Using Vocaloid products, You are their slave.
INDIE DIE
UTAU ?
A free vocaloid-like
●
DIY a 『 vocaloid 』
● Programs: editor(frontend)+resampler+wavtool
● data: vocal DB - oto.ini + wav samples
● Vocal DB is an open spec ,many people DIY
vocaloid programs working flow
1.Editor: compose the melody(many notes)
2.Resampler: modulate a sample to Specified
pitch,or other parameters (velocity...).
3.Wavtool: combine these modulated wavs
Finally,we get a song vocal wav file,and mix into a
song
but
● Charge no fee,not freedom
● Default resampler work badly
● DB bad international support (S-JIS)
● Oto.ini no implementing ini comments “;“
● UTAU always auto sort oto.ini (hard collaboration)
● Hard UI control
● Not open source
● Its development is very private
And you know ...
Yamaha owns many super
patents
A nice “free vocaloid“
Is
Very difficult
During 2011-2012
One day, Paul Liu talked to me
● New Algorithm, 'World' better than Vocaloid2
●
Author: 日本山梨大學 Doctor 森勢将雅
● Patent free
● EFB-GW(Synthesizer) for UTAU
● Open source(old version GPL,newer is BSD)
● https://github.com/mmorise/World
●
During Dec,Dr. 森勢 'll do another great upgrade
How good is World algorithm?
● very awesome 'autotune'
(original official test is a realtime Karaoke autotune for 音痴 s. )
● Modulate a sample to any pitch without distortion
(Keep F0 well)
(Vocaloid2 can't ,so Miku need 3 different range versions of each sample)
● Very fast ,no need to pre-preapre frequency tables
(Just do it real time)
● If X86, Even works good on older machines(maybe
on ARM)
Ok Let's do it!
Finally
we made her...
Listen...
Hear “MAMA” cover
徵音梅林
「徵音」 :Ancient Chinese,Japanese
Pentatonic scale note. (Do Re Mi Sol La)
宮(きゅう)、商(しょう)、 角(かく)、 徴(
Also means We 'recruit' a voice actor(and
also a Jazz singer) from Internet
「梅林」: Merlin(super wizard)
林: Linux
http://projectmeilin.github.io/
Project Meilin Features
● CC-BY
● Utau compatible
● Professional recording(in studio)
● Src:24bits 48000hz wavs
●
VCV 連続音 、 VC 単独音
(V - Vowel c - Consonant)
● Recorded: Japanese,Mandarin(Taiwan style)
How good? A test
● Commercial Miku VS. open content Meilin
● V2 Miku each sample recorded high,middle,low
versions
VS.
Meilin each sample just record 1
version.
Listen to the
comparing video…
( song: 歌い手様総合テスト , Start from 0:44)
Especially check super low pitch and super
high pitch if is distorted (失真) ?
fact
Miku DB:
● 1gb+
● Only Japanese
Meilin DB:
● 627mb
● Japanese+Mardarin
●
Mardarlin DB is 3 倍
of JP DB
thank to Dr. 森勢
Without his effort and kindness,
a good FOSS virtual singer is
imposible
2 more
Special features
1: 14 Special effects
Defined in oto.ini
● 3 breath : br1,br2,br3 ( ex:Miku only have these breath. )
● Spanish 'R' rolling: trill
● Cough: cough
● Cry,dry tears:drytears
● Blownose: blownose
● Sucking: suck
●
sigh( 嘆 ):sgn1,sgn2,sgn3,sgn4
● Whistle :whsl
● clean throat: clnt
2: 日本方言 possible
●
EX: 円唇母音'う' in 関西弁 (video)
● in Mandarin ,there is the same 'u'
● Just borrow what we recorded.
● also can borrow other Mandarin samples for
synthesizing 方言 or some foreign languages.
(ex: 1 or 2 foreign lyrics in a Japanese song)
'v'ocaloid also can do speech
synthesis
Better than traditional speech synthesis
● Accent(= pitch,velocity,rhythm,speed) controllable
● Could do many emotion(melody lines) : cry,angry...
●
TTS,story telling,emotional ' うかがか' possible
● Some tests which I have done by Miku: 1,2,3,4 based on my scale
algorithm. 'Auto render' possible,but….
● If use Vocaloid to do this,you need to beg YAMAHA for opening API. But
our software stack are open source. She could do more than singing.
How made?
Recorded in a pro studio
Thanks to sponsor 阿怪 (Aguai),my master
(A famous POP song producer in TW.)
About the vocal
●
Her name is 羅竺 (Lo Chu).
● We choose her voice from 20 girls from on internet.
● She is a singer in a JAZZ / anime cover song band.
● Also vocal acting trained.
● Japanese accent not bad.
●
Japanese friend ATsushi 發音指導
But very hard work
Japanese recording need 3~4 hours.
But
Intact Madarin(possibility on math ,then minus
repeated samples by Phonology)
Madarin recording needs days.
The final day
LINNE platform
● We defined the FOSS 'v'ocaloid stack
● Of course opensource
● Compatible with Utau DB (but UTF-8)
● resampler+wavtool+editor(interface)+DB
-making tools
● May include 'hardware'
Hardware Ex: Doll robot
Our Oto.ini DB spec
● You can use ';' for comments
● Editors programs shouldn't resort the file
● UTF-8
● IPA based (International Phonetic Alphabet)
● By IPA,Different languages could use common
pronunciation samples
(no more re-recording again, keep the DB size smaller, more storage efficiency )
Engine (now is xvsqExec ,may need to
be changed)
Jcadencii
Linne-editor (in dev)
(song editor,front end)
Wavtool-pl
(GPL wavtool)
tn_fnds_yc (gpl)
(resampler,EFB-GW variant )
World lib
Other programs in the future
ex: linne-TTS
The chart may need evolution.
Problem now: the editor(frontend)
● Cadencii is written by .net with binding too
many Windows native calls
● Jcadencii is very slow (Cadencii java port)
● Upstream dev stopped. We also give it up.
● Another open Utau frontend:
http://fluidvocalsynth.weebly.com/ (also .Net)
●linne-editor(frontend)
● https://github.com/marty1885/linne-editor
● In very earily development
fact
We don't have enough manpower
about interface coding…
● When normal users edit, still need wine+Utau
● Similar to early Linux dev in Minix >_<
Wish more people join
Make LINNE
More complete!
徴音梅林開発委員会
● Open source community
● OSS programmers,musicians,a
physicist,Phonologists,artists...
● Members are international(TW,JP)
welcome
● Official Site
● Github: https://github.com/ProjectMeilin/
● Slack (tech talk): https://meilin.slack.com/
(email me for invitation : shoichi.chou@gmail.com )
● FB fan page
● FB group (more about DB making and musician)
● Youtube channel
Q&A
Listen another songs? :P
Hackasong.JP
Umbrella カバー
END

More Related Content

Similar to Intro for project Meilin and linne platform

Sp2012_Week3_Part2_ Workshop
Sp2012_Week3_Part2_ WorkshopSp2012_Week3_Part2_ Workshop
Sp2012_Week3_Part2_ Workshop
educw200
 
Audio Editing, Langara Journalism
Audio Editing, Langara JournalismAudio Editing, Langara Journalism
Audio Editing, Langara Journalism
Duncan McHugh
 
Langara Audacity workshop
Langara Audacity workshopLangara Audacity workshop
Langara Audacity workshop
Duncan McHugh
 
Week3 part2 workshop
Week3 part2 workshopWeek3 part2 workshop
Week3 part2 workshop
educw200
 
Japanese Rubyists you have not met yet
Japanese Rubyists you have not met yetJapanese Rubyists you have not met yet
Japanese Rubyists you have not met yet
masayoshi takahashi
 
Audacity Workshop at UBC SoJ
Audacity Workshop at UBC SoJAudacity Workshop at UBC SoJ
Audacity Workshop at UBC SoJ
Duncan McHugh
 

Similar to Intro for project Meilin and linne platform (20)

BeepComp - Chiptune Creator - User's Guide (v0.2.2)
BeepComp - Chiptune Creator - User's Guide (v0.2.2)BeepComp - Chiptune Creator - User's Guide (v0.2.2)
BeepComp - Chiptune Creator - User's Guide (v0.2.2)
 
Audacity
AudacityAudacity
Audacity
 
Task 5 sample material
Task 5 sample materialTask 5 sample material
Task 5 sample material
 
Sound Editing
Sound EditingSound Editing
Sound Editing
 
Sp2012_Week3_Part2_ Workshop
Sp2012_Week3_Part2_ WorkshopSp2012_Week3_Part2_ Workshop
Sp2012_Week3_Part2_ Workshop
 
Sound Recording Glossary
Sound Recording GlossarySound Recording Glossary
Sound Recording Glossary
 
Audio Editing, Langara Journalism
Audio Editing, Langara JournalismAudio Editing, Langara Journalism
Audio Editing, Langara Journalism
 
Langara Audacity workshop
Langara Audacity workshopLangara Audacity workshop
Langara Audacity workshop
 
Audacity Workshop
Audacity WorkshopAudacity Workshop
Audacity Workshop
 
Bunken2207
Bunken2207Bunken2207
Bunken2207
 
Week3 part2 workshop
Week3 part2 workshopWeek3 part2 workshop
Week3 part2 workshop
 
Japanese Rubyists you have not met yet
Japanese Rubyists you have not met yetJapanese Rubyists you have not met yet
Japanese Rubyists you have not met yet
 
Creating a game using C++, OpenGL and Qt
Creating a game using C++, OpenGL and QtCreating a game using C++, OpenGL and Qt
Creating a game using C++, OpenGL and Qt
 
Audacity Workshop at UBC SoJ
Audacity Workshop at UBC SoJAudacity Workshop at UBC SoJ
Audacity Workshop at UBC SoJ
 
Linux Sucks
Linux SucksLinux Sucks
Linux Sucks
 
Linux Sucks
Linux SucksLinux Sucks
Linux Sucks
 
Storm Education Software Course Overview
Storm Education Software Course OverviewStorm Education Software Course Overview
Storm Education Software Course Overview
 
Music Trackers - Linux Usergroup Nijmegen 2014
Music Trackers - Linux Usergroup Nijmegen 2014Music Trackers - Linux Usergroup Nijmegen 2014
Music Trackers - Linux Usergroup Nijmegen 2014
 
Storm Education Software Presentation
Storm Education Software PresentationStorm Education Software Presentation
Storm Education Software Presentation
 
Sound recording glossary improved
Sound recording glossary improvedSound recording glossary improved
Sound recording glossary improved
 

More from Shoichi Chou

用開源自由軟體做同人誌/寫真集
用開源自由軟體做同人誌/寫真集用開源自由軟體做同人誌/寫真集
用開源自由軟體做同人誌/寫真集
Shoichi Chou
 

More from Shoichi Chou (19)

適合民防人員的業餘無線電(HAM)基礎 - 你所需知道的那些基礎業餘無線電觀念與知識
適合民防人員的業餘無線電(HAM)基礎 - 你所需知道的那些基礎業餘無線電觀念與知識適合民防人員的業餘無線電(HAM)基礎 - 你所需知道的那些基礎業餘無線電觀念與知識
適合民防人員的業餘無線電(HAM)基礎 - 你所需知道的那些基礎業餘無線電觀念與知識
 
密碼安全與策略 - 2023年當代狀況.pdf
密碼安全與策略 - 2023年當代狀況.pdf密碼安全與策略 - 2023年當代狀況.pdf
密碼安全與策略 - 2023年當代狀況.pdf
 
向藝文人士推廣開放文化的3000天
向藝文人士推廣開放文化的3000天向藝文人士推廣開放文化的3000天
向藝文人士推廣開放文化的3000天
 
Coscup 2019 如何與侏儸紀恐龍群共舞? - 創立品牌的經驗
Coscup 2019 如何與侏儸紀恐龍群共舞? - 創立品牌的經驗Coscup 2019 如何與侏儸紀恐龍群共舞? - 創立品牌的經驗
Coscup 2019 如何與侏儸紀恐龍群共舞? - 創立品牌的經驗
 
Musicha 音創筆電試玩坊-在新竹碼農
Musicha 音創筆電試玩坊-在新竹碼農 Musicha 音創筆電試玩坊-在新竹碼農
Musicha 音創筆電試玩坊-在新竹碼農
 
Musicha - 開發一台軟、硬體整合的開源音樂創作筆電
Musicha - 開發一台軟、硬體整合的開源音樂創作筆電Musicha - 開發一台軟、硬體整合的開源音樂創作筆電
Musicha - 開發一台軟、硬體整合的開源音樂創作筆電
 
0 A.D.下一個3A級即時戰略大作介紹
0 A.D.下一個3A級即時戰略大作介紹0 A.D.下一個3A級即時戰略大作介紹
0 A.D.下一個3A級即時戰略大作介紹
 
打造維基實驗基地 - mediawiki快速架站
打造維基實驗基地 - mediawiki快速架站打造維基實驗基地 - mediawiki快速架站
打造維基實驗基地 - mediawiki快速架站
 
台語注音符號超快速入門
台語注音符號超快速入門台語注音符號超快速入門
台語注音符號超快速入門
 
開放資料成果:Open吳守禮國台對照活用辭典
開放資料成果:Open吳守禮國台對照活用辭典開放資料成果:Open吳守禮國台對照活用辭典
開放資料成果:Open吳守禮國台對照活用辭典
 
compic: Goddess Mazu is coming
compic: Goddess Mazu is comingcompic: Goddess Mazu is coming
compic: Goddess Mazu is coming
 
comic:Armageddon
comic:Armageddon comic:Armageddon
comic:Armageddon
 
用開源自由軟體做同人誌/寫真集
用開源自由軟體做同人誌/寫真集用開源自由軟體做同人誌/寫真集
用開源自由軟體做同人誌/寫真集
 
2017還再缺字,Unicode不是年年更新嗎?
2017還再缺字,Unicode不是年年更新嗎?2017還再缺字,Unicode不是年年更新嗎?
2017還再缺字,Unicode不是年年更新嗎?
 
Linux Pro audio development 2017
Linux Pro audio development 2017Linux Pro audio development 2017
Linux Pro audio development 2017
 
引入IDS動態組字技術到維基網站 / Push IDS rendering technology to mediawiki sites
引入IDS動態組字技術到維基網站 / Push IDS rendering technology to mediawiki sites引入IDS動態組字技術到維基網站 / Push IDS rendering technology to mediawiki sites
引入IDS動態組字技術到維基網站 / Push IDS rendering technology to mediawiki sites
 
用KXstudio Linux作音樂的心得
用KXstudio Linux作音樂的心得用KXstudio Linux作音樂的心得
用KXstudio Linux作音樂的心得
 
虛擬歌手打造經驗分享
虛擬歌手打造經驗分享虛擬歌手打造經驗分享
虛擬歌手打造經驗分享
 
Coscup2010-我是Programmer我也想做音樂
Coscup2010-我是Programmer我也想做音樂Coscup2010-我是Programmer我也想做音樂
Coscup2010-我是Programmer我也想做音樂
 

Recently uploaded

Harnessing Passkeys in the Battle Against AI-Powered Cyber Threats.pptx
Harnessing Passkeys in the Battle Against AI-Powered Cyber Threats.pptxHarnessing Passkeys in the Battle Against AI-Powered Cyber Threats.pptx
Harnessing Passkeys in the Battle Against AI-Powered Cyber Threats.pptx
FIDO Alliance
 

Recently uploaded (20)

How we scaled to 80K users by doing nothing!.pdf
How we scaled to 80K users by doing nothing!.pdfHow we scaled to 80K users by doing nothing!.pdf
How we scaled to 80K users by doing nothing!.pdf
 
2024 May Patch Tuesday
2024 May Patch Tuesday2024 May Patch Tuesday
2024 May Patch Tuesday
 
ERP Contender Series: Acumatica vs. Sage Intacct
ERP Contender Series: Acumatica vs. Sage IntacctERP Contender Series: Acumatica vs. Sage Intacct
ERP Contender Series: Acumatica vs. Sage Intacct
 
Long journey of Ruby Standard library at RubyKaigi 2024
Long journey of Ruby Standard library at RubyKaigi 2024Long journey of Ruby Standard library at RubyKaigi 2024
Long journey of Ruby Standard library at RubyKaigi 2024
 
Where to Learn More About FDO _ Richard at FIDO Alliance.pdf
Where to Learn More About FDO _ Richard at FIDO Alliance.pdfWhere to Learn More About FDO _ Richard at FIDO Alliance.pdf
Where to Learn More About FDO _ Richard at FIDO Alliance.pdf
 
Generative AI Use Cases and Applications.pdf
Generative AI Use Cases and Applications.pdfGenerative AI Use Cases and Applications.pdf
Generative AI Use Cases and Applications.pdf
 
Human Expert Website Manual WCAG 2.0 2.1 2.2 Audit - Digital Accessibility Au...
Human Expert Website Manual WCAG 2.0 2.1 2.2 Audit - Digital Accessibility Au...Human Expert Website Manual WCAG 2.0 2.1 2.2 Audit - Digital Accessibility Au...
Human Expert Website Manual WCAG 2.0 2.1 2.2 Audit - Digital Accessibility Au...
 
State of the Smart Building Startup Landscape 2024!
State of the Smart Building Startup Landscape 2024!State of the Smart Building Startup Landscape 2024!
State of the Smart Building Startup Landscape 2024!
 
UiPath manufacturing technology benefits and AI overview
UiPath manufacturing technology benefits and AI overviewUiPath manufacturing technology benefits and AI overview
UiPath manufacturing technology benefits and AI overview
 
Choosing the Right FDO Deployment Model for Your Application _ Geoffrey at In...
Choosing the Right FDO Deployment Model for Your Application _ Geoffrey at In...Choosing the Right FDO Deployment Model for Your Application _ Geoffrey at In...
Choosing the Right FDO Deployment Model for Your Application _ Geoffrey at In...
 
AI mind or machine power point presentation
AI mind or machine power point presentationAI mind or machine power point presentation
AI mind or machine power point presentation
 
Intro to Passkeys and the State of Passwordless.pptx
Intro to Passkeys and the State of Passwordless.pptxIntro to Passkeys and the State of Passwordless.pptx
Intro to Passkeys and the State of Passwordless.pptx
 
Event-Driven Architecture Masterclass: Challenges in Stream Processing
Event-Driven Architecture Masterclass: Challenges in Stream ProcessingEvent-Driven Architecture Masterclass: Challenges in Stream Processing
Event-Driven Architecture Masterclass: Challenges in Stream Processing
 
Collecting & Temporal Analysis of Behavioral Web Data - Tales From The Inside
Collecting & Temporal Analysis of Behavioral Web Data - Tales From The InsideCollecting & Temporal Analysis of Behavioral Web Data - Tales From The Inside
Collecting & Temporal Analysis of Behavioral Web Data - Tales From The Inside
 
TopCryptoSupers 12thReport OrionX May2024
TopCryptoSupers 12thReport OrionX May2024TopCryptoSupers 12thReport OrionX May2024
TopCryptoSupers 12thReport OrionX May2024
 
Harnessing Passkeys in the Battle Against AI-Powered Cyber Threats.pptx
Harnessing Passkeys in the Battle Against AI-Powered Cyber Threats.pptxHarnessing Passkeys in the Battle Against AI-Powered Cyber Threats.pptx
Harnessing Passkeys in the Battle Against AI-Powered Cyber Threats.pptx
 
The Metaverse: Are We There Yet?
The  Metaverse:    Are   We  There  Yet?The  Metaverse:    Are   We  There  Yet?
The Metaverse: Are We There Yet?
 
Vector Search @ sw2con for slideshare.pptx
Vector Search @ sw2con for slideshare.pptxVector Search @ sw2con for slideshare.pptx
Vector Search @ sw2con for slideshare.pptx
 
Google I/O Extended 2024 Warsaw
Google I/O Extended 2024 WarsawGoogle I/O Extended 2024 Warsaw
Google I/O Extended 2024 Warsaw
 
The Zero-ETL Approach: Enhancing Data Agility and Insight
The Zero-ETL Approach: Enhancing Data Agility and InsightThe Zero-ETL Approach: Enhancing Data Agility and Insight
The Zero-ETL Approach: Enhancing Data Agility and Insight
 

Intro for project Meilin and linne platform

  • 1. Virtual singer 徴音梅林 & LINNE CC-BY-NC
  • 2. Slide author ● 張正一 (Chou Shouichi)/ MGdesigner ● Paul Liu and me organize 徴音梅林開発委員會 ● Wikimedia.tw: member of board of directors (and direct tech development ) ● A programmer ● A musician (Jazz ukulele, DTM) ● Shoichi.chou@gmail.com
  • 6. If you buy an instrument You can play any song, Do anything.
  • 10. burn
  • 11. In any Vocaloid product EULA You didn't get whole rights ● no “anti-society” (反社会) works (so,What works are “anti-society”?) ● Trademarks protection (images, keywords) (ex: 'Vocaloid' ,' 初音ミク ',' 初音ミク ''s image)
  • 12.
  • 13.
  • 14. No using Miku images=not popular musicians are controlled No freedom
  • 15. Be ruled ● Using a Gibson guitar,you are its master. ● Using Vocaloid products, You are their slave.
  • 18. A free vocaloid-like ● DIY a 『 vocaloid 』 ● Programs: editor(frontend)+resampler+wavtool ● data: vocal DB - oto.ini + wav samples ● Vocal DB is an open spec ,many people DIY
  • 19. vocaloid programs working flow 1.Editor: compose the melody(many notes) 2.Resampler: modulate a sample to Specified pitch,or other parameters (velocity...). 3.Wavtool: combine these modulated wavs Finally,we get a song vocal wav file,and mix into a song
  • 20. but ● Charge no fee,not freedom ● Default resampler work badly ● DB bad international support (S-JIS) ● Oto.ini no implementing ini comments “;“ ● UTAU always auto sort oto.ini (hard collaboration) ● Hard UI control ● Not open source ● Its development is very private
  • 21. And you know ... Yamaha owns many super patents
  • 22. A nice “free vocaloid“ Is Very difficult
  • 23. During 2011-2012 One day, Paul Liu talked to me ● New Algorithm, 'World' better than Vocaloid2 ● Author: 日本山梨大學 Doctor 森勢将雅 ● Patent free ● EFB-GW(Synthesizer) for UTAU ● Open source(old version GPL,newer is BSD) ● https://github.com/mmorise/World ● During Dec,Dr. 森勢 'll do another great upgrade
  • 24. How good is World algorithm? ● very awesome 'autotune' (original official test is a realtime Karaoke autotune for 音痴 s. ) ● Modulate a sample to any pitch without distortion (Keep F0 well) (Vocaloid2 can't ,so Miku need 3 different range versions of each sample) ● Very fast ,no need to pre-preapre frequency tables (Just do it real time) ● If X86, Even works good on older machines(maybe on ARM)
  • 25. Ok Let's do it!
  • 28. 徵音梅林 「徵音」 :Ancient Chinese,Japanese Pentatonic scale note. (Do Re Mi Sol La) 宮(きゅう)、商(しょう)、 角(かく)、 徴( Also means We 'recruit' a voice actor(and also a Jazz singer) from Internet 「梅林」: Merlin(super wizard) 林: Linux
  • 30. Project Meilin Features ● CC-BY ● Utau compatible ● Professional recording(in studio) ● Src:24bits 48000hz wavs ● VCV 連続音 、 VC 単独音 (V - Vowel c - Consonant) ● Recorded: Japanese,Mandarin(Taiwan style)
  • 31. How good? A test ● Commercial Miku VS. open content Meilin ● V2 Miku each sample recorded high,middle,low versions VS. Meilin each sample just record 1 version.
  • 32. Listen to the comparing video… ( song: 歌い手様総合テスト , Start from 0:44) Especially check super low pitch and super high pitch if is distorted (失真) ?
  • 33. fact Miku DB: ● 1gb+ ● Only Japanese Meilin DB: ● 627mb ● Japanese+Mardarin ● Mardarlin DB is 3 倍 of JP DB
  • 34. thank to Dr. 森勢 Without his effort and kindness, a good FOSS virtual singer is imposible
  • 36. 1: 14 Special effects Defined in oto.ini ● 3 breath : br1,br2,br3 ( ex:Miku only have these breath. ) ● Spanish 'R' rolling: trill ● Cough: cough ● Cry,dry tears:drytears ● Blownose: blownose ● Sucking: suck ● sigh( 嘆 ):sgn1,sgn2,sgn3,sgn4 ● Whistle :whsl ● clean throat: clnt
  • 37. 2: 日本方言 possible ● EX: 円唇母音'う' in 関西弁 (video) ● in Mandarin ,there is the same 'u' ● Just borrow what we recorded. ● also can borrow other Mandarin samples for synthesizing 方言 or some foreign languages. (ex: 1 or 2 foreign lyrics in a Japanese song)
  • 38. 'v'ocaloid also can do speech synthesis Better than traditional speech synthesis ● Accent(= pitch,velocity,rhythm,speed) controllable ● Could do many emotion(melody lines) : cry,angry... ● TTS,story telling,emotional ' うかがか' possible ● Some tests which I have done by Miku: 1,2,3,4 based on my scale algorithm. 'Auto render' possible,but…. ● If use Vocaloid to do this,you need to beg YAMAHA for opening API. But our software stack are open source. She could do more than singing.
  • 40. Recorded in a pro studio
  • 41. Thanks to sponsor 阿怪 (Aguai),my master (A famous POP song producer in TW.)
  • 42. About the vocal ● Her name is 羅竺 (Lo Chu). ● We choose her voice from 20 girls from on internet. ● She is a singer in a JAZZ / anime cover song band. ● Also vocal acting trained. ● Japanese accent not bad. ● Japanese friend ATsushi 發音指導
  • 43. But very hard work Japanese recording need 3~4 hours. But Intact Madarin(possibility on math ,then minus repeated samples by Phonology) Madarin recording needs days.
  • 45.
  • 46. LINNE platform ● We defined the FOSS 'v'ocaloid stack ● Of course opensource ● Compatible with Utau DB (but UTF-8) ● resampler+wavtool+editor(interface)+DB -making tools ● May include 'hardware'
  • 48. Our Oto.ini DB spec ● You can use ';' for comments ● Editors programs shouldn't resort the file ● UTF-8 ● IPA based (International Phonetic Alphabet) ● By IPA,Different languages could use common pronunciation samples (no more re-recording again, keep the DB size smaller, more storage efficiency )
  • 49. Engine (now is xvsqExec ,may need to be changed) Jcadencii Linne-editor (in dev) (song editor,front end) Wavtool-pl (GPL wavtool) tn_fnds_yc (gpl) (resampler,EFB-GW variant ) World lib Other programs in the future ex: linne-TTS
  • 50. The chart may need evolution.
  • 51. Problem now: the editor(frontend) ● Cadencii is written by .net with binding too many Windows native calls ● Jcadencii is very slow (Cadencii java port) ● Upstream dev stopped. We also give it up. ● Another open Utau frontend: http://fluidvocalsynth.weebly.com/ (also .Net)
  • 53. fact We don't have enough manpower about interface coding… ● When normal users edit, still need wine+Utau ● Similar to early Linux dev in Minix >_<
  • 54. Wish more people join Make LINNE More complete!
  • 55. 徴音梅林開発委員会 ● Open source community ● OSS programmers,musicians,a physicist,Phonologists,artists... ● Members are international(TW,JP)
  • 56. welcome ● Official Site ● Github: https://github.com/ProjectMeilin/ ● Slack (tech talk): https://meilin.slack.com/ (email me for invitation : shoichi.chou@gmail.com ) ● FB fan page ● FB group (more about DB making and musician) ● Youtube channel
  • 57. Q&A Listen another songs? :P Hackasong.JP Umbrella カバー END