Skip to main navigation Skip to search Skip to main content

VCTUBE: A library for automatic speech data annotation

  • Seong Choi
  • , Seunghoon Jeong
  • , Jeewoo Yoon
  • , Migyeong Yang
  • , Minsam Ko
  • , Eunil Park
  • , Jinyoung Han
  • , Munyoung Lee
  • , Seonghee Lee
  • Sungkyunkwan University
  • Hanyang University
  • Electronics and Telecommunications Research Institute

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

Abstract

We introduce an open-source Python library, VCTUBE, which can automatically generate <audio, text> pair of speech data from a given Youtube URL. We believe VCTUBE is useful for collecting, processing, and annotating speech data easily toward developing speech synthesis systems.

Original languageEnglish
Title of host publicationInterspeech 2020
PublisherInternational Speech Communication Association
Pages1013-1014
Number of pages2
ISBN (Print)9781713820697
DOIs
StatePublished - 2020
Event21st Annual Conference of the International Speech Communication Association, INTERSPEECH 2020 - Shanghai, China
Duration: 25 Oct 202029 Oct 2020

Publication series

NameProceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH
Volume2020-October
ISSN (Print)2308-457X
ISSN (Electronic)1990-9772

Conference

Conference21st Annual Conference of the International Speech Communication Association, INTERSPEECH 2020
Country/TerritoryChina
CityShanghai
Period25/10/2029/10/20

Fingerprint

Dive into the research topics of 'VCTUBE: A library for automatic speech data annotation'. Together they form a unique fingerprint.

Cite this