{"id":8442,"date":"2024-11-12T17:45:44","date_gmt":"2024-11-12T16:45:44","guid":{"rendered":"https:\/\/projecteaina.cat\/tech\/?post_type=publicacions&#038;p=8442"},"modified":"2024-11-21T18:53:48","modified_gmt":"2024-11-21T17:53:48","slug":"lafrescat-a-catalan-multi-accent-speech-dataset-for-text-to-speech","status":"publish","type":"publicacions","link":"https:\/\/projecteaina.cat\/tech\/publicacions\/lafrescat-a-catalan-multi-accent-speech-dataset-for-text-to-speech\/","title":{"rendered":"LaFresCat: A Catalan Multi-Accent Speech Dataset for Text-to-Speech"},"excerpt":{"rendered":"<p>Current generative text-to-speech (TTS) models are robust and can learn the phonetics of a language almost perfectly. For this to happen, the speech data used to train them must cover the full phonetic richness of the language, including the phenomena that distinguish its accents. For Catalan, although various public speech corpora are available, there is a lack of high-quality, open-access data covering its variety of accents. To meet this need, we have produced LaFresCat, a studio-quality, open-source Catalan multi-accent dataset totaling 3.5 hours that covers four of the most prominent accents: Balearic, Central, North-Western and Valencian. We provide a detailed description of how utterances and recordings were produced. To evaluate the efficacy of LaFresCat, we trained a diffusion-based TTS model. 
Despite the small size of the dataset, we show that it is possible to generate accent-specific speech with an acceptable quality, and even enhance it by taking advantage of other Catalan datasets.<\/p>\n","protected":false},"featured_media":0,"template":"","meta":{"_acf_changed":false,"_monsterinsights_skip_tracking":false,"_monsterinsights_sitenote_active":false,"_monsterinsights_sitenote_note":"","_monsterinsights_sitenote_category":0},"class_list":["post-8442","publicacions","type-publicacions","status-publish","hentry"],"acf":[],"yoast_head":"<!-- This site is optimized with the Yoast SEO plugin v27.6 - https:\/\/yoast.com\/product\/yoast-seo-wordpress\/ -->\n<title>LaFresCat: A Catalan Multi-Accent Speech Dataset for Text-to-Speech - Projecte Aina Tech<\/title>\n<meta name=\"robots\" content=\"index, follow, max-snippet:-1, max-image-preview:large, max-video-preview:-1\" \/>\n<link rel=\"canonical\" href=\"https:\/\/projecteaina.cat\/tech\/publicacions\/lafrescat-a-catalan-multi-accent-speech-dataset-for-text-to-speech\/\" \/>\n<meta property=\"og:locale\" content=\"ca_ES\" \/>\n<meta property=\"og:type\" content=\"article\" \/>\n<meta property=\"og:title\" content=\"LaFresCat: A Catalan Multi-Accent Speech Dataset for Text-to-Speech - Projecte Aina Tech\" \/>\n<meta property=\"og:description\" content=\"Current generative text-to-speech (TTS) models are very robust and capable of learning the phonetics of a language almost perfectly. To do so, it remains crucial that the speech data used to train such models covers all phonetic richness. This includes phenomena of different accents. In the case of Catalan, although having access to various public speech corpora, there is a lack of high-quality, open access data covering its variety of accents. 
To meet this need, we have produced LaFresCat, a studio quality open-source Catalan multi-accent dataset with a total of 3.5 hours that covers 4 of the most prominent accents: Balearic, Central, North-Western and Valencian. We provide a detailed description of how utterances and recordings were produced. To evaluate the efficacy of LaFresCat, we trained a diffusion-based TTS model. Despite the small size of the dataset, we show that it is possible to generate accent-specific speech with an acceptable quality, and even enhance it by taking advantage of other Catalan datasets.\" \/>\n<meta property=\"og:url\" content=\"https:\/\/projecteaina.cat\/tech\/publicacions\/lafrescat-a-catalan-multi-accent-speech-dataset-for-text-to-speech\/\" \/>\n<meta property=\"og:site_name\" content=\"Projecte Aina Tech\" \/>\n<meta property=\"article:modified_time\" content=\"2024-11-21T17:53:48+00:00\" \/>\n<meta name=\"twitter:card\" content=\"summary_large_image\" \/>\n<meta name=\"twitter:site\" content=\"@projecte_aina\" \/>\n<script type=\"application\/ld+json\" class=\"yoast-schema-graph\">{\"@context\":\"https:\\\/\\\/schema.org\",\"@graph\":[{\"@type\":\"WebPage\",\"@id\":\"https:\\\/\\\/projecteaina.cat\\\/tech\\\/publicacions\\\/lafrescat-a-catalan-multi-accent-speech-dataset-for-text-to-speech\\\/\",\"url\":\"https:\\\/\\\/projecteaina.cat\\\/tech\\\/publicacions\\\/lafrescat-a-catalan-multi-accent-speech-dataset-for-text-to-speech\\\/\",\"name\":\"LaFresCat: A Catalan Multi-Accent Speech Dataset for Text-to-Speech - Projecte Aina 
Tech\",\"isPartOf\":{\"@id\":\"https:\\\/\\\/projecteaina.cat\\\/tech\\\/#website\"},\"datePublished\":\"2024-11-12T16:45:44+00:00\",\"dateModified\":\"2024-11-21T17:53:48+00:00\",\"breadcrumb\":{\"@id\":\"https:\\\/\\\/projecteaina.cat\\\/tech\\\/publicacions\\\/lafrescat-a-catalan-multi-accent-speech-dataset-for-text-to-speech\\\/#breadcrumb\"},\"inLanguage\":\"ca\",\"potentialAction\":[{\"@type\":\"ReadAction\",\"target\":[\"https:\\\/\\\/projecteaina.cat\\\/tech\\\/publicacions\\\/lafrescat-a-catalan-multi-accent-speech-dataset-for-text-to-speech\\\/\"]}]},{\"@type\":\"BreadcrumbList\",\"@id\":\"https:\\\/\\\/projecteaina.cat\\\/tech\\\/publicacions\\\/lafrescat-a-catalan-multi-accent-speech-dataset-for-text-to-speech\\\/#breadcrumb\",\"itemListElement\":[{\"@type\":\"ListItem\",\"position\":1,\"name\":\"Inici\",\"item\":\"https:\\\/\\\/projecteaina.cat\\\/tech\\\/\"},{\"@type\":\"ListItem\",\"position\":2,\"name\":\"LaFresCat: A Catalan Multi-Accent Speech Dataset for Text-to-Speech\"}]},{\"@type\":\"WebSite\",\"@id\":\"https:\\\/\\\/projecteaina.cat\\\/tech\\\/#website\",\"url\":\"https:\\\/\\\/projecteaina.cat\\\/tech\\\/\",\"name\":\"Projecte Aina Tech\",\"description\":\"Impulsant l&#039;\u00fas del catal\u00e0 en l&#039;era digital\",\"publisher\":{\"@id\":\"https:\\\/\\\/projecteaina.cat\\\/tech\\\/#organization\"},\"potentialAction\":[{\"@type\":\"SearchAction\",\"target\":{\"@type\":\"EntryPoint\",\"urlTemplate\":\"https:\\\/\\\/projecteaina.cat\\\/tech\\\/?s={search_term_string}\"},\"query-input\":{\"@type\":\"PropertyValueSpecification\",\"valueRequired\":true,\"valueName\":\"search_term_string\"}}],\"inLanguage\":\"ca\"},{\"@type\":\"Organization\",\"@id\":\"https:\\\/\\\/projecteaina.cat\\\/tech\\\/#organization\",\"name\":\"Projecte Aina 
Tech\",\"url\":\"https:\\\/\\\/projecteaina.cat\\\/tech\\\/\",\"logo\":{\"@type\":\"ImageObject\",\"inLanguage\":\"ca\",\"@id\":\"https:\\\/\\\/projecteaina.cat\\\/tech\\\/#\\\/schema\\\/logo\\\/image\\\/\",\"url\":\"https:\\\/\\\/projecteaina.cat\\\/tech\\\/wp-content\\\/uploads\\\/2023\\\/11\\\/cropped-aina-home-logo.jpg\",\"contentUrl\":\"https:\\\/\\\/projecteaina.cat\\\/tech\\\/wp-content\\\/uploads\\\/2023\\\/11\\\/cropped-aina-home-logo.jpg\",\"width\":512,\"height\":512,\"caption\":\"Projecte Aina Tech\"},\"image\":{\"@id\":\"https:\\\/\\\/projecteaina.cat\\\/tech\\\/#\\\/schema\\\/logo\\\/image\\\/\"},\"sameAs\":[\"https:\\\/\\\/x.com\\\/projecte_aina\",\"https:\\\/\\\/www.linkedin.com\\\/company\\\/projecte-aina\\\/\"]}]}<\/script>\n<!-- \/ Yoast SEO plugin. -->","yoast_head_json":{"title":"LaFresCat: A Catalan Multi-Accent Speech Dataset for Text-to-Speech - Projecte Aina Tech","robots":{"index":"index","follow":"follow","max-snippet":"max-snippet:-1","max-image-preview":"max-image-preview:large","max-video-preview":"max-video-preview:-1"},"canonical":"https:\/\/projecteaina.cat\/tech\/publicacions\/lafrescat-a-catalan-multi-accent-speech-dataset-for-text-to-speech\/","og_locale":"ca_ES","og_type":"article","og_title":"LaFresCat: A Catalan Multi-Accent Speech Dataset for Text-to-Speech - Projecte Aina Tech","og_description":"Current generative text-to-speech (TTS) models are very robust and capable of learning the phonetics of a language almost perfectly. To do so, it remains crucial that the speech data used to train such models covers all phonetic richness. This includes phenomena of different accents. In the case of Catalan, although having access to various public speech corpora, there is a lack of high-quality, open access data covering its variety of accents. 
To meet this need, we have produced LaFresCat, a studio quality open-source Catalan multi-accent dataset with a total of 3.5 hours that covers 4 of the most prominent accents: Balearic, Central, North-Western and Valencian. We provide a detailed description of how utterances and recordings were produced. To evaluate the efficacy of LaFresCat, we trained a diffusion-based TTS model. Despite the small size of the dataset, we show that it is possible to generate accent-specific speech with an acceptable quality, and even enhance it by taking advantage of other Catalan datasets.","og_url":"https:\/\/projecteaina.cat\/tech\/publicacions\/lafrescat-a-catalan-multi-accent-speech-dataset-for-text-to-speech\/","og_site_name":"Projecte Aina Tech","article_modified_time":"2024-11-21T17:53:48+00:00","twitter_card":"summary_large_image","twitter_site":"@projecte_aina","schema":{"@context":"https:\/\/schema.org","@graph":[{"@type":"WebPage","@id":"https:\/\/projecteaina.cat\/tech\/publicacions\/lafrescat-a-catalan-multi-accent-speech-dataset-for-text-to-speech\/","url":"https:\/\/projecteaina.cat\/tech\/publicacions\/lafrescat-a-catalan-multi-accent-speech-dataset-for-text-to-speech\/","name":"LaFresCat: A Catalan Multi-Accent Speech Dataset for Text-to-Speech - Projecte Aina 
Tech","isPartOf":{"@id":"https:\/\/projecteaina.cat\/tech\/#website"},"datePublished":"2024-11-12T16:45:44+00:00","dateModified":"2024-11-21T17:53:48+00:00","breadcrumb":{"@id":"https:\/\/projecteaina.cat\/tech\/publicacions\/lafrescat-a-catalan-multi-accent-speech-dataset-for-text-to-speech\/#breadcrumb"},"inLanguage":"ca","potentialAction":[{"@type":"ReadAction","target":["https:\/\/projecteaina.cat\/tech\/publicacions\/lafrescat-a-catalan-multi-accent-speech-dataset-for-text-to-speech\/"]}]},{"@type":"BreadcrumbList","@id":"https:\/\/projecteaina.cat\/tech\/publicacions\/lafrescat-a-catalan-multi-accent-speech-dataset-for-text-to-speech\/#breadcrumb","itemListElement":[{"@type":"ListItem","position":1,"name":"Inici","item":"https:\/\/projecteaina.cat\/tech\/"},{"@type":"ListItem","position":2,"name":"LaFresCat: A Catalan Multi-Accent Speech Dataset for Text-to-Speech"}]},{"@type":"WebSite","@id":"https:\/\/projecteaina.cat\/tech\/#website","url":"https:\/\/projecteaina.cat\/tech\/","name":"Projecte Aina Tech","description":"Impulsant l&#039;\u00fas del catal\u00e0 en l&#039;era digital","publisher":{"@id":"https:\/\/projecteaina.cat\/tech\/#organization"},"potentialAction":[{"@type":"SearchAction","target":{"@type":"EntryPoint","urlTemplate":"https:\/\/projecteaina.cat\/tech\/?s={search_term_string}"},"query-input":{"@type":"PropertyValueSpecification","valueRequired":true,"valueName":"search_term_string"}}],"inLanguage":"ca"},{"@type":"Organization","@id":"https:\/\/projecteaina.cat\/tech\/#organization","name":"Projecte Aina Tech","url":"https:\/\/projecteaina.cat\/tech\/","logo":{"@type":"ImageObject","inLanguage":"ca","@id":"https:\/\/projecteaina.cat\/tech\/#\/schema\/logo\/image\/","url":"https:\/\/projecteaina.cat\/tech\/wp-content\/uploads\/2023\/11\/cropped-aina-home-logo.jpg","contentUrl":"https:\/\/projecteaina.cat\/tech\/wp-content\/uploads\/2023\/11\/cropped-aina-home-logo.jpg","width":512,"height":512,"caption":"Projecte Aina 
Tech"},"image":{"@id":"https:\/\/projecteaina.cat\/tech\/#\/schema\/logo\/image\/"},"sameAs":["https:\/\/x.com\/projecte_aina","https:\/\/www.linkedin.com\/company\/projecte-aina\/"]}]}},"_links":{"self":[{"href":"https:\/\/projecteaina.cat\/tech\/wp-json\/wp\/v2\/publicacions\/8442","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/projecteaina.cat\/tech\/wp-json\/wp\/v2\/publicacions"}],"about":[{"href":"https:\/\/projecteaina.cat\/tech\/wp-json\/wp\/v2\/types\/publicacions"}],"wp:attachment":[{"href":"https:\/\/projecteaina.cat\/tech\/wp-json\/wp\/v2\/media?parent=8442"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}