Python Encoder Preprocess.py

Phoneme-Level BERT for Enhanced Prosody of Text-to-Speech with Grapheme Predictions

Large-scale pre-trained language models have been shown to be helpful in improving the naturalness of text-to-speech (TTS) models by enabling them to produce more naturalistic prosodic patterns.

GitHub

rbybryan/EEG_fusion_encoding

Code for predicting EEG responses to natural images from vision and language model features, and for combining them in an early-fusion ridge encoding model. The pipeline covers the full path from the ...

Some results have been hidden because they may be inaccessible to you

Show inaccessible results

Phoneme-Level BERT for Enhanced Prosody of Text-to-Speech with Grapheme Predictions

rbybryan/EEG_fusion_encoding

Trending now