Interesting; I was thinking about creating something like that a few years ago - since I love listening to information a lot while doing some chores/walking - but back then, all available text-to-speech converters were unbearably robotic.
How much time does it take to convert a book/doc into audio using your approach? Also, as I understood it all runs locally, so you don't need to pay for any API access/usage?
Interesting; I was thinking about creating something like that a few years ago - since I love listening to information a lot while doing some chores/walking - but back then, all available text-to-speech converters were unbearably robotic.
How much time does it take to convert a book/doc into audio using your approach? Also, as I understood it all runs locally, so you don't need to pay for any API access/usage?
on an nvidia rtx 2060 mobile about half a day for a medium sized novel. Chatterbox TTS is really emotive, sometimes too much so.