Sunday, February 26, 2023

Apple AI Audiobook Narrators

 

Audiobooks are nerve-wracking to produce by yourself. Sure, you can offer a shared royalty route with the narrator and hope to find a good one. But a narrator search can be difficult, frustrating, and time-consuming. Not to mention, the entire production process is  daunting and expensive. Where to begin?

According to Apple, not with a human. Apple started production of digital audiobooks using artificial intelligence, specifically two AI narrators named Madison, a female soprano, and Jackson, a baritone. The platform uses advanced speech synthesis technology combined with input from linguists, quality control specialists, and audio engineers. Currently Madison and Jackson are used for fiction and romance only. In the near future, Apple will have two additional digital voices, Helena and Mitchell, for nonfiction and self-development audiobooks.

 If you’d like to hear a sample of Madison click here.

 According to Apple, these are the benefits:

·        The audiobooks will be easy to produce and delivered via preferred partners. The original ebook must be created in either Draft2Digital or Ingram CoreSource.

·        Audiobooks have wholesale price limits. (The website isn’t clear on what they’ll be.)

·        Distribution will be solely via Apple Books and to public/academic libraries.

·        Publisher/author retains audiobook rights, and there are no restrictions on producing and distributing other versions of the audiobook.

How do you make an Apple audiobook?

The ebook must be created with either Draft2Digital or Ingram CoreSouce. Then the author selects the title. Apple has a review process and acceptance isn’t guaranteed. The general requirements are as follows:

·        Ebook must be available on Apple Books.

·        Author must own the audio production rights.

·        Primary category must be romance or fiction. The only subcategories currently accepted are literary, historical, or women’s fiction.

·        Book must be in English.

How good are AI narrators?

Frankly, Madison is fine. A lousy narrator can ruin a good audiobook and I stop listening when any set my teeth on edge. Madison has a pleasant voice with good tone, if a trifle unemotional. If you listen to her, you understand why Apple limits the AI to audiobooks without an exciting chase scene or hilarious denouement. An AI can’t make handle extreme ranges of emotion. Other things AIs can’t do:

·        A multitude of characters in the same book. Their voice is typical of a national newscaster; nothing to determine a regional accents. Nor can an AI handle a stutter, quirky word pronunciations, foreign words and phrases, or fictional words. (Sorry, Samuel Clemens, J. R. R. Tolkien, and me. Our work doesn't qualify.)

·        Difference in ages. All the characters sound roughly the same age.

·        Difference in genders. Male and female characters speak with a similar tone.

 So what’s the big problem with AI?

Even if an AI sounds okay, I have issues. A top-drawer narrator makes an audiobook memorable, and AI has a long way to go to reach that level. Audiobooks are also an art, and frankly, I’m not crazy about the idea of a soulless digital character taking a job from a human who spent years honing a craft. Not to mention, a human narrator adds emotional nuance to every page, something an AI can’t do. AI narrators are still few and far between and until I have no choice, I’ll opt for a human every time.




 

 

 

No comments:

Post a Comment