Is text-to-speech on Kindle 2 a threat to audiobooks?

By Tim Conneally | Published February 13, 2009, 12:57 PM

Banner: Viewpoint

Yesterday, the Author's Guild aired complaints about the Kindle 2's new onboard text-to-speech function from predictive text specialists Nuance Communications, warning that the function could eventually cut into the audiobook market.

The group said, "This presents a significant challenge to the publishing industry. Audiobooks surpassed $1 billion in sales in 2007; e-book sales are just a small fraction of that. While the audio quality of the Kindle 2, judging from Amazon's promotional materials, is best described as serviceable, it's far better than the text-to-speech audio of just a few years ago. We expect this software to improve rapidly."

A statement from the group asks authors to be aware of audio functionality clauses in contracts for e-book rights, saying "Publishers certainly could contractually prohibit Amazon from adding audio functionality to its e-books without authorization, and Amazon could comply by adding a software tag that would prohibit its machine from creating an audio version of a book unless Amazon has acquired the appropriate rights. Until this issue is worked out, Amazon may be undermining your audio market as it exploits your e-books."

The Author's Guild, however, does not take into account accessibility issues presented to the vision-impaired. It says in the statement that "even out-loud reading by a machine is fine, of course, if it's from an authorized audio copy." This means the visually handicapped are only allowed to consume literature preordained to be read aloud? That sounds like an issue for human rights advocates.

Furthermore, it is safe to say that no automated voice, no matter how indistinguishable from a human's, will ever replace the famous actors that are often hired to dictate audiobooks. As a ravenous consumer of audiobooks, and easily excited tech geek by trade, I would be the first to abandon the amateur recordings found on librivox.org for a slick automated voice. However, nothing could replace a well-chosen voice actor, like Peter Weller for a reading of Hermann Hesse's Steppenwolf, or comedian Steven Fry for the Harry Potter series, or when authors read their own works, adding an additional layer of depth to the text.

Text-to-speech is not a threat, it's an alternate route.

Comments

View comments by with a score of at least

Here is a link to an OP-Ed piece in the NYTimes. I put it out here not as an advocate of one side or the other but to elevate discussion on how we incorporate new technology into our lives and how we might find some middle ground between extreme views. I'll keep my views on the article and the person to myself.
http://www.nytimes.com/2...nion/25blount.html?_r=1

Score: 0

|

"And while were at it I want to complain that not only is text to speech a violation but I think a teacher should pay for every student that she has in a class for reading our stuff out loud. As well as parents should pay extra to read to there children!"

Greedy bas****s..... its the same thing.

Score: 0

|

A few million years ago another type of dinosaur happened to look up from his daily, tiny brained activities and chanced a look at the horizon. There he saw a bright flash of light that frightened him greatly! He whipped out his cell phone to call his lawyer... only to realize that the lawyers hadn't climbed down out of the trees yet. Shortly thereafter the dinosaur and all his friends became extinct. Enjoy.

Score: -1

|

This doesn't make any sense at all. Even if the Kindle 2 was a threat to audiobooks, so what. The VHS format was a threat to video reels, the color TV was a threat to B&W TV, MP3 players are a threat to CD players, and the list goes on and on. The automobile was a threat to manufacturers of horse drawn carriages. This is pointless. New things come along that are "threats" to the older items, for example the Kindle 2 and audiobooks. I hope they do try some sort of lawsuit and the judge laughs in their face for the complete stupidity of this argument. What would happen if this case was to win? There are enough lawsuits out there now. If this case won, it would be a field day for lawyers. Many companies would be suing each other simply because their new product threatened an old product. Development would be stymied because businesses would question what they could sell and not "threaten" some other product.

Score: 1

|

Might as well charge people extra if they want to read to there child while there at it. Boy, these greedy bas****s should charge teachers big bucks since they read to a classroom all the time. I HATE greed.

Score: 2

|

Sounds like they hate blind people.

Score: 1

|

I'm curious how the Author's Guild (TAG) defines "authorized audio copy". Text-to-speech is not a copy, just an alternate presentation of the material. Additionally, MS Reader, Adobe Acrobat, and other software based e-book readers have allowed computer users to have their e-books read via text-to-speech for quite some time now (glad I kept all my original e-books - no ridiculous DRM/greed restrictions). Admittedly the ability has been getting chipped away by TAG's greed (in the name of protecting the authors...puhleeze - any smart author will tell you they'd prefer to have their books read then passed on to friends than try to force every potential reader to buy a copy - book trading is what introduces readers to new authors they might not otherwise know about or buy). What about text-to-speech in Windows, OSX, Linux, etc.? Sounds like TAG wants DRM for TTS to go further than it already has. Yet another group intent on doing whatever it can to stifle innovation in order to protect its outdated, dinosauric, business model.

The audio-book market seems like a relatively narrow niche by my understanding and I don't see how a reader with text-to-speech capability is going to challenge that market. For example, I can't imagine people spending money on this reader so they can set it on the passenger seat in their car and listen to text in a computer voice of an e-book they purchased if an audio-book is already available to be used with the vehicle's stereo system (not to mention the absence of misreads and nonsense words).

"even out-loud reading by a machine is fine, of course, if it's from an authorized audio copy."

Paraphrased: "Vision impaired people are great...so long as we can exploit their disability for some extra profit."

Hmmm...TAG clearly had no concern for those with disabilities when crafting its position...but its willing to use the subject to re-emphasize the profit potential to be gained. Sounds like a plan born out of nothing but pure, desperate, greed.

This smacks of another industry looking for ways to lock their perceived customer base into a single, narrow-minded, one-size-fits-almost-nobody, profit-at-all-expenses option that will only appeal to a small slice of that narrow group that fits within the definition...as opposed to the potentially larger customer base that exists and is being told "you can no has". The economy may be hurting, but moves like this never have anything to do with protecting artists, etc. - it's all about greed. Face it TAG - you were outsmarted and out-innovated in your own business market, and technology is not going to stop moving forward just because you still can't get your VCR to stop flashing 12:00 - you're only fooling yourselves.

Score: 3

|
Below viewing threshold. Show

Sudden outbreak of common sense detected.

Initiate bans.

*shakes head*

Come on, Tim...you can do better than hyping some pointless gadget virtually no-one uses. What's the global market penetration of this thing? .001%?

Score: -4

|

@sjc001

Is it even possible for you to post in this forum as anything but my personal troll? I honestly do not recall the last time you made a post here that wasn't a reply and insult directed at me.

While i am honored you feel so strongly about me, I worry that your obsession may be unhealthy for one of your low mental acuity. Seek help. You need it.

Score: -2

|

sjc? I know you are trying but when you steal a line from a character from an old TV show (Friends), you should put in quotes as Fox did but keep trying. Have faith Grasshopper.

Everyone have a good day:)

Score: 3

|

PC? Question and I means this in a joking way not a Betanews put down way- Just out of curiosity, how much does your Personal Troll pay and what is the benefit package? Who knows I may want to "supplant" my income one day:)
Everyone have a good day:)

Score: 0

|

I agree with PC_Tool for once :)

Score: 0

|

Honestly, it will not be a threat to audio books for me. I will listen to just about any audio book if the story is compelling enough, but I prefer ones that have been produced really well, usually with more than just a lone narrator. Ones that have utilized a fairly large cast of voice actors in addition to soundtrack music and sound effects are typically my favorite (Big Finish Productions versions of Doctor Who come to mind).

Most of the time I'm content with a single narrator... and even the most monotonous voice in the world can usually express the emotion of the characters far better than any digitized voice ever could.

Score: 1

|

I haven't played with the Kindle 2 yet but if it's text to speech algorithm works like some of the software I have tried in the past it will have a hard time pronouncing some of the less common words properly let alone sounding natural.

Score: 0

|

I don't think it will be a threat to audio books. I for one think I prefer the voice of the actual writer of the bio and the emotion of the person reading the book really comes through, no computer voce could come close to that, in the near future.

Score: 2

|

it is not Stephen Fry but Jim Dale that reads the Harry Potter stories..

Score: 1

|

Jim Dale did the Harry Potter audiobooks in America, but Stephen Fry narrated Harry Potter audiobooks in the UK.

Score: 0

|

PDC 2009: What have we learned this week?

There was the freebie that no one will forget, the heebie-jeebies courtesy of Scott Guthrie, and a teensy bit clearer picture of how this cloud thingie should work.

Live report: Will Google Chrome OS change Linux?

The mysteries of just what Chrome OS is, and how much of an operating system it truly is, may be resolved today.

PDC 2009: Microsoft cares about Web browser performance

The effort to give users of the world's dominant Web browser the impression of quality, is a personal one for the man who leads that battle.

Nokia re-affirms its commitment to Symbian, sort of

Maemo won't necessarily be replacing Symbian in the Nokia N-Series, but that's definitely a place where it will be found.

E-book readers will be in short supply this holiday season

E-readers are hot this year, and a lot of compelling new products have been released, but are there enough electrophoretic displays to go around?

Sony looks to finally open a single storefront for downloads

Sony has had many different download portals for movies, music, e-books, and games, and now it's looking to make a single shop for all of it.

Tuning out the tablet: Time to give the endless speculation a rest

Wide Angle Zoom: Wishing and hoping and thinking and praying....won't put an iTablet on the market.

Five improvements for IT managers in 2010

If businesses are to improve their efficiency for next year, they need to stop and reassess the basic tenets of their job.

AOL's spinoff from Time Warner to shed 2,500 jobs

As AOL moves toward become an independent company again, it will cut nearly a third of its workforce.

Gartner: SMS-based money transfer will be bigger than mobile browsing, search

Gartner issues its predictions for the 10 things our phones will be doing in 2012.

Don't forget to upgrade to Firefox 3.6 beta 3 today

Mozilla has released the latest beta its Firefox 3.6 browser software, just over one week after beta 2.