Narrative: Amazon is Ahead in Voice

Each narrative page (like this) has a page describing and evaluating the narrative, followed by all the posts on the site tagged with that narrative. Scroll down beyond the introduction to see the posts.

Each post below is tagged with
  • Company/Division names
  • Topics
  • and
  • Narratives
  • as appropriate.
    Pictures of Amazon’s Reported Echo With a Screen Emerge Online (May 5, 2017)

    A tiny, low-res picture of what might be Amazon’s Echo with a screen emerged today, and leaker Evan Blass followed up with a much higher-resolution version later in the day. The device looks vaguely like an old fashioned portable TV set, with a screen above a speaker grille, and a fairly substantial body behind the two. As I’ve said before, this form factor makes a ton of sense for Amazon for a variety of reasons, but it rather undermines the idea that voice and not touch is the next user interface. There’s also a certain irony in the prospect of Amazon announcing an Echo with a screen while Apple announces an Echo competitor without one in the space of a few weeks, as is presently rumored. The reality is that standalone voice assistants fill a useful role, but most people will want their assistants and devices to span several categories, including those with both voice capability and screens.

    via The Verge and Evan Blass’s tweet

    Ecobee Launches Thermostat with Alexa (May 3, 2017)

    This content requires a subscription to Tech Narratives. Subscribe now by clicking on this link, or read more about subscriptions here.

    ★ Apple Siri Speaker Could Debut at WWDC in June (May 1, 2017)

    KGI, which as I’ve noted before has a decent track record on future Apple products, says there’s a 50/50 chance that Apple’s entry in the connected home speaker market could debut at WWDC next month. There’s scant detail in the report other than that Apple’s speaker will have better audio hardware than the Echo, which has been criticized as being sub-par as a speaker despite its effectiveness as a voice-activated assistant device. I would certainly expect such a device to combine Siri, AirPlay, HomeKit device control, and possibly some kind of WiFi connectivity, but it’s very unlikely Apple could do all that well and still make its usual margin at the $130-180 price point that the full Echo and Home devices sell for. It’s more likely this would be sold in the range of the larger Sonos speakers (which Apple has been selling in its stores for the last little while), which would mean $300-500. That puts it in a different category from what’s out there today, which wouldn’t be unusual for Apple but would put it well out of impulse buy territory for most people and limit sales quite a bit. One big question is whether Siri is yet good enough for such a speaker, and what upgrades Apple might have in store for Siri at WWDC this year to help it get there. As I’ve suggested in the past, Siri’s shortcomings are at least in part hardware-based: more often than not, the problem is wrongly interpreting what’s said because of the tiny mics being used for voice recognition, and a big device should help a great deal with that. But Siri can also be frustrating even when it does understand what you say, and its more conversational elements are still pretty limited, which could be a big shortcoming on a device without an alternative input mechanism. I’m sure Apple will have some other special sauce in mind so this isn’t just another Echo or Home but something a bit different. But there’s a good chance this ends up being yet another new product category for Apple which sells a few million a year and which critics therefore contend is a flop, while it quietly generates a decent amount of revenue and profit for Apple (see also the Apple Watch and AirPods).

    via 9to5Mac

    Alexa Gets Speech Synthesis Tools for Developers to Help it Sound More Human (May 1, 2017)

    Amazon is giving developers of Skills (apps) for Alexa new speech tools which should help them create interactions where the assistant sounds more human through the use of pauses, different intonation, and so forth. Amazon already uses these for Alexa’s first party capabilities, but third party developers haven’t had much control over how Alexa intones the responses in their Skills. This should be a useful additional developer tool for adding a bit more personality and value, but I wonder how many developers will bother – new platform tools like this are always a great test of how engaged developers are and how committed they are to creating the best possible experience rather than just testing something out. I’ve argued from the beginning that the absolute number of Skills available for Alexa (now at 12,000) is far less meaningful than the quality of those apps, and many of them are very basic or sub-par, likely from developers trying something out as a hobby without any meaningful commitment to sustaining or improving their apps. On the other hand, the smaller number of really serious apps for Alexa should benefit from these new tools.

    via TechCrunch

    Google Launches a Closed SDK for Google Assistant on Third Party Hardware (Apr 27, 2017)

    One of Amazon’s big advantages in building scale for its Alexa assistant has been its opening of the underlying platform to third party hardware vendors, and the resulting hardware was arguably the big story of CES this year. Google, by contrast, has only opened up its Assistant very slowly to third parties, instead favoring its own hardware for the first six months or so. That’s now starting to change as it not only makes the Assistant available in Android but now also starts opening up an SDK for third party hardware makers, albeit in a fairly closed fashion for now. One thing it’ll want to make sure of is that the resulting hardware meet some minimum standards, something Amazon has done very little to enforce.

    via Ars Technica

    Amazon Readying Launch of Echo Device with a Screen (Apr 27, 2017)

    This content requires a subscription to Tech Narratives. Subscribe now by clicking on this link, or read more about subscriptions here.

    ★ Amazon Announces Echo Look, Which Adds a Camera and Fashion Advice to Echo for $20 (Apr 26, 2017)

    Amazon has announced a new device in its Echo family called the Echo Look, which assumes a different form factor, adding a still and video camera to features of the standard device for $20 more. For now, the focus is fashion advice: the camera can take full-length photos or videos of the user, acting like a full-length mirror at a basic level but also offering fashion advice through machine learning tools trained by fashion experts. I say for now, because once you have a camera in an Echo device it could be used for many other things too – indeed, when reports and pictures of this device first surfaced people assumed it was a security camera, and there’s really no reason why it couldn’t be. And several of these devices together could be very useful for motion sensing and other tasks as part of a smart home system over time too. But Amazon’s also smart to start specializing the Echo a little, with a particular focus on women, as I would guess a majority of sales of Echo devices to date have gone to men. I’d bet we’ll see other more specialized devices in time, but also other uses for this camera as it gets software updates. And this also starts to get at a real business model for Echo, which so far hasn’t done much to boost e-commerce sales but could now drive clothing revenue through sales of both third party apparel and Amazon’s own growing line. And what Amazon learns from the Look and its associated app can be fed back into the core Amazon.com clothes shopping experience too, improving recommendations in the process. But of course all this comes with downsides: not only do you have a device in your home that’s always listening, but you now have a device with a camera, which could feasibly be hacked remotely to take pictures or video of you. And Amazon will store the images it captures indefinitely, creating a further possible source of problems down the line.

    via The Verge

    Amazon Bans Almost All Ads in Alexa’s Third Party ‘Skills’ Apps (Apr 21, 2017)

    This content requires a subscription to Tech Narratives. Subscribe now by clicking on this link, or read more about subscriptions here.

    ★ Google Home Now Recognizes Multiple Users by Voice (Apr 20, 2017)

    This has been a long time coming – in fact, in just a few weeks it’ll be a year since Google debuted Home at its I/O developer conference and implied that it would have multi-user support, though of course it was missing when the device actually launched in the fall. And that’s been a big limitation of a device that’s supposed to get to know you as an individual. So the fact that Google Home now recognizes distinct users by voice is a big deal, and an important differentiator over Amazon Echo. I’ve just tried it with my unit and although it set up accounts for me and my daughter without problems the app conked out when I tried to add my wife, so the results are mixed (I suspect it may be because my wife’s account is a Google Apps account). It does recognize the two voices we set up and will now serve us up different responses, which is great. One big limitation, though, is that each user has to have a Google account and has to download the Google Home app onto their phone, which means it won’t recognize little kids who don’t have Google accounts. And given that it’s using voice recognition rather than, say, different trigger phrases, I can’t set up separate personal and work accounts. But for those who can use it, the Home will now be a much more useful device, serving up calendar information, music preferences and so on on an individualized basis rather than trying everyone in a home as the same person.

    via Google

    ★ Amazon Scales Alexa Back-End by Opening Lex Voice and Text Service to All Developers (Apr 19, 2017)

    So much of the focus of coverage of voice assistants and interfaces is on the dedicated consumer products which use them, and that’s natural: these are the most visible and measurable signs of a company’s success or failure in this space. And yet the scale of those dedicated voice product is still very small relative to smartphones, which carry their own voice assistants. And scale is vital if these products are to improve, because they require lots and lots of training to get better, and so the more users there are training them, the better they become. As such, I suspect the next phase of competition in this space is going to be about developer voice platforms at least as much as it is about first-party hardware and software, and we’re starting to see signs of this from the big companies in the space, including Google and Amazon. Today, Amazon announced that Lex, which is a back-end service that combines many of the technologies behind Alexa, is opening up to all developers. But critically, this isn’t just a voice platform – it supports text and voice processing, which means that many of the developers might use it in chat bots or other similar environments that have nothing to do with voice but still help train Amazon’s natural language processing tools. Google is doing similar things with its own voice processing technology, but it’s doubtful whether Apple will ever open its voice tools up in the same way. That’s not a huge deal, because it has massive scale in voice on smartphones alone, but it may make a bigger difference over time as these other platforms benefit not only from growing first party scale but increasing third party adoption and use too.

    via Amazon