View Full Version : Voice command



philippe_44
2015-08-10, 07:44
Does anybody have plans to integrate things like Echo or Cortana into the LMS system? I have an Echo and I can easily see how I could use it for some controls :-)

EricBergan
2015-08-10, 11:53
Personally, I'm hoping Amazon (or Logitech, or some 3rd party) does an integration between Echo and Harmony remotes. Then you could use voice controls for managing the whole AV system...

Julf
2015-08-10, 12:30
Personally, I'm hoping Amazon (or Logitech, or some 3rd party) does an integration between Echo and Harmony remotes. Then you could use voice controls for managing the whole AV system...

"Turn down the volume. I said TURN DOWN THE VOLUME!!"

epoch1970
2015-08-10, 14:16
Does anybody have plans to integrate things like Echo or Cortana into the LMS system? I have an Echo and I can easily see how I could use it for some controls :-)

I praise your creativity.
Your discussion on a 3D printed case for picore (other thread, earlier) made me think. I thought of a player/server that would include an iThing dock. Seemed too ambitious.
However I went on fantasizing about a player/server that would include Bluetooth "beacon" tech (to get iPeng and others to automatically select the closest player). After a brief search I found this project (https://github.com/etwmc/PersonalHomeKit).

It appears Siri is in the realm of Homekit, as is iBeacon... Please don't take my word for it and check by yourself if you care. All of this is way over my head.

On a personal note, I don't believe at all in voice command, but a lot in presence detection.

Anyways. Always great to see new ideas and initiatives.

philippe_44
2015-08-10, 14:45
"Turn down the volume. I said TURN DOWN THE VOLUME!!"

Yep ... I've been working with VR (voice recognition) for the last 25 years and it drove me nuts, especially because every time, there was supposed to be *the* algorithm that would make it fully usable. But I have to admit that after using Echo for a while, it is surprisingly good even with a lot of background noise. I've not paid a lot of attention, but it seems that neural networks are now working thanks to the insane increase in CPU power. I used to work on that topic in the mid 80's and although it was fun, computers at that time were clearly lacking in performance.

pippin
2015-08-10, 15:19
Oh, voice recognition actually works and it has indeed already worked pretty well since the late 90s.
The biggest issue is that you need a huge semantic database to compare against if you want to analyze more complex phrases. That used to be a problem for mobile solutions, but now that they all just send some meta-analyzed voice (or, in the case of Echo, even the raw recordings) back to some server farm and do all the work there, it's obviously more feasible, especially since you no longer have to replicate that database to all devices.

The problem with VR is that it annoys everyone around you if you use it and speech is also pretty slow for a lot of tasks, I think that's more of a limitation than technology.

And then... call me old fashioned, but I don't want to have the NSA run their bots over every piece of sound generated in my home, recorded by an array mike that even detects the faintest whisper in the most remote corner...

philippe_44
2015-08-10, 18:31
Oh, voice recognition actually works and it has indeed already worked pretty well since the late 90s.
Multiple speakers, accents, noisy environments and text-to-recognition were still a big challenge until very recently (and still are, a bit), and nothing irritates people more than having to repeat the same thing a few times, only to realize that something else was recognized.


The biggest issue is that you need a huge semantic database to compare against if you want to analyze more complex phrases. That used to be a problem for mobile solutions, but now that they all just send some meta-analyzed voice (or, in the case of Echo, even the raw recordings) back to some server farm and do all the work there, it's obviously more feasible, especially since you no longer have to replicate that database to all devices.

that's right ...


The problem with VR is that it annoys everyone around you if you use it and speech is also pretty slow for a lot of tasks, I think that's more of a limitation than technology.

A bit, but that is mostly true if you have to speak loud & slow - if you can use a normal tone, the annoyance factor largely goes away. Using voice as a UI can be slow in some cases, but there are many many simple tasks where it is very convenient and where using any other sort of input system is slow & painful - a sort of sweet spot for VR


And then... call me old fashioned, but I don't want to have the NSA run their bots over every piece of sound generated in my home, recorded by an array mike that even detects the faintest whisper in the most remote corner...
I understand that although I'm afraid anything we do is already scanned, unfortunately. I've given up hope on that

philippe_44
2015-08-10, 18:36
I praise your creativity.
Your discussion on a 3D printed case for picore (other thread, earlier) made me think. I thought of a player/server that would include an iThing dock. Seemed too ambitious.
However I went on fantasizing about a player/server that would include Bluetooth "beacon" tech (to get iPeng and others to automatically select the closest player). After a brief search I found this project (https://github.com/etwmc/PersonalHomeKit).

It appears Siri is in the realm of Homekit, as is iBeacon... Please don't take my word for it and check by yourself if you care. All of this is way over my head.

On a personal note, I don't believe at all in voice command, but a lot in presence detection.

Anyways. Always great to see new ideas and initiatives.

I agree that presence detection is a really nice thing and I would also gladly welcome that on my LMS system. I'm not supporting the idea of full voice command: as pointed out by pippin, it is much less efficient than haptic technology in many cases, but there are simple commands for which it is much faster. I also think that a multi-input UI is the best way to go, taking advantage of the most efficient mode for each situation.

pippin
2015-08-10, 18:55
if you can use a normal tone, the annoyance factor largely goes away.

I still feel it's awkward to start talking to a machine to control it when you are not alone. It can disturb others or raise "did you talk to me?" kind of confusion etc. I feel it's primarily for situations where you are alone.
Here in Berlin's tech community I'm recently seeing quite a number of people using Siri on their Apple Watch in public and I can't help it, I'm annoyed by someone adding a reminder next to me. It's a bit like talking on the phone in public, a pest that fortunately got quite significantly reduced in recent years due to the fact that people now tend to use messaging instead...



there are many many simple tasks where it is very convenient and where using any other sort of input system is slow & painful - a sort of sweet spot for VR

Actually I believe it's primarily _complex_ tasks where it shines. Simple tasks like changing volume are much easier done by a physical controller, especially given latency and need to repeat the command.
But something more complex like a search which requires several steps when done through a haptic UI can get much faster with voice because you can phrase more complex commands in a sentence than a simplified UX will usually directly allow.



I understand that although I'm afraid anything we do is already scanned, unfortunately. I've given up hope on that
Oh, I'm pretty sure there are no microphones for mass surveillance in my home right now (never can be sure about _individual_ surveillance but that's a whole different topic).
And saying "I don't care" just makes it too easy for them. Companies like Apple and recently to a certain degree also Google are trying hard to fight back on the privacy front, let's not spoil all that by not caring about even more severe intrusions.
Phones and Watches at least only have near-field microphones, Echo OTOH...

philippe_44
2015-08-10, 19:07
Actually I believe it's primarily _complex_ tasks where it shines. Simple tasks like changing volume are much easier done by a physical controller, especially given latency and need to repeat the command.
But something more complex like a search which requires several steps when done through a haptic UI can get much faster with voice because you can phrase more complex commands in a sentence than a simplified UX will usually directly allow.


Fair point, but don't you think that it can be very efficient for simple tasks, because these require you to find your input device and then use your hands for a simple tap, where voice would achieve the same result while mobilizing far fewer "resources"? I feel the efficiency of voice as a UI does not vary linearly with the complexity of the task. It is very situational, though. If I know what I want to listen to, VR is great. If I don't know exactly, vision + touch helps to take in a larger amount of information and make a choice much more efficiently, no?

pippin
2015-08-10, 19:14
Yea, sure, if I accept to not have an input device for voice but to allow it to listen to my whole home, that's a case, too.
I know what you mean because it's actually the only reason I sometimes wear my Apple Watch at home: to be able to quickly access basic player controls without having to grab for a device.

Still don't use Siri on it, for some reason it's always set to the wrong language. Actually that's indeed the biggest unsolved issue for me: detecting languages. I switch between English and German all the time when writing and my devices are usually all set to English, but then if you e.g. want to search for a navigation destination in Berlin it simply doesn't recognize it because the places have German names.... Same the other way around for music: if I set the language to German but then want to search for an album with an English name, you first have to go figure out how Siri feels that name might be pronounced in German....

philippe_44
2015-08-10, 20:31
Yea, sure, if I accept to not have an input device for voice but to allow it to listen to my whole home, that's a case, too.
I know what you mean because it's actually the only reason I sometimes wear my Apple Watch at home: to be able to quickly access basic player controls without having to grab for a device.

Well, I haven't bought an Apple Watch yet, but the main reason I might one day is to be able to use iPeng :)


Still don't use Siri on it, for some reason it's always set to the wrong language. Actually that's indeed the biggest unsolved issue for me: detecting languages. I switch between English and German all the time when writing and my devices are usually all set to English, but then if you e.g. want to search for a navigation destination in Berlin it simply doesn't recognize it because the places have German names.... Same the other way around for music: if I set the language to German but then want to search for an album with an English name, you first have to go figure out how Siri feels that name might be pronounced in German....
I know exactly what you mean, I have the same problem, with French obviously, and this is really irritating

spedinfargo
2015-08-12, 07:06
I started a thread a couple years ago and got yelled at by naysayers ;-)

http://forums.slimdevices.com/showthread.php?79734-Voice-Command-for-Squeezebox-Still-Not-Much-Interest&highlight=voice

The key that I think is important is to make it non-comprehensive. Make it SIMPLE:

It doesn't even have to be too complicated - I'm looking for commands
like: "Playlist Shane Favorites Random", "Skip Track", "Mute All",
"Sync All Players", "Play Artist Pink Floyd", "Play Album Dark Side of
the Moon", etc.

Sure there will be lots of easy ways to "trip it up", but being able to say something like "play favorite spa music" would even be helpful...
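
A minimal sketch of that kind of deliberately small command grammar, in Python. The phrase patterns are taken from the example commands above; the intent names and the `parse_command` helper are made up for illustration and are not part of LMS or of any Alexa API. Anything outside the fixed patterns is simply rejected, which is the "easy to trip up" trade-off mentioned above.

```python
import re
from typing import Optional

# Hypothetical phrase patterns, one per example command from the post.
# Named groups capture the free-text arguments.
PATTERNS = [
    (re.compile(r"skip track"), "skip_track"),
    (re.compile(r"mute all"), "mute_all"),
    (re.compile(r"sync all players"), "sync_all"),
    (re.compile(r"play artist (?P<artist>.+)"), "play_artist"),
    (re.compile(r"play album (?P<album>.+)"), "play_album"),
    (re.compile(r"playlist (?P<name>.+?)(?: (?P<mode>random))?"), "play_playlist"),
]


def parse_command(phrase: str) -> Optional[dict]:
    """Map a recognized phrase to an intent dict, or None if nothing matches."""
    # Normalize case and whitespace so "Skip  Track" and "skip track" are equal.
    text = " ".join(phrase.lower().split())
    for pattern, intent in PATTERNS:
        match = pattern.fullmatch(text)
        if match:
            # Drop optional groups that did not take part in the match.
            args = {k: v for k, v in match.groupdict().items() if v is not None}
            return {"intent": intent, **args}
    return None
```

Turning an intent into an actual player command (and deciding what "all players" means) is left out on purpose; that part depends on the server setup.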

pippin
2015-08-12, 08:02
"Sync all players" is a wonderful example for something that doesn't work. Plenty of unanswered questions in that command....

philippe_44
2015-08-17, 21:21
So I started to look a tiny bit at that. It seems that you need to add an Alexa Skill https://developer.amazon.com/public/solutions/alexa/alexa-skills-kit/getting-started-guide. Some have done that already for Sonos https://www.youtube.com/watch?v=AlOBJJyulI4. Interesting, I might try my luck with this later :) although it seems like a lot of work, and I would have preferred more local processing of the request. I'll look at how hue/wemo are integrated: it seems to be a simple UPnP discovery request that, when answered properly, adds some basic functions, with Echo simply invoking the returned URLs when preset actions are requested later.
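
For reference, a rough Python sketch of the generic UPnP (SSDP) discovery exchange described above: recognize an M-SEARCH request and build the unicast reply that points the searcher at a description URL. Only the standard SSDP message layout is shown. Which search target the Echo actually sends, and what it expects to find behind the `LOCATION` URL for a hue or wemo device, is not covered in this thread, so the values in the usage note are placeholders, and the helper names and `SERVER` string are invented for this sketch.

```python
from typing import Optional


def parse_msearch(datagram: bytes) -> Optional[dict]:
    """Return the headers of an SSDP M-SEARCH request, or None for anything else."""
    lines = datagram.decode("utf-8", errors="replace").split("\r\n")
    if not lines[0].upper().startswith("M-SEARCH"):
        return None
    headers = {}
    for line in lines[1:]:
        if ":" in line:
            # Split on the first colon only, so "HOST: 239.255.255.250:1900" survives.
            name, value = line.split(":", 1)
            headers[name.strip().upper()] = value.strip()
    return headers


def build_response(search_target: str, location: str, usn: str) -> bytes:
    """Build the HTTP-over-UDP reply that advertises a device description URL."""
    lines = [
        "HTTP/1.1 200 OK",
        "CACHE-CONTROL: max-age=1800",
        "EXT:",
        f"LOCATION: {location}",  # URL the searcher fetches next
        "SERVER: Python/3 UPnP/1.0 lms-voice-sketch/0.1",  # placeholder product string
        f"ST: {search_target}",   # echo back what was searched for
        f"USN: {usn}",
        "",
        "",
    ]
    return "\r\n".join(lines).encode("utf-8")
```

In a real responder these two functions would sit behind a UDP socket joined to the multicast group 239.255.255.250 on port 1900, for example calling `build_response(headers["ST"], "http://192.168.1.10:8080/description.xml", "uuid:placeholder")` with an address and UUID of your own.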

netchord
2015-08-18, 13:30
i wonder if this might be done using IFTTT? i have an Echo as well, and am really intrigued by all the things i might do with it. it's a completely frictionless interface, even more so than Siri, which requires a button push- and the possibilities are fascinating.

Alexa, tell Logitech to revive the Squeeze Platform!

philippe_44
2015-08-18, 23:31
i wonder if this might be done using IFTTT? i have an Echo as well, and am really intrigued by all the things i might do with it. it's a completely frictionless interface, even more so than Siri, which requires a button push- and the possibilities are fascinating.

Alexa, tell Logitech to revive the Squeeze Platform!

IFTTT would probably work as well, but I think the Amazon 'native' solution might be simpler. I will definitely give it a try, at least faking a hue system like some others did, but not in the coming weeks.

spedinfargo
2015-09-09, 14:29
Would like to see it as a native Alexa "skill" as well. Definitely could do some cool things with IFTTT, probably similar to what this guy did by hooking up Squeezebox as a "device" in his Vera: http://www.myzwave.net/index.php/adding-voice-control-to-vera-z-wave-systems-using-amazon-echo/

Very basic functionality that way... would be nice if you could pull in some information from your Squeezebox metadata into the Alexa grammar file. I.e., she would know the names of your players, maybe all of the artists and album titles (songs might be too much), favorites, playlist names, etc.

Even better would be the ability to do some free search on Spotify or Rhapsody and have her pull in results and send them to the Squeezebox. Or maybe seed a Pandora artist station... possibilities are endless really.
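
A hedged Python sketch of the metadata idea above: turn a player list and some playlist names into word lists that a voice grammar could use. It assumes the LMS JSON-RPC interface as I understand it (a `slim.request` call posted to the server's `/jsonrpc.js`, with the `players` query answering with a `players_loop` array of `name`/`playerid` entries), so check that against your own server before relying on it. The slot names `PLAYER_NAME` and `PLAYLIST_NAME` and both helpers are invented for illustration, and nothing here talks to Alexa.

```python
import json


def players_request(start: int = 0, count: int = 100) -> str:
    """JSON body for the server-level "players" query (assumed LMS JSON-RPC shape)."""
    payload = {
        "id": 1,
        "method": "slim.request",
        # Empty player id = server-level query; then the command and its paging args.
        "params": ["", ["players", str(start), str(count)]],
    }
    return json.dumps(payload)


def build_slots(players_reply: dict, playlist_names: list) -> dict:
    """Collect normalized vocabulary lists from a players reply and playlist names."""
    loop = players_reply.get("result", {}).get("players_loop", [])
    players = {entry["name"].strip().lower() for entry in loop if entry.get("name")}
    playlists = {name.strip().lower() for name in playlist_names if name.strip()}
    # Sorted, de-duplicated lists keep the generated grammar stable between runs.
    return {
        "PLAYER_NAME": sorted(players),
        "PLAYLIST_NAME": sorted(playlists),
    }
```

Artists and albums could be added the same way, though as noted above a full song list is probably too much vocabulary.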

hawsey
2016-01-12, 15:20
Hi, I am new on here :-) Has anyone tried using an automation program called Vox Commando with a Squeeze device?
There are some good videos showing it working on YouTube.

I know it works on Kodi and was wondering if Kodi plus XSqueeze would work.
Vox Commando can also do full home automation, I believe. It can use multiple devices as microphones: Kinect, Android app on mobile, remote control with mic, etc.
I would be really interested to know if anyone had tried this sort of thing.
Thanks

Sent by SlimKat FirePhone

FredFredrickson
2016-01-12, 20:26
Don't know if you saw it, but I integrated mine with Amazon Echo:

http://forums.slimdevices.com/showthread.php?104890-Amazon-Echo-Alexa-integration-how-to-with-IFTTT

DJanGo
2016-01-13, 00:53
"Turn down the volume. I said TURN DOWN THE VOLUME!!"

Julf:
Already done (https://jasperproject.github.io/)

Julf
2016-01-13, 05:16
Julf:Already done (https://jasperproject.github.io/)

I guess the development was funded by NSA. :)

"Jasper is always on, always listening for commands, and you can speak from meters away."

hawsey
2016-01-13, 07:26
That Jasper thingy looks cool too and I like the way it uses a Pi, I wasn't aware of that :-)

I love the idea of the Echo as well. I'm in the UK so would have to ship one over.

The Vox Commando software has been on the go for quite a few years now. I tried it maybe 4 years ago and it worked great with XBMC back then.
I might give this a go myself at some point. I have an Acer Revo 3610 in the kitchen on the back of a monitor that I could load LMS and Vox Commando on, and then I have O2 Jogglers
https://en.wikipedia.org/wiki/O2_Joggler
(one on top of a mini Sony hi-fi in the kitchen, one on top of a Technics in the bedroom and one on top of a Teac amp in the summer house).
I would love to get them all synced up playing at the same time with voice control too, just need the time to set it all up :-)




Sent by SlimKat FirePhone