[18:35:36][D][voice_assistant:422]: Desired state set to STREAMING_MICROPHONE
[18:35:36][D][voice_assistant:523]: Event Type: 1
[18:35:36][D][voice_assistant:526]: Assist Pipeline running
[18:35:36][D][voice_assistant:523]: Event Type: 9
[18:35:39][D][voice_assistant:523]: Event Type: 10
[18:35:39][D][voice_assistant:532]: Wake word detected
[18:35:39][D][switch:012]: 'LedIO13' Turning ON.
[18:35:39][D][switch:055]: 'LedIO13': Sending state ON
[18:35:39][D][voice_assistant:523]: Event Type: 3
[18:35:39][D][voice_assistant:537]: STT started
[18:35:40][D][voice_assistant:523]: Event Type: 11
[18:35:40][D][voice_assistant:677]: Starting STT by VAD
[18:35:42][D][voice_assistant:523]: Event Type: 12
[18:35:42][D][voice_assistant:681]: STT by VAD end
[18:35:42][D][voice_assistant:416]: State changed from STREAMING_MICROPHONE to STOP_MICROPHONE
[18:35:42][D][voice_assistant:422]: Desired state set to AWAITING_RESPONSE
[18:35:42][D][voice_assistant:416]: State changed from STOP_MICROPHONE to STOPPING_MICROPHONE
[18:35:42][D][voice_assistant:416]: State changed from STOPPING_MICROPHONE to AWAITING_RESPONSE
[18:35:43][D][voice_assistant:523]: Event Type: 4
[18:35:43][D][voice_assistant:551]: Speech recognised as: "allumer la lumière marine"
[18:35:43][D][voice_assistant:523]: Event Type: 5
[18:35:43][D][voice_assistant:556]: Intent started
[18:35:43][D][voice_assistant:523]: Event Type: 6
[18:35:43][D][voice_assistant:523]: Event Type: 7
[18:35:43][D][voice_assistant:579]: Response: "Allumé"
[18:35:43][D][voice_assistant:523]: Event Type: 8
[18:35:43][D][voice_assistant:599]: Response URL: "http://192.168.1.32:8123/api/tts_proxy/a54a861193bf0fc5d5dc1b9f543d744a47d3ba31_fr-fr_6c2e43c6c1_google_translate.mp3"
[18:35:43][D][voice_assistant:416]: State changed from AWAITING_RESPONSE to IDLE
[18:35:43][D][voice_assistant:422]: Desired state set to IDLE
[18:35:43][D][voice_assistant:416]: State changed from IDLE to START_PIPELINE
[18:35:43][D][voice_assistant:422]: Desired state set to START_MICROPHONE
[18:35:43][D][voice_assistant:523]: Event Type: 2
[18:35:43][D][voice_assistant:613]: Assist Pipeline ended
[18:35:43][D][voice_assistant:118]: microphone not running
[18:35:43][D][voice_assistant:202]: Requesting start...
[18:35:43][D][voice_assistant:416]: State changed from START_PIPELINE to STARTING_PIPELINE
[18:35:43][D][switch:016]: 'LedIO13' Turning OFF.
[18:35:43][D][switch:055]: 'LedIO13': Sending state OFF
[18:35:43][D][media_player:059]: 'media_sat3' - Setting
[18:35:43][D][media_player:066]: Media URL: http://192.168.1.32:8123/api/tts_proxy/a54a861193bf0fc5d5dc1b9f543d744a47d3ba31_fr-fr_6c2e43c6c1_google_translate.mp3
[18:35:43][D][voice_assistant:118]: microphone not running
[18:35:44][W][component:214]: Component i2s_audio.media_player took a long time for an operation (0.52 s).
[18:35:44][W][component:215]: Components should block for at most 20-30ms.
[18:35:44][D][voice_assistant:437]: Client started, streaming microphone
J’ai testé sans succès avec le Max98357A qui fonctionne sur mon autre montage.
Il y a eu pas mal de changement .
On peut maintenant indiquer un media player comme speaker directement dans la partie voice_assistant du yaml esp. Plus besoin de la « bricole » d’avant.
En exemple je t’invite à regarder ce code :
[13:39:30][D][voice_assistant:551]: Speech recognised as: "allumer la lumière marine"
[13:39:30][D][voice_assistant:523]: Event Type: 5
[13:39:30][D][voice_assistant:556]: Intent started
[13:39:30][D][voice_assistant:523]: Event Type: 6
[13:39:30][D][voice_assistant:523]: Event Type: 7
[13:39:30][D][voice_assistant:579]: Response: "Allumé"
[13:39:30][D][voice_assistant:523]: Event Type: 8
[13:39:30][D][voice_assistant:599]: Response URL: "http://192.168.1.32:8123/api/tts_proxy/a54a861193bf0fc5d5dc1b9f543d744a47d3ba31_fr-fr_6c2e43c6c1_google_translate.mp3"
[13:39:30][D][voice_assistant:416]: State changed from AWAITING_RESPONSE to STREAMING_RESPONSE
[13:39:30][D][voice_assistant:422]: Desired state set to STREAMING_RESPONSE
[13:39:30][D][media_player:059]: 'Media sat3' - Setting
[13:39:30][D][media_player:066]: Media URL: http://192.168.1.32:8123/api/tts_proxy/a54a861193bf0fc5d5dc1b9f543d744a47d3ba31_fr-fr_6c2e43c6c1_google_translate.mp3
[13:39:30][D][media_player:059]: 'Media sat3' - Setting
[13:39:30][D][media_player:066]: Media URL: http://192.168.1.32:8123/api/tts_proxy/a54a861193bf0fc5d5dc1b9f543d744a47d3ba31_fr-fr_6c2e43c6c1_google_translate.mp3
[13:39:30][D][voice_assistant:523]: Event Type: 2
[13:39:30][D][voice_assistant:613]: Assist Pipeline ended
[13:39:31][W][component:214]: Component i2s_audio.media_player took a long time for an operation (0.54 s).
[13:39:31][W][component:215]: Components should block for at most 20-30ms.
[13:39:31][D][switch:016]: 'LedIO13' Turning OFF.
[13:39:31][D][switch:055]: 'LedIO13': Sending state OFF
[13:39:31][D][light:036]: 'My Light' Setting:
[13:39:31][D][light:047]: State: OFF
Pas eu le temps @Krull56 , retour au boulot…
Je testerai ce soir mais c’est surprenant quand même que sur l’autre ESP ça fonctionne et pas sur celui là
Je vois bien le media_player dans les appareils mais impossible par là aussi de le faire lire une phrase.
Bob
Je n’ai pas ce modèle sous la main, mais dans le cadre du contest j’ai vu passer des infos sur le forum anglophone .
pas de media_player mais le speaker « classique » dans le yaml qui à l’air de fonctionner.
Les logs (le fichier généré est un .wav maintenant.
[13:20:42][D][voice_assistant:532]: Wake word detected
[13:20:42][D][switch:012]: 'LedIO13' Turning ON.
[13:20:42][D][switch:055]: 'LedIO13': Sending state ON
[13:20:42][D][light:036]: 'My Light' Setting:
[13:20:42][D][light:047]: State: ON
[13:20:42][D][light:051]: Brightness: 60%
[13:20:42][D][light:059]: Red: 0%, Green: 100%, Blue: 0%
[13:20:42][D][light:109]: Effect: 'Pulse'
[13:20:42][D][voice_assistant:523]: Event Type: 3
[13:20:42][D][voice_assistant:537]: STT started
[13:20:43][D][voice_assistant:523]: Event Type: 11
[13:20:43][D][voice_assistant:677]: Starting STT by VAD
[13:20:44][D][voice_assistant:523]: Event Type: 12
[13:20:44][D][voice_assistant:681]: STT by VAD end
[13:20:44][D][voice_assistant:416]: State changed from STREAMING_MICROPHONE to STOP_MICROPHONE
[13:20:44][D][voice_assistant:422]: Desired state set to AWAITING_RESPONSE
[13:20:44][D][voice_assistant:416]: State changed from STOP_MICROPHONE to STOPPING_MICROPHONE
[13:20:44][D][voice_assistant:416]: State changed from STOPPING_MICROPHONE to AWAITING_RESPONSE
[13:20:46][D][voice_assistant:523]: Event Type: 4
[13:20:46][D][voice_assistant:551]: Speech recognised as: "éteindre la lumière marine"
[13:20:46][D][voice_assistant:523]: Event Type: 5
[13:20:46][D][voice_assistant:556]: Intent started
[13:20:46][D][voice_assistant:523]: Event Type: 6
[13:20:46][D][voice_assistant:523]: Event Type: 7
[13:20:46][D][voice_assistant:579]: Response: "Éteint"
[13:20:46][D][voice_assistant:523]: Event Type: 8
[13:20:46][D][voice_assistant:599]: Response URL: "http://192.168.1.32:8123/api/tts_proxy/bd80c857a9f4a7d75384c8df252b26f9f7fde3a3_fr-fr_2c82848529_google_translate.wav"
[13:20:46][D][voice_assistant:416]: State changed from AWAITING_RESPONSE to STREAMING_RESPONSE
[13:20:46][D][voice_assistant:422]: Desired state set to STREAMING_RESPONSE
[13:20:46][D][voice_assistant:523]: Event Type: 2
[13:20:46][D][voice_assistant:613]: Assist Pipeline ended
[13:20:46][D][i2s_audio.speaker:161]: Starting I2S Audio Speaker
[13:20:46][D][switch:016]: 'LedIO13' Turning OFF.
[13:20:46][D][switch:055]: 'LedIO13': Sending state OFF
[13:20:46][D][light:036]: 'My Light' Setting:
[13:20:46][D][light:047]: State: OFF
[13:20:46][D][light:085]: Transition length: 1.0s
[13:20:46][D][light:091]: Effect: 'None'
[13:20:46][D][voice_assistant:523]: Event Type: 98
[13:20:46][D][voice_assistant:664]: TTS stream start
[13:20:46][D][i2s_audio.speaker:164]: Started I2S Audio Speaker
[13:20:47][D][voice_assistant:351]: Speaker buffer full, trying again next loop
[13:20:47][D][voice_assistant:351]: Speaker buffer full, trying again next loop
[13:20:47][D][voice_assistant:351]: Speaker buffer full, trying again next loop
[13:20:47][D][voice_assistant:351]: Speaker buffer full, trying again next loop
[13:20:47][D][voice_assistant:351]: Speaker buffer full, trying again next loop
[13:20:47][D][voice_assistant:351]: Speaker buffer full, trying again next loop
Dans le cas de l’utilisation de speaker, la lecture du fichier est automatique ou il y a une commande à mettre comme pour le media_player ?
Bonsoir,
Bon je me dit que sur le N16r8 pas moyen (pour moi) d’envoyer du son sur le HP, avec media_player ou speaker donc j’essaie un truc, envoyer sur le media_player d’un autre ESP qui fonctionne:
Pas mieux même si pas d’erreur, dommage, il répond très vite aux demandes, mais pas de sortie audio !
Je vais je crois avec regrets arrêter là avec cet ESP, je passe à mes LED infra rouge pour mes cameras montées sur ESP pour la vision nocturne!
Salut @Bob,
j’ai essayer le media player sur un esp32. J’ai pu lire un mp3 en local et envoyer du TTS.
Par contre j’ai essayer avec youtube et ca fonctionne pas , as tu essayer ?
ca me dit que ca peut pas decoder, peut être le mode: mono.
Bonjour @WarC0zes,
Pour l’instant je n’ai gardé qu’un assistant vocal sur ESP32 T8 V1.7.1.
Sur un ESP32 Wroom 32 classique je n’ai laissé qu’un média player qui fonctionne bien, MP3 et TTS, pour youtube je n’ai jamais essayé, je passe par ma mibox et mon ampli. @Krull56, je viens d’installer Music Assistant, je vais rebrancher mon ESP32-S3 N16R8 pour voir si il veut enfin parler
Merci pour ces nouvelles
Je dois bosser un peu aussi
Bob
De mon côté, j’avance « lentement » sur ma Tablette-Assist avec l’objectif qu’elle remplace mon radio-réveil actuel (mais avec beaucoup plus de fonctionnalités bien sûr)