分享好友 最新动态首页 最新动态分类 切换频道
机器人对话:一个WebGL演示,结合了来自网络浏览器的虚拟现实,语音识别和合成语音(仅适用于GoogleChrome)资源
2024-12-26 06:06
# Talk-to-the-Bot

机器人对话:一个WebGL演示,结合了来自网络浏览器的虚拟现实,语音识别和合成语音(仅适用于GoogleChrome)资源

A WebGL demo that combines virtual reality, speech recognition and synthetic voice from the web browser.<br/><br/> It shows a panorama picture that contains an interactive 3D model that allows the user to interact with voice commands and a VR interaction pointer. <strong>The app is configured to work in English and Spanish detecting the browser language automatically</strong>.<br/><br/> <strong>This demo currently works 100% only in Google Chrome and Google Chrome for Android</strong>. You'll find errors and issues with speech recognition in other browsers, and synthetic voice only works in Chrome as an experimental Javascript API.<br/><br/> <a href=https://download.csdn.net/download/weixin_42129412/"https://www.youtube.com/watch?v=tlX-FjS7eyg&feature=youtu.be&list=PLlmvyH9T-yztt_oK6M0UhlWLiiMHpnrO5" target="blank">I showed this demo in one of my three.js workshops in Spain ( IT STARTS ON 31:40 !!!! )</a> # Install and run - Download or clone the source files and run it in a local web server using Google Chrome or Google Chrome for Android.<br/> - <strong>When it runs from Android, the app will setup automatically for VR mode</strong>. Then tap the screen and the app will setup the browser window in fullscreen mode. You can use a Google Cardboard or any other third-party VR glasses to enjoy the experience. The screen will split in stereoscopic mode and you'll notice that the camera moves using the device's accelerometer. - Use the VR pointer ( using the mouse in desktop ) to interact with the small floating 3D robot. It will show you small text boxes at her side and the sythetic voice API will reproduce the text in Google Chrome. The speech sequence is started and stopped by pointing in or out of the robot. - When the robot asks for your name, Chrome will ask you about allow the microphone for the speech recognition. You must allow it in order to talk to the app. - Then... say your name! The speech commands are configured to say it in different ways ( "I am..." or "My name is..." ), and the synthetic voice will salute you using your given name. - Finally, if you say "Thanks" or "Thank you" after the salute, the robot will answer again. # Practical cases This is a very simple implementation of a sort of "VR bot". It can be taken to the next level using conversational platforms such as API.ai or any other information API using AJAX requests in Javascript. Replacing the synthetic voice with recorded audio would increase the quality of the final product. # Enabling Helper Mode Just modify this line in `js/main.js`: <pre>enableHelperMode(false);</pre> If you set it to `true`, it changes the three.js Perspective Camera properties and it shows a wireframed box that represents a "reference area" to place a 3D object in order to make it "credible" combined with the panorama sphere. It also shows a transparent box overlapping with the robot model. That's because the pointer of the Reticulum plugin only works pointing to three.js basic geometries. So any imported 3D object in the scene needs to be "wrapped" within an invisible box geometry that interacts with the pointer. # Implementation A list of the libraries and assets used for this demo: - WebGL implementation : <a href=https://download.csdn.net/download/weixin_42129412/"https://github.com/mrdoob/three.js/" target="blank">three.js r83 by Mr. Doob</a> - Stereoscopic view : <a href=https://download.csdn.net/download/weixin_42129412/"https://github.com/mrdoob/three.js/blob/73edfb8e72070da688517bdc1b34024ddfde8b08/examples/js/effects/StereoEffect.js" target="blank">StereoEffect.js for three.js by Mr. Doob</a> - Device Orientation Controls : <a href=https://download.csdn.net/download/weixin_42129412/"https://github.com/mrdoob/three.js/blob/6c7f000734f8579da37fb39e5c2e9e5e2dfb14f8/examples/js/controls/DeviceOrientationControls.jss" target="blank">DeviceOrientationControls.js for three.js by richt & WestLangley</a> - VR Pointer : <a href=https://download.csdn.net/download/weixin_42129412/"https://github.com/skezo/Reticulum" target="blank">Reticulum 2.0.2 by Skezo</a> - Speech recognition : <a href=https://download.csdn.net/download/weixin_42129412/"https://github.com/TalAter/annyang" target="blank">Annyang 2.4.0 by Tal Ater</a> - <a href=https://download.csdn.net/download/weixin_42129412/"https://developer.mozilla.org/en-US/docs/Web/API/SpeechSynthesis" target="blank">SpeechSynthesis API in MDN</a> - three.js OBJLoader.js - THREEX dynamic texture extension by Jerome Etienne
最新文章
搜索引擎原理精讲
1. 前言 1.1 搜索引擎是什么? 这里有个概念需要提一下。信息检索 (Information Retrieval 简称 IR) 和 搜索 (Search) 是有区别的,信息检索是一门学科,研究信息的获取、表示、存储、组织和访问,而搜索只是信息
电脑上pdf怎么转换成ppt?这里教你这4种方法!
在办公和学习中,我们经常需要将PDF文件转换为PPT格式以便于编辑和展示。虽然PDF文件的兼容性很高,但当我们需要修改或重新组织内容时,PPT格式则更加灵活。那么电脑上pdf怎么转换成ppt呢?以下是四种在电脑上将PDF转换为PPT的方法。“转转
鹏欣漫城都荟值得买吗?区域配套分析/项目最新情况
鹏欣漫城都荟项目情况怎么样? 地址: 韶光街(联大街地铁站D出口出来前行约150米)项目地址:昆明市呈贡新区雨花片区彩云南路与聚贤街路交叉口占地面积:283亩,用地124亩(A1地块占地384户,A2地块21亩376户,A3地块25亩336户) 建筑面积
人勤春早地生金,家国富强美万民。猴子吃桃指代表什么生肖,前沿解答解释落实_la59.71.30
“人勤春早地生金,家国富强美万民”这一句谚语,蕴含着中华民族深厚的文化底蕴和积极向上的民族精神,它既是对勤劳致富美德的颂扬,也是对国家繁荣昌盛、人民幸福生活的美好祝愿,而“猴子吃桃”这一形象,则在中国传统文化中具有丰富的象
某美漫的传奇人生米一克著 更新时间 2024-06-04 10:37:41
当主角有了能够直接干涉到普朗克尺寸精度的高维度干涉力,将会发生什么,本书主要以无限流的方式记录本人不时的脑洞主角开局第一天就地球无敌,金手指很强劲,金手指没有恶心人的系统小精灵,作者不会闲的蛋疼死压主角战力,大多数副本里主
驰能洗碗机维修24小时人工电话-2025讲信誉+排名一览 - 热点 - 百科知识-蓝心网
驰能洗碗机维修24小时人工电话-2025讲信誉+排名一览驰能洗碗机售后服务维修电话:400-889-3660驰能洗碗机24小时维修客服热线:400-8928-566驰能洗碗机24小时服务电话《今日发布》驰能洗碗机附近上门服务电话7天24小时人工电话客服为您服务
【魅蓝(移动4G)宜搜小说下载】魅族魅蓝 移动4G宜搜小说5.8.9免费下载
专注品质阅读16年,亿万读者的共同选择!新用户8天有惊喜,小说随心看,享多重特权大礼!热爱看书追更,认准宜搜小说!海量热门正版书籍尽在掌中!玄幻、仙侠、穿越、言情、修真、悬疑、异能、系统文、同人文等等等等齐全挚爱分类!书友的
精选7个不错的WordPress问答/论坛插件
问答和论坛可以帮助您改善站点中的用户参与度,从而使您更轻松地建立在线社区。无论您是在寻找功能强大的论坛插件来处理庞大的社区,还是在寻找简单的插件来处理小型讨论,这篇文章都将使您深入了解最佳的WordPress问答/论坛插件。在您的站
让AI智造赋能高质量发展
转自:团结报  □ 本报见习记者 闻 超 记者 王恺强  日前,2024实体经济发展大会在江苏省苏州市昆山成功举办,来自政商学界的领导和企业经营者以及专家学者齐聚一堂,聚焦“AI智造 创新未来”主题,共同探讨人工智能与实体经济的
相关文章
推荐文章
发表评论
0评