Advanced Robotics in Healthcare: Google Home Integration

26 de julio de 2022

Advanced Robotics: Conversational Agents in Healthcare

In recent years, robotics has advanced in leaps and bounds. The term advanced robotics now refers to technologies that interact with the real-world environment to solve real-world problems. Applications of advanced robots, particularly in healthcare, are transforming patient care and operational efficiency.

The Role of Virtual Assistants

Many robotic applications require a virtual assistant to interact with the robot. A virtual assistant is a software agent that performs tasks or services for an individual based on commands or questions. It simulates a conversation, delivering voice or text-based information to a user. This enables interaction with artificial agents through natural speech.

A Conversational Agent for Hospital Use

This project involves creating a customized conversational agent that operates within a hospital, assisting patients and enhancing their experience. The system uses a Google Home Mini smart speaker, chosen for its powerful conversational engine. Although the system is compatible with Alexa, this article focuses on the Google Home Mini implementation.

Project Capabilities

With this system, users can wake up the assistant by saying a customized keyword, initiating a conversation. For example:

«Hello Victory, where is patient Juan?»
Response: «Patient Juan is in room 25.»

The robot can also initiate conversations independently, for instance, greeting users or responding to specific events.

What is ROS?

The Robot Operating System (ROS) is an open-source framework that allows developers to build and reuse code across robotic applications. ROS operates via nodes that communicate by publishing and subscribing to topics or using services for requests and responses.

Communication in ROS

Messages: Publishers send messages that subscribers receive.
Services: Nodes send a request and receive a response.

Hardware Components

Raspberry Pi 4

The Raspberry Pi 4, running the Debian Buster 10 OS, provides the computational power and GPIO pins for external interactions. Its low cost and flexibility make it ideal for this project.

Matrix Voice

The Matrix Voice development board includes an 8-microphone array, 18 RGB LEDs, and an FPGA. It connects directly to the Raspberry Pi, enabling voice-driven behavior and IoT integrations.

Software Capabilities

Custom Wake Word Detection

Using Rhasspy, an open-source voice assistant service, the system detects custom wake words via Pocketsphinx. This lightweight tool enables offline configuration and testing through a web interface.

System Features

Play Noise: Prevents accidental wake-ups by Google Home Mini.
Wake Up Google Home Mini: Uses speakers to pronounce the keyword.
Start Conversations: Initiates dialogue autonomously.

State Machine Implementation

The state machine, developed in C++ using Qt, transitions through states like emitting noise, waiting for wake words, and initiating conversations. Signals trigger state changes based on specific events.

Node-RED

Node-RED facilitates graphical programming of hardware and online services by connecting predefined nodes. Below is a part of the flowchart used in this project:

Dialogflow Integration

Dialogflow simplifies the creation of conversational interfaces. Intents like «which patient is in room X?» are mapped to responses processed in Node-RED.

Current Project Status

The technical components are complete, and the system’s hardware assembly is ready:

The next step is to design a 3D part to integrate the system seamlessly into the robot.

A Collaborative Initiative

This project is a collaboration between Scalian Spain and the University of Málaga, led by Juan Antonio Ramírez and supervised by Francisco Javier Camacho Bermúdez and Alejandro Hidalgo Paniagua.

CONVERSATIONAL AGENT FOR SOCIAL ROBOTICS USING THE ROS FRAMEWORK