Bioengineering (Basel)
September 2023
Human skeleton data obtained using a depth camera have been used for pathological gait recognition to support doctor or physician diagnosis decisions. Most studies for skeleton-based pathological gait recognition have used either raw skeleton sequences directly or gait features, such as gait parameters and joint angles, extracted from raw skeleton sequences. We hypothesize that using skeleton, joint angles, and gait parameters together can improve recognition performance.
View Article and Find Full Text PDFWith the development of wearable devices such as smartwatches, several studies have been conducted on the recognition of various human activities. Various types of data are used, e.g.
View Article and Find Full Text PDFAlthough attention deficit hyperactivity disorder (ADHD) in children is rising worldwide, fewer studies have focused on screening than on the treatment of ADHD. Most previous similar ADHD classification studies classified only ADHD and normal classes. However, medical professionals believe that better distinguishing the ADHD-RISK class will assist them socially and medically.
View Article and Find Full Text PDFThe identification of attention deficit hyperactivity disorder (ADHD) in children, which is increasing every year worldwide, is very important for early diagnosis and treatment. However, since ADHD is not a simple disease that can be diagnosed with a simple test, doctors require a large period of time and substantial effort for accurate diagnosis and treatment. Currently, ADHD classification studies using various datasets and machine learning or deep learning algorithms are actively being conducted for the screening diagnosis of ADHD.
View Article and Find Full Text PDFAmong existing wireless and wearable indoor pedestrian tracking solutions, the ultra-wideband (UWB) and inertial measurement unit (IMU) sensors are the popular options due to their accurate and globally referenced positioning, and low-cost and compact size, respectively. However, the UWB position accuracy is compromised by the indoor non-line of sight (NLOS) and the IMU estimation suffers from orientation drift as well as requiring position initialization. To overcome these limitations, this paper proposes a low-cost foot-placed UWB and IMU fusion-based indoor pedestrian tracking system.
View Article and Find Full Text PDFSpeech is a commonly used interaction-recognition technique in edutainment-based systems and is a key technology for smooth educational learning and user-system interaction. However, its application to real environments is limited owing to the various noise disruptions in real environments. In this study, an audio and visual information-based multimode interaction system is proposed that enables virtual aquarium systems that use speech to interact to be robust to ambient noise.
View Article and Find Full Text PDFConcomitant with the recent advances in deep learning, automatic speech recognition and visual speech recognition (VSR) have received considerable attention. However, although VSR systems must identify speech from both frontal and profile faces in real-world scenarios, most VSR studies have focused solely on frontal face pictures. To address this issue, we propose an end-to-end sentence-level multi-view VSR architecture for faces captured from four different perspectives (frontal, 30°, 45°, and 60°).
View Article and Find Full Text PDFSkeleton data, which is often used in the HCI field, is a data structure that can efficiently express human poses and gestures because it consists of 3D positions of joints. The advancement of RGB-D sensors, such as Kinect sensors, enabled the easy capture of skeleton data from depth or RGB images. However, when tracking a target with a single sensor, there is an occlusion problem causing the quality of invisible joints to be randomly degraded.
View Article and Find Full Text PDFDeep learning technology has encouraged research on noise-robust automatic speech recognition (ASR). The combination of cloud computing technologies and artificial intelligence has significantly improved the performance of open cloud-based speech recognition application programming interfaces (OCSR APIs). Noise-robust ASRs for application in different environments are being developed.
View Article and Find Full Text PDFIn visual speech recognition (VSR), speech is transcribed using only visual information to interpret tongue and teeth movements. Recently, deep learning has shown outstanding performance in VSR, with accuracy exceeding that of lipreaders on benchmark datasets. However, several problems still exist when using VSR systems.
View Article and Find Full Text PDFAnnu Int Conf IEEE Eng Med Biol Soc
July 2019
Gait is an important indicator for specific diseases. Abnormal gait patterns are caused by various factors such as physical, neurological, and sensory problems. If it is possible to recognize abnormal gait patterns in the early stage of the related disease, patients can receive proper treatment early and prevent secondary accidents such as falls caused by unbalanced gait.
View Article and Find Full Text PDFUnlabelled: The purpose of this study was to investigate if multi-domain cognitive training, especially robot-assisted training, alters cortical thickness in the brains of elderly participants. A controlled trial was conducted with 85 volunteers without cognitive impairment who were 60 years old or older. Participants were first randomized into two groups.
View Article and Find Full Text PDF