Annu Int Conf IEEE Eng Med Biol Soc
July 2020
Many prior studies on EEG-based emotion recognition did not consider the spatial-temporal relationships among brain regions and across time. In this paper, we propose a Regionally-Operated Domain Adversarial Network (RODAN), to learn spatial-temporal relationships that correlate between brain regions and time. Moreover, we incorporate the attention mechanism to enable cross-domain learning to capture both spatial-temporal relationships among the EEG electrodes and an adversarial mechanism to reduce the domain shift in EEG signals.
View Article and Find Full Text PDFLifelog photo review is considered to enhance the recall of personal events. While a sizable body of research has explored the neural basis of autobiographical memory (AM), there is limited neural evidence on the retrieval-based enhancement effect on event memory among older adults in the real-world environment. This study examined the neural processes of AM as was modulated by retrieval practice through lifelog photo review in older adults.
View Article and Find Full Text PDFSingle-image dehazing has been an important topic given the commonly occurred image degradation caused by adverse atmosphere aerosols. The key to haze removal relies on an accurate estimation of global air-light and the transmission map. Most existing methods estimate these two parameters using separate pipelines which reduces the efficiency and accumulates errors, thus leading to a suboptimal approximation, hurting the model interpretability, and degrading the performance.
View Article and Find Full Text PDFIEEE Trans Image Process
January 2020
While analyzing the performance of state-of-the-art R-CNN based generic object detectors, we find that the detection performance for objects with low object-region-percentages (ORPs) of the bounding boxes are much lower than the overall average. Elongated objects are examples. To address the problem of low ORPs for elongated object detection, we propose a hybrid approach which employs a Faster R-CNN to achieve robust detections of object parts, and a novel model-driven clustering algorithm to group the related partial detections and suppress false detections.
View Article and Find Full Text PDFIEEE Trans Pattern Anal Mach Intell
August 2019
We introduce a new problem of gaze anticipation on future frames which extends the conventional gaze prediction problem to go beyond current frames. To solve this problem, we propose a new generative adversarial network based model, Deep Future Gaze (DFG), encompassing two pathways: DFG-P is to anticipate gaze prior maps conditioned on the input frame which provides task influences; DFG-G is to learn to model both semantic and motion information in future frame generation. DFG-P and DFG-G are then fused to anticipate future gazes.
View Article and Find Full Text PDFSearching for a target object in a cluttered scene constitutes a fundamental challenge in daily vision. Visual search must be selective enough to discriminate the target from distractors, invariant to changes in the appearance of the target, efficient to avoid exhaustive exploration of the image, and must generalize to locate novel target objects with zero-shot training. Previous work on visual search has focused on searching for perfect matches of a target after extensive category-specific training.
View Article and Find Full Text PDFSocial working memory (SWM) plays an important role in navigating social interactions. Inspired by studies in psychology, neuroscience, cognitive science, and machine learning, we propose a probabilistic model of SWM to mimic human social intelligence for personal information retrieval (IR) in social interactions. First, we establish a semantic hierarchy as social long-term memory to encode personal information.
View Article and Find Full Text PDFInspired by progresses in cognitive science, artificial intelligence, computer vision, and mobile computing technologies, we propose and implement a wearable virtual usher for cognitive indoor navigation based on egocentric visual perception. A novel computational framework of cognitive wayfinding in an indoor environment is proposed, which contains a context model, a route model, and a process model. A hierarchical structure is proposed to represent the cognitive context knowledge of indoor scenes.
View Article and Find Full Text PDFDuring wayfinding in a novel environment, we encounter many new places. Some of those places are encoded by our spatial memory. But how does the human brain "decides" which locations are more important than others, and how do backtracking and repetition priming enhances memorization of these scenes? In this work, we explore how backtracking improves encoding of encountered locations.
View Article and Find Full Text PDFIn this paper, we propose a hybrid architecture that combines the image modeling strengths of the bag of words framework with the representational power and adaptability of learning deep architectures. Local gradient-based descriptors, such as SIFT, are encoded via a hierarchical coding scheme composed of spatial aggregating restricted Boltzmann machines (RBM). For each coding layer, we regularize the RBM by encouraging representations to fit both sparse and selective distributions.
View Article and Find Full Text PDFIEEE Trans Pattern Anal Mach Intell
January 2014
This paper presents a visual saliency modeling technique that is efficient and tolerant to the image scale variation. Different from existing approaches that rely on a large number of filters or complicated learning processes, the proposed technique computes saliency from image histograms. Several two-dimensional image co-occurrence histograms are used, which encode not only "how many" (occurrence) but also "where and how" (co-occurrence) image pixels are composed into a visual image, hence capturing the "unusualness" of an object or image region that is often perceived by either global "uncommonness" (i.
View Article and Find Full Text PDFAnnu Int Conf IEEE Eng Med Biol Soc
July 2015
The study of stem cells is one of the most important biomedical research. Understanding their development could allow multiple applications in regenerative medicine. For this purpose, automated solutions for the observation of stem cell development process are needed.
View Article and Find Full Text PDFContext: According to Nottingham grading system, mitosis count in breast cancer histopathology is one of three components required for cancer grading and prognosis. Manual counting of mitosis is tedious and subject to considerable inter- and intra-reader variations.
Aims: The aim is to investigate the various texture features and Hierarchical Model and X (HMAX) biologically inspired approach for mitosis detection using machine-learning techniques.
Annu Int Conf IEEE Eng Med Biol Soc
August 2012
Live imaging of neural stem cells and progenitors is important to follow the biology of these cells. Non-invasive imaging techniques, such as phase contrast microscopy, are preferred as neural stem cells are very sensitive to photoxic damage cause by excitation of fluorescent molecules. However, large illumination variations and weak foreground/background contrast make phase contrast images challenging for image processing.
View Article and Find Full Text PDFAnnu Int Conf IEEE Eng Med Biol Soc
June 2012
Cataract remains a leading cause for blindness worldwide. Cataract diagnosis via human grading is subjective and time-consuming. Several methods of automatic grading are currently available, but each of them suffers from some drawbacks.
View Article and Find Full Text PDFAnnu Int Conf IEEE Eng Med Biol Soc
March 2011
Cataract is the leading cause of blindness and posterior subcapsular cataract (PSC) leads to significant visual impairment. An automatic approach for detecting PSC opacity in retro-illumination images is investigated. The features employed include intensity, edge, size and spatial location.
View Article and Find Full Text PDFPurpose: To validate a new computer-aided diagnosis (CAD) imaging program for the assessment of nuclear lens opacity.
Methods: Slit-lamp lens photographs from the Singapore Malay Eye Study (SiMES) were graded using both the CAD imaging program and manual assessment method by a trained grader using the Wisconsin Cataract Grading System. Cataract was separately assessed clinically during the study using Lens Opacities Classification System III (LOCS III).
IEEE Trans Biomed Eng
January 2011
Under the framework of computer-aided eye disease diagnosis, this paper presents an automatic optic disc (OD) detection technique. The proposed technique makes use of the unique circular brightness structure associated with the OD, i.e.
View Article and Find Full Text PDFMed Image Comput Comput Assist Interv
November 2010
A video recording of an examination by Wireless Capsule Endoscopy (WCE) may typically contain more than 55,000 video frames, which makes the manual visual screening by an experienced gastroenterologist a highly time-consuming task. In this paper, we propose a novel method of epitomized summarization of WCE videos for efficient visualization to a gastroenterologist. For each short sequence of a WCE video, an epitomized frame is generated.
View Article and Find Full Text PDFIEEE Trans Med Imaging
January 2011
In clinical diagnosis, a grade indicating the severity of nuclear cataract is often manually assigned by a trained ophthalmologist to a patient after comparing the lens' opacity severity in his/her slit-lamp images with a set of standard photos. This grading scheme is often subjective and time-consuming. In this paper, a novel computer-aided diagnosis method via ranking is proposed to facilitate nuclear cataract grading following conventional clinical decision-making process.
View Article and Find Full Text PDFIEEE Trans Biomed Eng
October 2010
Under the framework of computer-aided diagnosis, optical coherence tomography (OCT) has become an established ocular imaging technique that can be used in glaucoma diagnosis by measuring the retinal nerve fiber layer thickness. This letter presents an automated retinal layer segmentation technique for OCT images. In the proposed technique, an OCT image is first cut into multiple vessel and nonvessel sections by the retinal blood vessels that are detected through an iterative polynomial smoothing procedure.
View Article and Find Full Text PDFCell enumeration and diagnosis using peripheral blood smears are routine tasks in many biological and pathological examinations. Not every area in the smear is appropriate for such tasks due to severe cell clumping or sparsity. Manual working-area selection is slow, subjective, inconsistent, and statistically biased.
View Article and Find Full Text PDFCataracts are the leading cause of blindness worldwide, and nuclear cataract is the most common form of cataract. An algorithm for automatic diagnosis of nuclear cataract is investigated in this paper. Nuclear cataract is graded according to the severity of opacity using slit lamp lens images.
View Article and Find Full Text PDFAnnu Int Conf IEEE Eng Med Biol Soc
April 2010
An automatic diagnosis system of nuclear cataract is presented in this paper. Nuclear cataract is graded according to the severity of opacity using slit-lamp lens images. Anatomical structure in the lens image is detected using a modified active shape model (ASM).
View Article and Find Full Text PDFAnnu Int Conf IEEE Eng Med Biol Soc
March 2010
With the advances of computer technology, more and more computer-aided diagnosis (CAD) systems have been developed to provide the "second opinion". This paper reports an automatic fundus image classification technique that is designed to screen out the severely degraded fundus images that cannot be processed by traditional CAD systems. The proposed technique classifies fundus images based on the image range property.
View Article and Find Full Text PDF