• Home
  • Advanced Search
  • Directory of Libraries
  • About lib.ir
  • Contact Us
  • History

عنوان
Biologically Inspired Efficiencies in Computer Vision and Audition

پدید آورنده
Ebrahimpour, Mohammadkazem

موضوع

رده

کتابخانه
Center and Library of Islamic Studies in European Languages

محل استقرار
استان: Qom ـ شهر: Qom

Center and Library of Islamic Studies in European Languages

تماس با کتابخانه : 32910706-025

NATIONAL BIBLIOGRAPHY NUMBER

Number
TL1jj4n9fp

LANGUAGE OF THE ITEM

.Language of Text, Soundtrack etc
انگلیسی

TITLE AND STATEMENT OF RESPONSIBILITY

Title Proper
Biologically Inspired Efficiencies in Computer Vision and Audition
General Material Designation
[Thesis]
First Statement of Responsibility
Ebrahimpour, Mohammadkazem
Subsequent Statement of Responsibility
Noelle, David C

.PUBLICATION, DISTRIBUTION, ETC

Name of Publisher, Distributor, etc.
UC Merced
Date of Publication, Distribution, etc.
2020

DISSERTATION (THESIS) NOTE

Body granting the degree
UC Merced
Text preceding or following the note
2020

SUMMARY OR ABSTRACT

Text of Note
Computer perception is one of the fundamental problems in artificial intelligence. Given an image or a recorded audio, a human can quickly recognize and detect objects based on image or sound or both. In computer vision, Object Detection is concerned with recognizing objects in images and drawing a bounding box around them. Researchers have been working on developing algorithms to recognize, detect, and segment objects/scenes in images for decades. Numerous challenges make these problems significantly challenging in real-world scenarios, since objects usually appear in different conditions, such as viewpoints, scales, and with background noise, and they even may deform into different shapes, parts, or poses. Real-time object detection has many important applications, such as autonomous driving cars and video surveillance. In this dissertation, we approach visual understanding in the following ways: First, we utilize implicit information in trained neural networks to localize all objects of interest in an image using a sensitivity analysis approach. Second, we introduce a novel framework for object detection called "Ventral- Dorsal" Neural Networks, inspired by the structure of the human brain. Third, we expand the Ventral-Dorsal framework, focusing on attaining real-time performance needed for online applications. Forth, we compare human attention with deep neural network attention algorithms in order to understand whether neural network attention matches human attention. Also, auditory perception is crucial in artificial intelligence systems. Until recently, auditory object recognition pipelines were in need of substantial hand engineering for feature extraction. Engineered features need to be tuned for every individual problem. Also, some popular feature extraction methods are time-consuming, limiting real-time applications.Here we attempt to avoid these problems using end-to-end training. Due to the recent improvements in deep neural networks, we are able to eliminate feature learning by optimizing feature extraction and classification jointly in one network. In this dissertation, we approach the auditory object recognition problem in the following ways: we proposed a novel "end-to-end" deep neural network architecture that takes raw audio as input and maps it to class labels. We also applied our proposed architecture to a new dataset of infant vocalization sounds for further investigation.

PERSONAL NAME - PRIMARY RESPONSIBILITY

Ebrahimpour, Mohammadkazem

PERSONAL NAME - SECONDARY RESPONSIBILITY

Noelle, David C

CORPORATE BODY NAME - SECONDARY RESPONSIBILITY

UC Merced

ELECTRONIC LOCATION AND ACCESS

Electronic name
 مطالعه متن کتاب 

p

[Thesis]
276903

a
Y

Proposal/Bug Report

Warning! Enter The Information Carefully
Send Cancel
This website is managed by Dar Al-Hadith Scientific-Cultural Institute and Computer Research Center of Islamic Sciences (also known as Noor)
Libraries are responsible for the validity of information, and the spiritual rights of information are reserved for them
Best Searcher - The 5th Digital Media Festival