Convolutional Neural Networks (CNNs) are multi-layer deep structures that have been very successful in visual recognition tasks. These networks basically consist of convolution, pooling, and nonlinearity layers, each of which operates on the representation produced by the preceding layer and generates a new representation. Convolution layers naturally compute some inner product between a plane...
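The inner-product view of convolution mentioned in this abstract can be illustrated with a minimal NumPy sketch (purely illustrative; the function name and toy data are not from the cited work):

```python
import numpy as np

def conv2d_valid(image, kernel):
    """Valid 2-D convolution: each output pixel is the inner product
    of the flipped kernel with the image patch beneath it."""
    kh, kw = kernel.shape
    ih, iw = image.shape
    flipped = kernel[::-1, ::-1]          # true convolution flips the kernel
    out = np.empty((ih - kh + 1, iw - kw + 1))
    for i in range(out.shape[0]):
        for j in range(out.shape[1]):
            patch = image[i:i + kh, j:j + kw]
            out[i, j] = np.sum(patch * flipped)   # inner product per pixel
    return out

edge = np.array([[1.0, -1.0]])            # simple horizontal edge detector
img = np.array([[0.0, 0.0, 1.0, 1.0]])
print(conv2d_valid(img, edge))            # responds at the 0-to-1 transition
```

A CNN layer applies many such kernels in parallel, which is why the operation maps so well onto GPU hardware.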
Blood simulation is an important part of virtual surgery training systems. However, its huge computational cost and the demand for realism pose a great challenge to such systems. In this paper, a GPU-accelerated simulation method is used for blood simulation in a surgical training system. The grid method is used to divide the target area and create a space grid...
The fuzzy hyperline segment neural network (FHLSNN) is a hybrid system of fuzzy logic and neural networks used for pattern classification. It learns patterns in terms of n-dimensional hyperline segments (HLS). The modified fuzzy hyperline segment neural network (MFHLSNN) is a version of FHLSNN that improves the quality of reasoning and the recall time per pattern using a modified fuzzy membership...
In modern networks, different applications generate various types of network traffic. To improve the performance of network management, it is important to identify and classify internet traffic. Machine learning (ML) techniques based on per-flow statistics have been widely used in traffic classification. Different from traditional classification methods,...
Parallel computing is the simultaneous use of multiple compute resources, for example processors, to solve complex computational problems. It has been used in high-end computing areas such as pattern recognition, medical diagnosis, national defense, and web search engines. This paper focuses on the implementation of a pattern classification technique, the Support Vector Machine (SVM), using a vector processor...
Classification is a machine learning task that tries to assign the best class to a given unknown input vector based on past observations (training data). Most developed algorithms are very time-consuming for large datasets (Support Vector Machines, Deep Neural Networks, etc.). The Extreme Learning Machine (ELM) is a high-quality classification algorithm that has gained much popularity in recent...
General Purpose computing on Graphics Processor Units (GPGPU) brings massively parallel computing (hundreds of compute cores) to the desktop at a reasonable cost, but requires that algorithms be carefully designed to take advantage of this power. The present work explores the possibilities of CUDA (NVIDIA Compute Unified Device Architecture) using GPGPU for Quadratic Discriminant (QD) analysis. QD...
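For reference, the quadratic discriminant score that such an implementation parallelizes is the standard per-class function delta_k(x); a small CPU-side NumPy sketch (illustrative only, with hypothetical toy clusters):

```python
import numpy as np

def qd_fit(X, y):
    """Per-class mean, covariance, and prior for quadratic discriminant analysis."""
    params = {}
    for k in np.unique(y):
        Xk = X[y == k]
        params[k] = (Xk.mean(axis=0), np.cov(Xk, rowvar=False), len(Xk) / len(X))
    return params

def qd_score(x, mu, cov, prior):
    """delta_k(x) = -0.5*ln|Sigma_k| - 0.5*(x-mu_k)^T Sigma_k^{-1} (x-mu_k) + ln(pi_k)"""
    d = x - mu
    return (-0.5 * np.log(np.linalg.det(cov))
            - 0.5 * d @ np.linalg.inv(cov) @ d
            + np.log(prior))

def qd_predict(x, params):
    return max(params, key=lambda k: qd_score(x, *params[k]))

rng = np.random.default_rng(1)
X = np.vstack([rng.normal(0, 1, (100, 2)), rng.normal(3, 1, (100, 2))])
y = np.array([0] * 100 + [1] * 100)
params = qd_fit(X, y)
print(qd_predict(np.array([3.0, 3.0]), params))  # near the class-1 mean
```

The score is independent per sample and per class, so a CUDA version can evaluate one thread per (sample, class) pair.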
A parallel Back-Propagation (BP) neural network training technique using the Compute Unified Device Architecture (CUDA) on multiple Graphics Processing Units (GPUs) is proposed. To exploit the maximum performance of GPUs, we propose to implement batch-mode BP training by building the input neurons, hidden neurons, and output neurons into matrix form. The implementation includes CUDA Basic Linear Algebra Subroutines...
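The matrix formulation this abstract refers to can be sketched in NumPy: with all samples of a batch stacked into rows, every forward and backward step becomes a matrix product, which maps directly onto GEMM calls in a BLAS library on the GPU. This is a minimal one-hidden-layer sketch with an assumed toy regression target, not the paper's implementation:

```python
import numpy as np

def bp_batch_step(X, T, W1, W2, lr):
    """One batch-mode back-propagation step with the neuron layers in
    matrix form; each line is a matrix product or elementwise operation."""
    H = np.tanh(X @ W1)              # hidden activations, one row per sample
    Y = H @ W2                       # linear output layer
    dY = (Y - T) / len(X)            # averaged output error
    dW2 = H.T @ dY                   # gradient for output weights
    dH = (dY @ W2.T) * (1 - H ** 2)  # back-propagate through tanh
    dW1 = X.T @ dH                   # gradient for hidden weights
    return W1 - lr * dW1, W2 - lr * dW2

rng = np.random.default_rng(0)
X = rng.standard_normal((64, 3))
T = X[:, :1] ** 2                    # toy regression target
W1 = rng.standard_normal((3, 16)) * 0.1
W2 = rng.standard_normal((16, 1)) * 0.1
loss0 = np.mean((np.tanh(X @ W1) @ W2 - T) ** 2)
for _ in range(500):
    W1, W2 = bp_batch_step(X, T, W1, W2, 0.1)
loss1 = np.mean((np.tanh(X @ W1) @ W2 - T) ** 2)
```

Because the per-sample loops disappear into the matrix dimensions, a CUBLAS port replaces each `@` with a GEMM call.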
The support vector machine (SVM) is a popular classifier for small-scale datasets. It has outstanding performance compared to other classifiers. However, the execution time is extremely long when training on Big Data. The Graphics Processing Unit (GPU) is a massively parallel device which performs very well as a co-processor. NVIDIA proposed a programming platform, CUDA, in 2006, which makes it much...
Eigenface is one of the most common appearance-based approaches for face recognition. Eigenfaces are the principal components which represent the training faces. Using Principal Component Analysis, each face is represented by very few parameters called weight vectors or feature vectors. While this makes the testing process easy, it also involves the cumbersome process of generating the eigenspace and projecting...
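The eigenspace generation and projection steps mentioned above amount to a PCA on flattened images; a minimal NumPy sketch with random stand-in "images" (illustrative, not the cited system):

```python
import numpy as np

def eigenfaces(faces, k):
    """PCA on flattened face images: the top-k principal components of the
    training set are the eigenfaces; each face is described by k weights."""
    mean = faces.mean(axis=0)
    centered = faces - mean
    # SVD of the centered data gives the principal components directly
    _, _, Vt = np.linalg.svd(centered, full_matrices=False)
    basis = Vt[:k]                     # k eigenfaces, one per row
    weights = centered @ basis.T       # weight (feature) vector per face
    return mean, basis, weights

def project(face, mean, basis):
    return (face - mean) @ basis.T     # few parameters represent the face

rng = np.random.default_rng(0)
faces = rng.random((10, 64))           # 10 toy "images", 64 pixels each
mean, basis, weights = eigenfaces(faces, k=4)
print(weights.shape)                   # each face reduced to 4 weights
```

The costly parts, the SVD and the batched projections, are dense linear algebra, which is what motivates moving them to the GPU.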
This paper proposes a real-time face recognition system based on the Compute Unified Device Architecture (CUDA) platform, which effectively completes the face detection and recognition tasks. In the face detection phase, with the Viola-Jones cascade classifier, we implemented and improved novel parallel methodologies for integral image calculation, scan window processing, and the amplification and correction...
This paper introduces a parallel computing method to improve the efficiency of prediction of membrane protein types by SVM. Early hardware limitations of the GPU (lack of synchronization primitives and limited memory caching mechanisms) can make GPU-based computation inefficient. We present this efficient method for prediction of membrane protein types on an Intel(R) Core(TM) i3-3110M quad-core and...
Parallel implementation of neural networks is among the major areas of research in computer science. The Self-Organizing Map (SOM) is a neural network that has been under the spotlight throughout the last decade for implementation on parallel architectures. A SOM trains itself through unsupervised learning by retrieving inherent topological features of the applied input data. In this paper, the design and implementation of...
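The unsupervised, topology-preserving training the abstract describes follows a simple loop: find the best-matching unit for a sample, then pull it and its grid neighbours toward that sample with a shrinking radius and learning rate. A minimal NumPy sketch with assumed schedule constants (illustrative only):

```python
import numpy as np

def som_train(data, grid_w, grid_h, iters, rng):
    """Minimal SOM: find the best-matching unit (BMU), then pull it and
    its grid neighbours toward the sample with decaying radius and rate."""
    weights = rng.random((grid_h, grid_w, data.shape[1]))
    coords = np.indices((grid_h, grid_w)).transpose(1, 2, 0)  # node positions
    for t in range(iters):
        x = data[rng.integers(len(data))]
        d = np.linalg.norm(weights - x, axis=2)
        bmu = np.unravel_index(d.argmin(), d.shape)           # best-matching unit
        frac = t / iters
        sigma = max(grid_w, grid_h) / 2 * (1 - frac) + 0.5    # neighbourhood radius
        lr = 0.5 * (1 - frac) + 0.01                          # learning rate
        g = np.linalg.norm(coords - np.array(bmu), axis=2)    # grid distance to BMU
        h = np.exp(-(g ** 2) / (2 * sigma ** 2))              # neighbourhood kernel
        weights += lr * h[..., None] * (x - weights)
    return weights

rng = np.random.default_rng(0)
data = rng.random((500, 3))            # e.g. RGB colour vectors
weights = som_train(data, 8, 8, 2000, rng)
```

The distance computation and neighbourhood update touch every node independently, which is the part a parallel implementation distributes across threads.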
Facial recognition techniques are of interest for tracking and identification in densely populated areas where security is an important concern. Traditional recognition techniques have yielded acceptable results with high repeatability but require special conditions such as a voluntary and stationary subject, close proximity, and appropriate lighting. Because no single algorithm yields robust results...
Liquid chromatography-based tandem mass spectrometry (LC-MS) technique allows for identification and quantification of thousands of proteins in parallel. This technique coupled with a feed-forward artificial neural network provides a technique to analyze and select protein panels for use in multi-biomarker panel discovery applications. In this study, we enhance this technique by utilizing massively...
The training procedure of Hidden Markov Model (HMM) based Speech Recognition is often very time-consuming because of its high computational complexity. New parallel hardware such as the GPU provides multi-thread processing and very high floating-point capability. We take advantage of the GPU to accelerate a popular HMM-based Speech Recognition package, HTK. Based on the sequential code of HTK, we design...
Training Artificial Neural Networks (ANNs) is a time-consuming process in machine learning systems. In this work we provide an implementation of the back-propagation algorithm on CUDA, a parallel computing architecture developed by NVIDIA. Using CUBLAS, a CUDA implementation of the Basic Linear Algebra Subprograms (BLAS) library, the process is simplified; however, the use of kernels was...
The accuracy of Conditional Random Fields (CRFs) is achieved at the cost of a huge amount of computation to train the model. In this paper we designed a parallelized algorithm for Gradient Ascent-based CRF training methods for biological sequence alignment. Our contribution is mainly on two aspects: 1) we flexibly parallelized the different iterative computation patterns, and the corresponding optimization...
An algorithm for evolving recurrent neural networks via a genetic algorithm was implemented on CUDA, resulting in a system called CuParcone (CUDA-based Partially Connected Neural Evolutionary). Run on an Nvidia Tesla “GPU supercomputer,” CuParcone achieves a performance increase of 323 times in face gender recognition compared to the comparable Parcone algorithm on a state-of-the-art, commodity...
In this paper we describe the implementation of a complete ANN training procedure for speech recognition using the block mode back-propagation learning algorithm. We exploit the high performance SIMD architecture of GPU using CUDA and its C-like language interface. We also compare the speed-up obtained implementing the training procedure only taking advantage of the multi-thread capabilities of multi-core...