The Infona portal uses cookies, i.e. strings of text saved by a browser on the user's device. The portal can access those files and use them to remember the user's data, such as their chosen settings (screen view, interface language, etc.), or their login data. By using the Infona portal the user accepts automatic saving and using this information for portal operation purposes. More information on the subject can be found in the Privacy Policy and Terms of Service. By closing this window the user confirms that they have read the information on cookie usage, and they accept the privacy policy and the way cookies are used by the portal. You can change the cookie settings in your browser.
Research shows that speech dereverberation (SD) with Deep Neural Network (DNN) achieves the state-of-the-art results by learning spectral mapping, which, simultaneously, lacks the characterization of the local temporal spectral structures (LTSS) of speech signal and calls for a large storage space that is impractical in real applications. Contrarily, the Convolutional Neural Network (CNN) offers a...
We demonstrated experimentally a stable 1μm multi-wavelength YDF based random fiber laser by using a polarization independent Sagnac-loop mirror with output power of 1.6W and high OSNR of 30dB.
We present the SRAM bitcell offering from 22FDXTM (a 22nm FDSOI technology) with competitive 1.46mV-µm FinFET-like transistor mismatch coefficient (AVt) built with low cost planar architecture. Extremely low minimum operating voltages (Vmin) are reported for both the high-density (HD) 0.110μm2 and high-current (HC) 0.124μm2 bitcells without any assist, showing 95% limited yield (LY) Vmin values of...
Channel variability is one of the largest challenges for speaker verification (SV) techniques. Techniques in the feature, model and score domains have been applied to mitigate the channel impact. In this paper, we strive to study on robust deep feature learning with the deep belief network (DBN) by using traditional spectral features such as MFCC or PLP. In detail, during the training phase, a DBN...
Device-free localization (DFL) aims at locating the positions of targets without carrying any emitting devices by monitoring the received signals of preset wireless devices. Research showed that the localization accuracy of conventional DFL algorithms decreases in presence of noise and outliers. To tackle this problem, this paper firstly proposes to study the DFL via sparse representation and the...
For existing mainstream visual object counting (VOC) methods, training data insufficiency will lead to significant performance degradation. To address this challenge, we propose a novel sparsity-constrained example-based VOC method. Given a test image, its counts are estimated by integrating over its density map, and our method will predict such density map based on patch using training examples....
This paper reports on the development of a packaged coherent photonic mixer (CPX) for coherent Radio-over-Fiber (RoF) systems. The developed integrated CPX performs direct optical-to-RF up-conversion with a 5 dB better conversion efficiency at 60 GHz as compared to a commercially available 110 GHz photodiode. The 3 dB bandwidth and maximum RF output power of the CPX are 65 GHz and +7 dBm, respectively...
Big traffic data analysis for intelligent transportation is attracting more and more attention. Due to different designs of vehicles in the same class and the similarity of shape and textures between different classes, vehicle classification is remaining a challenge. In this paper, different from traditional methods that only classify vehicles to two or three types in one viewpoint, a novel method...
For mobile speech application, speaker DOA estimation accuracy, interference robustness and compact physical size are three key factors. Considering the size, we utilized acoustic vector sensor (AVS) and proposed a DOA estimation algorithm previously [1], offering high accuracy with larger-than-15dB SNR but is deteriorated by nonspeech interferences (NSI). This paper develops a robust speaker DOA...
In this paper, the magnetic properties of an individual square permalloy (Ni80Fe20) nanoelement at the center of an array with both in-plane shape anisotropy and interelement interaction were investigated by the 3-D object oriented micromagnetic framework. Variation of the interelement coupling was studied by changing the element lateral size (a) and interelement spacing (s). Magnetic hysteresis loop...
An integrated 110 GHz coherent photonic mixer (CPX) is designed and fabricated for coherent RoF (CRoF) mobile backhaul links. The CPX simultaneously performs optical WDM channel selection and direct optical-to-RF conversion. Due to its broadband performance, the CPX simultaneously supports future wireless systems operating in the 57–64 GHz, 71–76 GHz, 81–86 GHz bands and even research-type W-band...
Speech emotion recognition (SER) is a challenging task since it is unclear what kind of features are able to reflect the characteristics of human emotion from speech. However, traditional feature extractions perform inconsistently for different emotion recognition tasks. Obviously, different spectrogram provides information reflecting difference emotion. This paper proposes a systematical approach...
This paper investigates the formation of ad-hoc microphone arrays for the purpose of recording multiple sound sources by clustering microphones spatially distributed within a room. A novel codebook-based unsupervised method for cluster formation using features derived from the Room Impulse Responses (RIRs) corresponding to each microphone is proposed and compared with baseline clustering and classification...
The performance of speaker verification system (SVS) declines dramatically in noisy environments. To suppress the adverse impact of the noise on SVS, this paper investigates employing the nonnegative matrix factorization (NMF) technique to reconstruct the speech based on the pre-trained speech basis matrix (SBM) and noise basis matrix (NBM). The contribution of this research lies in utilizing the...
Generally, in multi-lingual communities, non-native speakers may produce speech sound which is either part of their own native language or established via merging characteristics of native pronunciation with non-native pronunciation. Recently, a Two-pass phone clustering based on Confusion Matrix (TCM) approach has been proposed to address the one-to-one phone mappings between Chinese syllables and...
The sampling jitter is one of the main problems in the direct RF bandpass sampling receiver architecture. Sampling jitter will seriously degrade the performance of the receiver, which can be improved effectively by digital compensation algorithm. This paper investigates the sampling jittering mitigation for the direct RF bandpass sampling receiver. Under the direct RF bandpass sampling receiver architecture,...
This paper proposes an approach to eliminate redundant images adaptively for Wireless Capsule Endoscopy (WCE) video summarization by considering temporal correlation and feature similarity between adjacent WCE frames. The color and texture features, generated by HSV color histogram model and Gray Level Co-occurrence Matrix, have been taken into account. It is noted that frames from different WCE videos...
Wireless capsule endoscopy (WCE) is an innovative solution for gastrointestinal disease detection. Limited by WCE hardware and cost of manufacture, WCE image resolution is commonly low, which creates problems for attention to image details and visual perception in medical diagnosis. Under the sparse representation framework, we propose an adaptive dictionary pair learning method to obtain more appropriate...
An experiment system of a scaled-down monolithic radial transmission line (MRTL) for future Z-pinch drivers was established to testify the validity of 3-D electromagnetic (EM) simulation. The MRTL had a hyperbolic impedance profile. Being immersed in deionized water, the MRTL was composed of two flat aluminum plates separated by a distance of 1 cm. The radius of both plates was 0.5 m, which corresponds...
Accurate DOA estimation based on clustering the inter-sensor data ratios (ISDRs) of a single acoustic vector sensor (AVS), referred as AVS-ISDR, relies on reliable extraction of time-frequency points with high local signal-to-noise ratio (HLSNR-TFPs) and its performance degrades in noisy environments. This paper investigates deep neural networks (DNNs) trained with noisy-clean speech pairs under different...
Set the date range to filter the displayed results. You can set a starting date, ending date or both. You can enter the dates manually or choose them from the calendar.