OSTI.GOV, U.S. Department of Energy
Office of Scientific and Technical Information

Title: Speech recognition systems on the Cell Broadband Engine

Abstract

In this paper we describe our design, implementation, and first results of a prototype connected-phoneme-based speech recognition system on the Cell Broadband Engine™ (Cell/B.E.). Automatic speech recognition decodes speech samples into plain text (other representations are possible) and must process samples at real-time rates. Fortunately, the computational tasks involved in this pipeline are highly data-parallel and can receive significant hardware acceleration from vector-streaming architectures such as the Cell/B.E. Identifying and exploiting these parallelism opportunities is challenging, but also critical to improving system performance. We observed, from our initial performance timings, that a single Cell/B.E. processor can recognize speech from thousands of simultaneous voice channels in real time--a channel density that is orders-of-magnitude greater than the capacity of existing software speech recognizers based on CPUs (central processing units). This result emphasizes the potential for Cell/B.E.-based speech recognition and will likely lead to the future development of production speech systems using Cell/B.E. clusters.

Authors:
Liu, Y; Jones, H; Vaidya, S; Perrone, M; Tydlitat, B; Nanda, A
Publication Date:
2007-04-20
Research Org.:
Lawrence Livermore National Lab. (LLNL), Livermore, CA (United States)
Sponsoring Org.:
USDOE
OSTI Identifier:
940892
Report Number(s):
UCRL-JRNL-230195
Journal ID: ISSN 0018-8646; IBMJAE; TRN: US200824%%388
DOE Contract Number:  
W-7405-ENG-48
Resource Type:
Journal Article
Resource Relation:
Journal Name: IBM Journal of Research and Development, vol. 51, no. 5, August 11, 2007, pp. 583-592; Journal Volume: 51; Journal Issue: 5
Country of Publication:
United States
Language:
English
Subject:
99 GENERAL AND MISCELLANEOUS; ACCELERATION; CAPACITY; DESIGN; ENGINES; IMPLEMENTATION; PERFORMANCE; PIPELINES; PROCESSING; PRODUCTION; SPEECH

Citation Formats

Liu, Y, Jones, H, Vaidya, S, Perrone, M, Tydlitat, B, and Nanda, A. Speech recognition systems on the Cell Broadband Engine. United States: N. p., 2007. Web. doi:10.1147/rd.515.0583.
Liu, Y, Jones, H, Vaidya, S, Perrone, M, Tydlitat, B, & Nanda, A. Speech recognition systems on the Cell Broadband Engine. United States. doi:10.1147/rd.515.0583.
Liu, Y, Jones, H, Vaidya, S, Perrone, M, Tydlitat, B, and Nanda, A. 2007. "Speech recognition systems on the Cell Broadband Engine". United States. doi:10.1147/rd.515.0583. https://www.osti.gov/servlets/purl/940892.
@article{osti_940892,
title = {Speech recognition systems on the Cell Broadband Engine},
author = {Liu, Y and Jones, H and Vaidya, S and Perrone, M and Tydlitat, B and Nanda, A},
abstractNote = {In this paper we describe our design, implementation, and first results of a prototype connected-phoneme-based speech recognition system on the Cell Broadband Engine{\texttrademark} (Cell/B.E.). Automatic speech recognition decodes speech samples into plain text (other representations are possible) and must process samples at real-time rates. Fortunately, the computational tasks involved in this pipeline are highly data-parallel and can receive significant hardware acceleration from vector-streaming architectures such as the Cell/B.E. Identifying and exploiting these parallelism opportunities is challenging, but also critical to improving system performance. We observed, from our initial performance timings, that a single Cell/B.E. processor can recognize speech from thousands of simultaneous voice channels in real time--a channel density that is orders-of-magnitude greater than the capacity of existing software speech recognizers based on CPUs (central processing units). This result emphasizes the potential for Cell/B.E.-based speech recognition and will likely lead to the future development of production speech systems using Cell/B.E. clusters.},
doi = {10.1147/rd.515.0583},
journal = {IBM Journal of Research and Development},
pages = {583--592},
number = 5,
volume = 51,
place = {United States},
year = {2007},
month = apr
}