| 000 | 05155nam a2200553Ii 4500 | ||
|---|---|---|---|
| 005 | 20250919002921.0 | ||
| 008 | 150410s2013 caua foabi 001 0 eng | ||
| 020 |
_a9781608454730 _qpaperback _cRM168.30 |
||
| 039 | 9 |
_a201507060848 _blan _c201507060847 _dlan _c201506181226 _drahah _y04-10-2015 _zrahah |
|
| 040 |
_aCaBNvSL _beng _cJ2I _dJ2I _dWAU _dOCLCO _dE7B _dUMC _dNST _dYDXCP _dUMI _dCOO _dUKM _erda |
||
| 090 | _aTK7882.S65H647 3 | ||
| 090 |
_aTK7882.S65 _bH647 3 |
||
| 100 | 1 |
_aHori, Takaaki _eauthor. |
|
| 245 | 1 | 0 |
_aSpeech recognition algorithms using weighted finite-state transducers / _cTakaaki Hori and Atsushi Nakamura. |
| 264 | 1 |
_aSan Rafael, Calif. : _bMorgan & Claypool, _c2013. |
|
| 264 | 4 | _c©2013. | |
| 300 |
_a1 online resource (xii, 150 p.) : _billustrations., digital file ; _c24 cm. |
||
| 336 |
_atext _2rdacontent |
||
| 337 |
_acomputer _2rdamedia |
||
| 338 |
_aonline resource _2rdacarrier |
||
| 490 | 1 |
_aSynthesis lectures on speech and audio processing, _x1932-1678 ; _v# 10. |
|
| 504 | _aIncludes bibliographical references (p. 137-148). | ||
| 505 | 0 | _aPreface -- 1. Introduction -- 1.1 Speech recognition and computation -- 1.2 Why WFST? -- 1.3 Purpose of this book -- 1.4 Book organization -- | |
| 505 | 0 | _a2. Brief overview of speech recognition -- 2.1 Statistical framework of speech recognition -- 2.2 Speech analysis -- 2.3 Acoustic model -- 2.3.1 Hidden Markov model -- 2.3.2 Computation of acoustic likelihood -- 2.3.3 Output probability distribution -- 2.4 Subword models and pronunciation lexicon -- 2.5 Context-dependent phone models -- 2.6 Language model -- 2.6.1 Finite-state grammar -- 2.6.2 N-gram model -- 2.6.3 Back-off smoothing -- 2.7 Decoder -- 2.7.1 Viterbi algorithm for continuous speech recognition -- 2.7.2 Time-synchronous Viterbi beam search -- 2.7.3 Practical techniques for LVCSR -- 2.7.4 Context-dependent phone search network -- 2.7.5 Lattice generation and N-best search -- | |
| 505 | 0 | _a3. Introduction to weighted finite-state transducers -- 3.1 Finite automata -- 3.2 Basic properties of finite automata -- 3.3 Semiring -- 3.4 Basic operations -- 3.5 Transducer composition -- 3.6 Optimization -- 3.6.1 Determinization -- 3.6.2 Weight pushing -- 3.6.3 Minimization -- 3.7 Epsilon removal -- | |
| 505 | 8 | _a4. Speech recognition by weighted finite-state transducers -- 4.1 Overview of WFST-based speech recognition -- 4.2 Construction of component WFSTs -- 4.2.1 Acoustic models -- 4.2.2 Phone context dependency -- 4.2.3 Pronunciation lexicon -- 4.2.4 Language models -- 4.3 Composition and optimization -- 4.4 Decoding algorithm using a single WFST -- 4.5 Decoding performance -- | |
| 505 | 0 | _a5. Dynamic decoders with on-the-fly WFST operations -- 5.1 Problems in the native WFST approach -- 5.2 On-the-fly composition and optimization -- 5.3 Known problems of on-the-fly composition approach -- 5.4 Look-ahead composition -- 5.4.1 How to obtain prospective output labels -- 5.4.2 Basic principle of look-ahead composition -- 5.4.3 Realization of look-ahead composition using a filter transducer -- 5.4.4 Look-ahead composition with weight pushing -- 5.4.5 Generalized composition -- 5.4.6 Interval representation of label sets -- 5.5 On-the-fly rescoring approach -- 5.5.1 Construction of component WFSTs for on-the-fly rescoring -- 5.5.2 Concept -- 5.5.3 Algorithm -- 5.5.4 Approximation in decoding -- 5.5.5 Comparison with look-ahead composition -- | |
| 505 | 0 | _a6. Summary and perspective -- 6.1 Realization of advanced speech recognition techniques using WFSTs -- 6.1.1 WFSTs for extended language models -- 6.1.2 Dynamic grammars based on WFSTs -- 6.1.3 Wide-context-dependent HMMs -- 6.1.4 Extension of WFSTs for multi-modal inputs -- 6.1.5 Use of WFSTs for learning -- 6.2 Integration of speech and language processing -- 6.3 Other speech applications using WFSTs -- 6.4 Conclusion -- | |
| 505 | 0 | _aBibliography -- Authors' biographies. | |
| 650 | 0 | _aSpeech processing systems. | |
| 650 | 0 | _aAutomatic speech recognition. | |
| 650 | 0 | _aTransducers. | |
| 700 | 1 | _aNakamura, Atsushi. | |
| 830 | 0 |
_aSynthesis lectures on speech and audio processing, _x1932-1678 ; _v# 10. |
|
| 856 | 4 | 0 |
_3Abstract with links to full text _uhttp://dx.doi.org/10.2200/S00462ED1V01Y201212SAP010. |
| 856 | 4 | 0 |
_uhttp://www.morganclaypool.com/doi/abs/10.2200/S00462ED1V01Y201212SAP010 _3Morgan & Claypool. |
| 856 | 4 | 0 |
_uhttp://oclc-marc.ebrary.com/Doc?id=10649981 _3ebrary. |
| 856 | 4 | 0 |
_3EBSCOhost _uhttp://search.ebscohost.com/login.aspx?direct=true&scope=site&db=nlebk&db=nlabk&AN=503575. |
| 856 | 4 | 0 |
_uhttp://site.ebrary.com/id/10649981 _3ebrary. |
| 856 | 4 | 0 |
_zAvailable by subscription from Safari Books Online _uhttp://proquest.safaribooksonline.com/?fpi=9781608454730. |
| 907 |
_a.b16118078 _b2019-11-12 _c2019-11-12 |
||
| 942 |
_c01 _n0 _kTK7882.S65H647 3 |
||
| 914 | _avtls003583506 | ||
| 990 | _arab | ||
| 991 | _aFakulti Kejuruteraan dan Seni Bina | ||
| 998 |
_al _b2015-10-04 _cm _da _feng _gcau _y0 _z.b16118078 |
||
| 999 |
_c590767 _d590767 |
||