System and method for combining frame and segment level processing, via temporal pooling, for phonetic classification

US-201113281102-A

Disclosed herein are systems, methods, and non-transitory computer-readable storage media for combining frame and segment level processing, via temporal pooling, for phonetic classification. A frame processor unit receives an input and extracts time-dependent features from it. A plurality of pooling interface units generates a plurality of feature vectors by pooling the time-dependent features and selecting among them according to a plurality of selection strategies. Next, a plurality of segmental classification units generates scores for the feature vectors. Each segmental classification unit (SCU) can be dedicated to a specific pooling interface unit (PIU) to form a PIU-SCU combination. Multiple PIU-SCU combinations can be further combined to form an ensemble of combinations, and the ensemble can be diversified by varying the pooling operations used by the PIU-SCU combinations. Based on the scores, the plurality of segmental classification units selects a class label and returns a result.
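
The pipeline described in the abstract can be illustrated with a minimal Python sketch. Everything concrete here is an assumption made for illustration, not taken from the patent: the function names (pool_segment, fit_centroids, classify), the choice of mean and max as pooling operations, the split of a segment into equal contiguous regions, the nearest-centroid scorer standing in for a segmental classification unit, and the toy two-dimensional frame features.

```python
from functools import partial
from statistics import mean


def pool_segment(frames, n_regions, op):
    """Hypothetical PIU: split a segment's frames into contiguous regions
    and pool each feature dimension within each region into one value."""
    n, dim = len(frames), len(frames[0])
    vector = []
    for r in range(n_regions):
        start = r * n // n_regions
        end = max((r + 1) * n // n_regions, start + 1)  # never an empty region
        region = frames[start:end]
        vector.extend(op([f[d] for f in region]) for d in range(dim))
    return vector  # fixed length regardless of segment duration


def fit_centroids(piu, labeled_segments):
    """Hypothetical SCU training: one mean pooled vector per class label."""
    by_label = {}
    for frames, label in labeled_segments:
        by_label.setdefault(label, []).append(piu(frames))
    return {lab: [mean(col) for col in zip(*vecs)] for lab, vecs in by_label.items()}


def classify(frames, combos):
    """Sum the scores of every PIU-SCU combination and pick the best label.
    The score is the negative squared distance to the class centroid."""
    total = {}
    for piu, centroids in combos:
        v = piu(frames)
        for label, c in centroids.items():
            score = -sum((a - b) ** 2 for a, b in zip(v, c))
            total[label] = total.get(label, 0.0) + score
    return max(total, key=total.get)


# Toy frame-level features: each frame is a 2-dimensional vector.
train = [
    ([[0.0, 1.0], [0.5, 1.0], [1.0, 1.0]], "rise"),
    ([[1.0, 0.0], [0.5, 0.0], [0.0, 0.0]], "fall"),
]

# Diversify the ensemble by varying the pooling operation and region count.
pius = [
    partial(pool_segment, n_regions=3, op=mean),
    partial(pool_segment, n_regions=2, op=max),
]
combos = [(piu, fit_centroids(piu, train)) for piu in pius]

label = classify([[0.1, 0.9], [0.4, 1.0], [0.6, 0.9], [0.9, 1.0]], combos)
print(label)
```

Because each pooling unit maps a variable number of frames to a fixed-length vector, segments of different durations (three frames in training, four at test time above) can be scored by the same classifier. Summing scores is only one plausible way to combine the ensemble; the abstract does not say how the combination is performed.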

