KIT-Bibliothek

16: Kognitive Systeme (Cognitive Systems), lecture, summer semester 2018, 25.06.2018

This audio or video file is copyrighted. Access is only allowed via computers of the Karlsruhe Institute of Technology (KIT).

Author

Alexander Waibel

Editor

KIT | Webcast

Participating institute

Institut für Anthropomatik und Robotik (IAR)

Genre

Lecture

Description

  • 0:00:00 Start
  • 0:01:37 Vocal Tract Model of Speech
  • 0:07:11 Speech Recognition (System Overview)
  • 0:10:03 How good is a Recognizer?
  • 0:19:01 Dimensions of Difficulty
  • 0:27:45 Error Rates vs. Recognition Tasks
  • 0:35:13 The Fundamental Formula of Speech Recognition
  • 0:42:32 Speech Recognition (Components)
  • 0:46:36 Voiced and Unvoiced Phonemes
  • 0:49:36 Spectrogram
  • 0:52:16 Frequency Response of the Basilar Membrane
  • 0:54:10 Front End Processing
  • 0:56:00 Voiced and Unvoiced Phonemes
  • 0:59:48 Speech Recognition (system components)
  • 1:00:47 Markov Models
  • 1:05:03 Single Fair Coin
  • 1:06:11 Discrete Observation HMM
  • 1:11:40 Hidden Markov Models
  • 1:14:30 Acoustic Modeling
  • 1:18:02 HMM Problems and Solutions
  • 1:20:54 Evaluation
  • 1:24:05 The Forward Algorithm

Duration (hh:mm:ss)

01:27:47

Series

Kognitive Systeme (Cognitive Systems), lecture, summer semester 2018

Published on

28.06.2018

Subject area

Computer science

License

KITopen Licence

Resolution 1280 x 720 pixels
Aspect ratio 16:9
Audio bitrate 128000 bps
Audio channels 2
Audio Codec aac
Audio Sample Rate 48000 Hz
Total Bitrate 934130 bps
Color Space yuv420p
Container mov,mp4,m4a,3gp,3g2,mj2
Media Type video/mp4
Duration 5267 s
Filename DIVA-2018-500_hd.mp4
File Size 615,011,238 bytes
Frame Rate 25
Video Bitrate 800035 bps
Video Codec h264
