KIT-Bibliothek

18: Kognitive Systeme (Cognitive Systems), lecture, summer semester 2018, 02.07.2018

This audio or video file is protected by copyright. Access is permitted only from computers of the Karlsruhe Institute of Technology (KIT).

Author

Alexander Waibel

Publisher

KIT | Webcast

Participating institute

Institut für Anthropomatik und Robotik (IAR)

Genre

Lecture

Description

  • 0:00:00 Start
  • 0:00:05 Speech Recognition (Components)
  • 0:06:07 Language Models
  • 0:08:06 Language Model Performance
  • 0:10:24 Estimation of Language Model Quality
  • 0:19:23 Decoder
  • 0:19:55 Decoder - Assembling the Pieces
  • 0:20:59 Decoding with Beam Search
  • 0:21:22 Beam vs. WER
  • 0:21:56 Improving Speed on Cooperative Speech
  • 0:22:11 Viterbi Alignment
  • 0:22:59 Measuring Recognizer Performance
  • 0:23:22 Sloppy Speech
  • 0:24:17 Neural Nets
  • 0:25:36 Neural Language Model
  • 0:26:35 How good does it have to be
  • 0:30:48 Neural Language Processing and Machine Translation
  • 0:31:30 Natural Language Processing
  • 0:33:39 Speech Deployment
  • 0:37:56 Voice Agents
  • 0:43:49 Emotional Speech
  • 0:45:25 NLU with slot filling
  • 0:46:22 Intelligent System and a Language Transparent World
  • 0:47:36 How to pick a Research Project
  • 0:50:55 Human Interaction
  • 0:51:00 Connecting a Multilingual World
  • 0:52:08 Everyone Speaks English
  • 0:52:31 Human effort
  • 0:52:41 Language Transparency
  • 0:52:47 Language Text
  • 0:52:52 Social Text
  • 0:53:17 Lectures
  • 0:53:35 Human Language Challenges
  • 0:54:10 Body Language and Facial Expressions
  • 0:54:45 Can Technology provide a Solution
  • 0:54:59 Language is Ambiguous
  • 0:57:01 Neural Network
  • 0:58:10 In Image Processing
  • 0:58:35 Exponential Increase in Computing
  • 0:59:03 English Text Corpora
  • 0:59:36 Deep Neural Nets
  • 1:00:00 Conversational Speech
  • 1:00:23 Machine Translation
  • 1:01:05 Statistical Machine Translation
  • 1:01:33 Alignment/Decoding in MT
  • 1:01:54 Recurrent Neural Nets
  • 1:02:14 RNN Encoder - Decoder
  • 1:03:09 MT Benchmarks - KIT Performance
  • 1:03:45 Consecutive Interpretation
  • 1:04:16 Interpreting Machine
  • 1:05:49 First Speech Translation Videocall
  • 1:06:52 Jibbigo on Apple Commercials
  • 1:10:28 Unlimited Domain Simultaneous
  • 1:14:49 Human-Machine symbiosis
  • 1:15:02 Voting Sessions
  • 1:16:07 German Compounding
  • 1:16:29 Words
  • 1:19:13 The Long Tail of Language
  • 1:19:44 Language Adaptive Networks
  • 1:21:39 Neural "Interlingua"?
  • 1:22:56 Multi-Modal Translation
  • 1:23:46 Meeting of the Future
  • 1:23:59 Grand Challenges
  • 1:28:26 Neural Networks for Drone Control
  • 1:28:38 Bitcraze Crazyflie
  • 1:29:08 Controlling the Crazyflie
  • 1:30:39 Control Loop
  • 1:31:49 Control with Neural Networks

Duration (hh:mm:ss)

01:33:48

Series

Kognitive Systeme (Cognitive Systems), lecture, summer semester 2018

Published on

05.07.2018

Subject area

Computer science

License

KITopen license

Resolution 1280 x 720 pixels
Aspect ratio 16:9
Audio bitrate 128000 bps
Audio channels 2
Audio codec aac
Audio sample rate 48000 Hz
Total bitrate 934269 bps
Color space yuv420p
Container mov,mp4,m4a,3gp,3g2,mj2
Media type video/mp4
Duration 5628 s
File name DIVA-2018-516_hd.mp4
File size 657,256,581 bytes
Frame rate 25
Video bitrate 800167 bps
Video codec h264
