{"id":983,"date":"2022-04-06T14:43:16","date_gmt":"2022-04-06T14:43:16","guid":{"rendered":"https:\/\/uplatz.com\/blog\/?p=983"},"modified":"2023-09-17T08:39:34","modified_gmt":"2023-09-17T08:39:34","slug":"automatic-speech-recognition-systems-working-components-and-challenges","status":"publish","type":"post","link":"https:\/\/uplatz.com\/blog\/automatic-speech-recognition-systems-working-components-and-challenges\/","title":{"rendered":"Automatic Speech Recognition Systems: Working, Components and Challenges"},"content":{"rendered":"<h2><b><img loading=\"lazy\" decoding=\"async\" class=\"aligncenter size-full wp-image-984\" src=\"https:\/\/uplatz.com\/blog\/wp-content\/uploads\/2022\/04\/voiceeee.jpg\" alt=\"\" width=\"5760\" height=\"3840\" srcset=\"https:\/\/uplatz.com\/blog\/wp-content\/uploads\/2022\/04\/voiceeee.jpg 5760w, https:\/\/uplatz.com\/blog\/wp-content\/uploads\/2022\/04\/voiceeee-300x200.jpg 300w, https:\/\/uplatz.com\/blog\/wp-content\/uploads\/2022\/04\/voiceeee-768x512.jpg 768w, https:\/\/uplatz.com\/blog\/wp-content\/uploads\/2022\/04\/voiceeee-1024x683.jpg 1024w\" sizes=\"auto, (max-width: 5760px) 100vw, 5760px\" \/><span style=\"color: #800000;\">What is ASR?<\/span><\/b><\/h2>\n<p><span style=\"font-weight: 400; color: #000000;\">ASR is an Automatic Speech Recognition system. It is a technology that converts speech into text. 
With the help of this technology, we can talk to our machines in a natural way, much as we talk to other people.<\/span><\/p>\n<p><span style=\"color: #000000;\"><b>Examples<\/b><b> \u2013 <\/b><span style=\"font-weight: 400;\">YouTube, Alexa, smart TVs, etc.<\/span><\/span><\/p>\n<h2><span style=\"color: #800000;\"><b>Components in ASR<\/b><\/span><\/h2>\n<p><span style=\"font-weight: 400; color: #000000;\">There are three major components in ASR.<\/span><\/p>\n<ol>\n<li style=\"font-weight: 400;\" aria-level=\"1\"><span style=\"font-weight: 400; color: #000000;\">Lexicon<\/span><\/li>\n<li style=\"font-weight: 400;\" aria-level=\"1\"><span style=\"font-weight: 400; color: #000000;\">Acoustic Model<\/span><\/li>\n<li style=\"font-weight: 400;\" aria-level=\"1\"><span style=\"font-weight: 400; color: #000000;\">Language Model<\/span><\/li>\n<\/ol>\n<p><span style=\"color: #000000;\"><span style=\"color: #800000;\"><b>Lexicon<\/b><\/span><b> \u2013 <\/b><span style=\"font-weight: 400;\">The lexicon is the first step in decoding speech. It is a comprehensive lexical design for the ASR system that includes all the fundamental elements of spoken language and written vocabulary. The lexicon is the building block of the acoustic model for every vocal input.<\/span><\/span><\/p>\n<p><span style=\"color: #000000;\"><span style=\"color: #800000;\"><b>Acoustic Model<\/b><\/span><b> \u2013 <\/b><span style=\"font-weight: 400;\">The acoustic model is the second step in ASR. Its job is to split an audio signal into very short time frames. It then analyzes each frame and gives the probability of each phoneme (phonemes are the basic building-block sounds of a language and its words) occurring in that section of audio. 
The acoustic model aims to determine which sound is spoken in each frame.<\/span><\/span><\/p>\n<p><span style=\"font-weight: 400; color: #000000;\">The acoustic model is very important because different people pronounce the same phrase in different ways, and background noise and accents can make the same sentence sound different from one speaker to the next.<\/span><\/p>\n<p><span style=\"font-weight: 400; color: #000000;\">Acoustic models use deep learning algorithms to determine the relationship between audio frames and phonemes.<\/span><\/p>\n<p><span style=\"font-weight: 400; color: #000000;\">A very commonly used acoustic model in ASR is the Hidden Markov Model, which is based on the Markov chain model. It predicts the probability of an event based on a situation&#8217;s current state. This is how the acoustic model works.<\/span><\/p>\n<h2><span style=\"color: #800000;\"><b>Language Model<\/b><\/span><\/h2>\n<p><span style=\"font-weight: 400; color: #000000;\">The language model is the third step in the ASR system. 
It helps recognize the intent of spoken phrases and compose word sequences. Like the acoustic model, it uses deep learning algorithms, here trained on text data, to estimate the probability of which word comes next in a phrase.<\/span><\/p>\n<p><span style=\"font-weight: 400; color: #000000;\">It is common for speech recognition software to use N-gram probabilities when turning spoken words into text.<\/span><\/p>\n<p><span style=\"font-weight: 400; color: #000000;\">With the help of these three components, an ASR system is able to make close-to-accurate predictions of the words or sentences in the audio input.<\/span><\/p>\n<h2><span style=\"color: #800000;\"><b>How does ASR work?<\/b><\/span><\/h2>\n<p><span style=\"font-weight: 400; color: #000000;\">Suppose you ask your device, \u201cWhat is the weather forecast?\u201d The device creates a wave file of your words, reduces the background noise, and normalizes the volume. The filtered waveform is then broken into phonemes (the sounds used to build words). 
Each phoneme is like a link in a chain: starting from the first phoneme, statistical analysis is used to find the most likely phonemes that follow.<\/span><\/p>\n<p><span style=\"font-weight: 400; color: #000000;\">An ASR system focuses on \u201ctagged\u201d words. Its vocabulary may consist of 60 thousand or more words, which means about 216 trillion possible word combinations if you speak just three words in a sequence. It would be impractical for an ASR system to scan its entire vocabulary for each word and process each one individually, so the ASR reacts to certain \u201ctagged\u201d words and phrases such as \u201cweather forecast\u201d and \u201ccheck my balance\u201d.<\/span><\/p>\n<h2><span style=\"color: #800000;\"><b>How does ASR learn from Humans?<\/b><\/span><\/h2>\n<p><span style=\"color: #000000;\"><span style=\"font-weight: 400;\">ASR learning is based on two mechanisms.<\/span><\/span><\/p>\n<ol>\n<li style=\"font-weight: 400;\" aria-level=\"1\"><span style=\"font-weight: 400; color: #000000;\">Human Tuning<\/span><\/li>\n<li style=\"font-weight: 400;\" aria-level=\"1\"><span style=\"font-weight: 400; color: #000000;\">Active Learning<\/span><\/li>\n<\/ol>\n<p><span style=\"color: #000000;\"><span style=\"color: #800000;\"><b>Human Tuning<\/b><\/span><b> \u2013 <\/b><span style=\"font-weight: 400;\">In this approach, people review the conversation logs of a given ASR software interface and look for commonly used words that are not yet in its pre-programmed vocabulary. Adding those words to the software vocabulary helps the software understand speech better.<\/span><\/span><\/p>\n<p><span style=\"color: #000000;\"><span style=\"color: #800000;\"><b>Active Learning <\/b><\/span><b>\u2013<\/b><span style=\"font-weight: 400;\"> Active learning is a more advanced form of learning than human tuning. 
The system constantly expands its vocabulary by autonomously learning and adopting new words.<\/span><\/span><\/p>\n<h2><span style=\"color: #000000;\"><b><span style=\"color: #800000;\">Challenges in ASR<\/span><\/b><\/span><\/h2>\n<ul>\n<li style=\"font-weight: 400;\" aria-level=\"1\"><span style=\"font-weight: 400; color: #000000;\">Background noise<\/span><\/li>\n<li style=\"font-weight: 400;\" aria-level=\"1\"><span style=\"font-weight: 400; color: #000000;\">Difficult accents<\/span><\/li>\n<li style=\"font-weight: 400;\" aria-level=\"1\"><span style=\"font-weight: 400; color: #000000;\">Lack of trust and privacy issues<\/span><\/li>\n<li style=\"font-weight: 400;\" aria-level=\"1\"><span style=\"font-weight: 400; color: #000000;\">Touchless screens<\/span><\/li>\n<li style=\"font-weight: 400;\" aria-level=\"1\"><span style=\"font-weight: 400; color: #000000;\">Poor recording equipment, which makes it hard to identify what the user said<\/span><\/li>\n<\/ul>\n","protected":false},"excerpt":{"rendered":"<p>What is ASR? ASR is an Automatic Speech Recognition system. It is a technology that converts speech into text. 
With the help of this technology, we can talk to our <span class=\"readmore\"><a href=\"https:\/\/uplatz.com\/blog\/automatic-speech-recognition-systems-working-components-and-challenges\/\">Read More &#8230;<\/a><\/span><\/p>\n","protected":false},"author":2,"featured_media":984,"comment_status":"open","ping_status":"open","sticky":false,"template":"","format":"standard","meta":{"footnotes":""},"categories":[86],"tags":[],"class_list":["post-983","post","type-post","status-publish","format-standard","has-post-thumbnail","hentry","category-technology"],"yoast_head":"<!-- This site is optimized with the Yoast SEO plugin v27.4 - https:\/\/yoast.com\/product\/yoast-seo-wordpress\/ -->\n<title>Automatic Speech Recognition Systems: Working, Components and Challenges | Uplatz Blog<\/title>\n<meta name=\"robots\" content=\"index, follow, max-snippet:-1, max-image-preview:large, max-video-preview:-1\" \/>\n<link rel=\"canonical\" href=\"https:\/\/uplatz.com\/blog\/automatic-speech-recognition-systems-working-components-and-challenges\/\" \/>\n<meta property=\"og:locale\" content=\"en_US\" \/>\n<meta property=\"og:type\" content=\"article\" \/>\n<meta property=\"og:title\" content=\"Automatic Speech Recognition Systems: Working, Components and Challenges | Uplatz Blog\" \/>\n<meta property=\"og:description\" content=\"What is ASR? ASR is an Automatic Speech Recognition system. It is a technology that converts speech into text. 
With the help of this technology, we can talk to our Read More ...\" \/>\n<meta property=\"og:url\" content=\"https:\/\/uplatz.com\/blog\/automatic-speech-recognition-systems-working-components-and-challenges\/\" \/>\n<meta property=\"og:site_name\" content=\"Uplatz Blog\" \/>\n<meta property=\"article:publisher\" content=\"https:\/\/www.facebook.com\/Uplatz-1077816825610769\/\" \/>\n<meta property=\"article:published_time\" content=\"2022-04-06T14:43:16+00:00\" \/>\n<meta property=\"article:modified_time\" content=\"2023-09-17T08:39:34+00:00\" \/>\n<meta property=\"og:image\" content=\"https:\/\/uplatz.com\/blog\/wp-content\/uploads\/2022\/04\/voiceeee.jpg\" \/>\n\t<meta property=\"og:image:width\" content=\"5760\" \/>\n\t<meta property=\"og:image:height\" content=\"3840\" \/>\n\t<meta property=\"og:image:type\" content=\"image\/jpeg\" \/>\n<meta name=\"author\" content=\"uplatzblog\" \/>\n<meta name=\"twitter:card\" content=\"summary_large_image\" \/>\n<meta name=\"twitter:creator\" content=\"@uplatz_global\" \/>\n<meta name=\"twitter:site\" content=\"@uplatz_global\" \/>\n<meta name=\"twitter:label1\" content=\"Written by\" \/>\n\t<meta name=\"twitter:data1\" content=\"uplatzblog\" \/>\n\t<meta name=\"twitter:label2\" content=\"Est. 
reading time\" \/>\n\t<meta name=\"twitter:data2\" content=\"4 minutes\" \/>\n<script type=\"application\/ld+json\" class=\"yoast-schema-graph\">{\"@context\":\"https:\\\/\\\/schema.org\",\"@graph\":[{\"@type\":\"Article\",\"@id\":\"https:\\\/\\\/uplatz.com\\\/blog\\\/automatic-speech-recognition-systems-working-components-and-challenges\\\/#article\",\"isPartOf\":{\"@id\":\"https:\\\/\\\/uplatz.com\\\/blog\\\/automatic-speech-recognition-systems-working-components-and-challenges\\\/\"},\"author\":{\"name\":\"uplatzblog\",\"@id\":\"https:\\\/\\\/uplatz.com\\\/blog\\\/#\\\/schema\\\/person\\\/8ecae69a21d0757bdb2f776e67d2645e\"},\"headline\":\"Automatic Speech Recognition Systems: Working, Components and Challenges\",\"datePublished\":\"2022-04-06T14:43:16+00:00\",\"dateModified\":\"2023-09-17T08:39:34+00:00\",\"mainEntityOfPage\":{\"@id\":\"https:\\\/\\\/uplatz.com\\\/blog\\\/automatic-speech-recognition-systems-working-components-and-challenges\\\/\"},\"wordCount\":657,\"commentCount\":0,\"publisher\":{\"@id\":\"https:\\\/\\\/uplatz.com\\\/blog\\\/#organization\"},\"image\":{\"@id\":\"https:\\\/\\\/uplatz.com\\\/blog\\\/automatic-speech-recognition-systems-working-components-and-challenges\\\/#primaryimage\"},\"thumbnailUrl\":\"https:\\\/\\\/uplatz.com\\\/blog\\\/wp-content\\\/uploads\\\/2022\\\/04\\\/voiceeee.jpg\",\"articleSection\":[\"Technology\"],\"inLanguage\":\"en-US\",\"potentialAction\":[{\"@type\":\"CommentAction\",\"name\":\"Comment\",\"target\":[\"https:\\\/\\\/uplatz.com\\\/blog\\\/automatic-speech-recognition-systems-working-components-and-challenges\\\/#respond\"]}]},{\"@type\":\"WebPage\",\"@id\":\"https:\\\/\\\/uplatz.com\\\/blog\\\/automatic-speech-recognition-systems-working-components-and-challenges\\\/\",\"url\":\"https:\\\/\\\/uplatz.com\\\/blog\\\/automatic-speech-recognition-systems-working-components-and-challenges\\\/\",\"name\":\"Automatic Speech Recognition Systems: Working, Components and Challenges | Uplatz 
Blog\",\"isPartOf\":{\"@id\":\"https:\\\/\\\/uplatz.com\\\/blog\\\/#website\"},\"primaryImageOfPage\":{\"@id\":\"https:\\\/\\\/uplatz.com\\\/blog\\\/automatic-speech-recognition-systems-working-components-and-challenges\\\/#primaryimage\"},\"image\":{\"@id\":\"https:\\\/\\\/uplatz.com\\\/blog\\\/automatic-speech-recognition-systems-working-components-and-challenges\\\/#primaryimage\"},\"thumbnailUrl\":\"https:\\\/\\\/uplatz.com\\\/blog\\\/wp-content\\\/uploads\\\/2022\\\/04\\\/voiceeee.jpg\",\"datePublished\":\"2022-04-06T14:43:16+00:00\",\"dateModified\":\"2023-09-17T08:39:34+00:00\",\"breadcrumb\":{\"@id\":\"https:\\\/\\\/uplatz.com\\\/blog\\\/automatic-speech-recognition-systems-working-components-and-challenges\\\/#breadcrumb\"},\"inLanguage\":\"en-US\",\"potentialAction\":[{\"@type\":\"ReadAction\",\"target\":[\"https:\\\/\\\/uplatz.com\\\/blog\\\/automatic-speech-recognition-systems-working-components-and-challenges\\\/\"]}]},{\"@type\":\"ImageObject\",\"inLanguage\":\"en-US\",\"@id\":\"https:\\\/\\\/uplatz.com\\\/blog\\\/automatic-speech-recognition-systems-working-components-and-challenges\\\/#primaryimage\",\"url\":\"https:\\\/\\\/uplatz.com\\\/blog\\\/wp-content\\\/uploads\\\/2022\\\/04\\\/voiceeee.jpg\",\"contentUrl\":\"https:\\\/\\\/uplatz.com\\\/blog\\\/wp-content\\\/uploads\\\/2022\\\/04\\\/voiceeee.jpg\",\"width\":5760,\"height\":3840},{\"@type\":\"BreadcrumbList\",\"@id\":\"https:\\\/\\\/uplatz.com\\\/blog\\\/automatic-speech-recognition-systems-working-components-and-challenges\\\/#breadcrumb\",\"itemListElement\":[{\"@type\":\"ListItem\",\"position\":1,\"name\":\"Home\",\"item\":\"https:\\\/\\\/uplatz.com\\\/blog\\\/\"},{\"@type\":\"ListItem\",\"position\":2,\"name\":\"Automatic Speech Recognition Systems: Working, Components and Challenges\"}]},{\"@type\":\"WebSite\",\"@id\":\"https:\\\/\\\/uplatz.com\\\/blog\\\/#website\",\"url\":\"https:\\\/\\\/uplatz.com\\\/blog\\\/\",\"name\":\"Uplatz Blog\",\"description\":\"Uplatz is a global IT Training 
&amp; Consulting company\",\"publisher\":{\"@id\":\"https:\\\/\\\/uplatz.com\\\/blog\\\/#organization\"},\"potentialAction\":[{\"@type\":\"SearchAction\",\"target\":{\"@type\":\"EntryPoint\",\"urlTemplate\":\"https:\\\/\\\/uplatz.com\\\/blog\\\/?s={search_term_string}\"},\"query-input\":{\"@type\":\"PropertyValueSpecification\",\"valueRequired\":true,\"valueName\":\"search_term_string\"}}],\"inLanguage\":\"en-US\"},{\"@type\":\"Organization\",\"@id\":\"https:\\\/\\\/uplatz.com\\\/blog\\\/#organization\",\"name\":\"uplatz.com\",\"url\":\"https:\\\/\\\/uplatz.com\\\/blog\\\/\",\"logo\":{\"@type\":\"ImageObject\",\"inLanguage\":\"en-US\",\"@id\":\"https:\\\/\\\/uplatz.com\\\/blog\\\/#\\\/schema\\\/logo\\\/image\\\/\",\"url\":\"https:\\\/\\\/uplatz.com\\\/blog\\\/wp-content\\\/uploads\\\/2016\\\/11\\\/Uplatz-Logo-Copy-2.png\",\"contentUrl\":\"https:\\\/\\\/uplatz.com\\\/blog\\\/wp-content\\\/uploads\\\/2016\\\/11\\\/Uplatz-Logo-Copy-2.png\",\"width\":1280,\"height\":800,\"caption\":\"uplatz.com\"},\"image\":{\"@id\":\"https:\\\/\\\/uplatz.com\\\/blog\\\/#\\\/schema\\\/logo\\\/image\\\/\"},\"sameAs\":[\"https:\\\/\\\/www.facebook.com\\\/Uplatz-1077816825610769\\\/\",\"https:\\\/\\\/x.com\\\/uplatz_global\",\"https:\\\/\\\/www.instagram.com\\\/\",\"https:\\\/\\\/www.linkedin.com\\\/company\\\/7956715?trk=tyah&amp;amp;amp;amp;trkInfo=clickedVertical:company,clickedEntityId:7956715,idx:1-1-1,tarId:1464353969447,tas:uplatz\"]},{\"@type\":\"Person\",\"@id\":\"https:\\\/\\\/uplatz.com\\\/blog\\\/#\\\/schema\\\/person\\\/8ecae69a21d0757bdb2f776e67d2645e\",\"name\":\"uplatzblog\",\"image\":{\"@type\":\"ImageObject\",\"inLanguage\":\"en-US\",\"@id\":\"https:\\\/\\\/secure.gravatar.com\\\/avatar\\\/7f814c72279199f59ded4418a8653ad15f5f8904ac75e025a4e2abe24d58fa5d?s=96&d=mm&r=g\",\"url\":\"https:\\\/\\\/secure.gravatar.com\\\/avatar\\\/7f814c72279199f59ded4418a8653ad15f5f8904ac75e025a4e2abe24d58fa5d?s=96&d=mm&r=g\",\"contentUrl\":\"https:\\\/\\\/secure.gravatar.com\\\/avatar\\\/7f81
4c72279199f59ded4418a8653ad15f5f8904ac75e025a4e2abe24d58fa5d?s=96&d=mm&r=g\",\"caption\":\"uplatzblog\"}}]}<\/script>\n<!-- \/ Yoast SEO plugin. -->","yoast_head_json":{"title":"Automatic Speech Recognition Systems: Working, Components and Challenges | Uplatz Blog","robots":{"index":"index","follow":"follow","max-snippet":"max-snippet:-1","max-image-preview":"max-image-preview:large","max-video-preview":"max-video-preview:-1"},"canonical":"https:\/\/uplatz.com\/blog\/automatic-speech-recognition-systems-working-components-and-challenges\/","og_locale":"en_US","og_type":"article","og_title":"Automatic Speech Recognition Systems: Working, Components and Challenges | Uplatz Blog","og_description":"What is ASR? ASR is an Automatic Speech Recognition system. It is a technology that converts speech into text. With the help of this technology, we can talk to our Read More ...","og_url":"https:\/\/uplatz.com\/blog\/automatic-speech-recognition-systems-working-components-and-challenges\/","og_site_name":"Uplatz Blog","article_publisher":"https:\/\/www.facebook.com\/Uplatz-1077816825610769\/","article_published_time":"2022-04-06T14:43:16+00:00","article_modified_time":"2023-09-17T08:39:34+00:00","og_image":[{"width":5760,"height":3840,"url":"https:\/\/uplatz.com\/blog\/wp-content\/uploads\/2022\/04\/voiceeee.jpg","type":"image\/jpeg"}],"author":"uplatzblog","twitter_card":"summary_large_image","twitter_creator":"@uplatz_global","twitter_site":"@uplatz_global","twitter_misc":{"Written by":"uplatzblog","Est. 
reading time":"4 minutes"},"schema":{"@context":"https:\/\/schema.org","@graph":[{"@type":"Article","@id":"https:\/\/uplatz.com\/blog\/automatic-speech-recognition-systems-working-components-and-challenges\/#article","isPartOf":{"@id":"https:\/\/uplatz.com\/blog\/automatic-speech-recognition-systems-working-components-and-challenges\/"},"author":{"name":"uplatzblog","@id":"https:\/\/uplatz.com\/blog\/#\/schema\/person\/8ecae69a21d0757bdb2f776e67d2645e"},"headline":"Automatic Speech Recognition Systems: Working, Components and Challenges","datePublished":"2022-04-06T14:43:16+00:00","dateModified":"2023-09-17T08:39:34+00:00","mainEntityOfPage":{"@id":"https:\/\/uplatz.com\/blog\/automatic-speech-recognition-systems-working-components-and-challenges\/"},"wordCount":657,"commentCount":0,"publisher":{"@id":"https:\/\/uplatz.com\/blog\/#organization"},"image":{"@id":"https:\/\/uplatz.com\/blog\/automatic-speech-recognition-systems-working-components-and-challenges\/#primaryimage"},"thumbnailUrl":"https:\/\/uplatz.com\/blog\/wp-content\/uploads\/2022\/04\/voiceeee.jpg","articleSection":["Technology"],"inLanguage":"en-US","potentialAction":[{"@type":"CommentAction","name":"Comment","target":["https:\/\/uplatz.com\/blog\/automatic-speech-recognition-systems-working-components-and-challenges\/#respond"]}]},{"@type":"WebPage","@id":"https:\/\/uplatz.com\/blog\/automatic-speech-recognition-systems-working-components-and-challenges\/","url":"https:\/\/uplatz.com\/blog\/automatic-speech-recognition-systems-working-components-and-challenges\/","name":"Automatic Speech Recognition Systems: Working, Components and Challenges | Uplatz 
Blog","isPartOf":{"@id":"https:\/\/uplatz.com\/blog\/#website"},"primaryImageOfPage":{"@id":"https:\/\/uplatz.com\/blog\/automatic-speech-recognition-systems-working-components-and-challenges\/#primaryimage"},"image":{"@id":"https:\/\/uplatz.com\/blog\/automatic-speech-recognition-systems-working-components-and-challenges\/#primaryimage"},"thumbnailUrl":"https:\/\/uplatz.com\/blog\/wp-content\/uploads\/2022\/04\/voiceeee.jpg","datePublished":"2022-04-06T14:43:16+00:00","dateModified":"2023-09-17T08:39:34+00:00","breadcrumb":{"@id":"https:\/\/uplatz.com\/blog\/automatic-speech-recognition-systems-working-components-and-challenges\/#breadcrumb"},"inLanguage":"en-US","potentialAction":[{"@type":"ReadAction","target":["https:\/\/uplatz.com\/blog\/automatic-speech-recognition-systems-working-components-and-challenges\/"]}]},{"@type":"ImageObject","inLanguage":"en-US","@id":"https:\/\/uplatz.com\/blog\/automatic-speech-recognition-systems-working-components-and-challenges\/#primaryimage","url":"https:\/\/uplatz.com\/blog\/wp-content\/uploads\/2022\/04\/voiceeee.jpg","contentUrl":"https:\/\/uplatz.com\/blog\/wp-content\/uploads\/2022\/04\/voiceeee.jpg","width":5760,"height":3840},{"@type":"BreadcrumbList","@id":"https:\/\/uplatz.com\/blog\/automatic-speech-recognition-systems-working-components-and-challenges\/#breadcrumb","itemListElement":[{"@type":"ListItem","position":1,"name":"Home","item":"https:\/\/uplatz.com\/blog\/"},{"@type":"ListItem","position":2,"name":"Automatic Speech Recognition Systems: Working, Components and Challenges"}]},{"@type":"WebSite","@id":"https:\/\/uplatz.com\/blog\/#website","url":"https:\/\/uplatz.com\/blog\/","name":"Uplatz Blog","description":"Uplatz is a global IT Training &amp; Consulting 
company","publisher":{"@id":"https:\/\/uplatz.com\/blog\/#organization"},"potentialAction":[{"@type":"SearchAction","target":{"@type":"EntryPoint","urlTemplate":"https:\/\/uplatz.com\/blog\/?s={search_term_string}"},"query-input":{"@type":"PropertyValueSpecification","valueRequired":true,"valueName":"search_term_string"}}],"inLanguage":"en-US"},{"@type":"Organization","@id":"https:\/\/uplatz.com\/blog\/#organization","name":"uplatz.com","url":"https:\/\/uplatz.com\/blog\/","logo":{"@type":"ImageObject","inLanguage":"en-US","@id":"https:\/\/uplatz.com\/blog\/#\/schema\/logo\/image\/","url":"https:\/\/uplatz.com\/blog\/wp-content\/uploads\/2016\/11\/Uplatz-Logo-Copy-2.png","contentUrl":"https:\/\/uplatz.com\/blog\/wp-content\/uploads\/2016\/11\/Uplatz-Logo-Copy-2.png","width":1280,"height":800,"caption":"uplatz.com"},"image":{"@id":"https:\/\/uplatz.com\/blog\/#\/schema\/logo\/image\/"},"sameAs":["https:\/\/www.facebook.com\/Uplatz-1077816825610769\/","https:\/\/x.com\/uplatz_global","https:\/\/www.instagram.com\/","https:\/\/www.linkedin.com\/company\/7956715?trk=tyah&amp;amp;amp;amp;trkInfo=clickedVertical:company,clickedEntityId:7956715,idx:1-1-1,tarId:1464353969447,tas:uplatz"]},{"@type":"Person","@id":"https:\/\/uplatz.com\/blog\/#\/schema\/person\/8ecae69a21d0757bdb2f776e67d2645e","name":"uplatzblog","image":{"@type":"ImageObject","inLanguage":"en-US","@id":"https:\/\/secure.gravatar.com\/avatar\/7f814c72279199f59ded4418a8653ad15f5f8904ac75e025a4e2abe24d58fa5d?s=96&d=mm&r=g","url":"https:\/\/secure.gravatar.com\/avatar\/7f814c72279199f59ded4418a8653ad15f5f8904ac75e025a4e2abe24d58fa5d?s=96&d=mm&r=g","contentUrl":"https:\/\/secure.gravatar.com\/avatar\/7f814c72279199f59ded4418a8653ad15f5f8904ac75e025a4e2abe24d58fa5d?s=96&d=mm&r=g","caption":"uplatzblog"}}]}},"_links":{"self":[{"href":"https:\/\/uplatz.com\/blog\/wp-json\/wp\/v2\/posts\/983","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/uplatz.com\/blog\/wp-json\/wp\/v2\/posts"}],"about":[{"hre
f":"https:\/\/uplatz.com\/blog\/wp-json\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/uplatz.com\/blog\/wp-json\/wp\/v2\/users\/2"}],"replies":[{"embeddable":true,"href":"https:\/\/uplatz.com\/blog\/wp-json\/wp\/v2\/comments?post=983"}],"version-history":[{"count":5,"href":"https:\/\/uplatz.com\/blog\/wp-json\/wp\/v2\/posts\/983\/revisions"}],"predecessor-version":[{"id":1989,"href":"https:\/\/uplatz.com\/blog\/wp-json\/wp\/v2\/posts\/983\/revisions\/1989"}],"wp:featuredmedia":[{"embeddable":true,"href":"https:\/\/uplatz.com\/blog\/wp-json\/wp\/v2\/media\/984"}],"wp:attachment":[{"href":"https:\/\/uplatz.com\/blog\/wp-json\/wp\/v2\/media?parent=983"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/uplatz.com\/blog\/wp-json\/wp\/v2\/categories?post=983"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/uplatz.com\/blog\/wp-json\/wp\/v2\/tags?post=983"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}