<!– wp:heading {“level”:1} –>
<h1 class=”wp-block-heading”>Introduction</h1>
<!– /wp:heading –>
<!– wp:heading {“className”:”wp-block-heading”} –>
<h2 class=”wp-block-heading”>Rule-Based POS Tagging</h2>
<!– /wp:heading –>
<!– wp:paragraph {“className”:”wp-block-heading”} –>
<p class=”wp-block-heading”>Rule-based POS tagging is one of the oldest tagging techniques. A rule-based tagger looks up each word in a dictionary or lexicon to get its possible tags. If a word has more than one possible tag, the tagger applies hand-written rules to pick the correct one. These rules disambiguate by analyzing the linguistic features of the word together with the words that precede and follow it. For example, one such rule might state that if the preceding word is an article, the word must be a noun.</p>
<!– /wp:paragraph –>
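<!– wp:paragraph –>
<p>The lookup-then-rule procedure described above can be sketched in a few lines of Python. This is only an illustration under stated assumptions: the lexicon, the function name <code>rule_based_tag</code>, the noun default for unknown words, and the "first listed tag" fallback are all hypothetical and not part of NLTK.</p>
<!– /wp:paragraph –>

```python
# Hypothetical lexicon: each word maps to the tags it may take.
LEXICON = {
    "the": ["DT"],
    "a": ["DT"],
    "can": ["MD", "NN"],   # modal or noun
    "book": ["NN", "VB"],  # noun or verb
    "open": ["VB", "JJ"],
}


def rule_based_tag(tokens):
    """Tag tokens with a lexicon lookup plus one hand-written context rule."""
    tagged = []
    for token in tokens:
        # Assumption: words missing from the lexicon default to noun.
        candidates = LEXICON.get(token.lower(), ["NN"])
        if len(candidates) == 1:
            tag = candidates[0]
        else:
            prev_tag = tagged[-1][1] if tagged else None
            noun_tags = [t for t in candidates if t.startswith("NN")]
            if prev_tag == "DT" and noun_tags:
                # Rule from the text: a word that follows an article is a noun.
                tag = noun_tags[0]
            else:
                # Assumption: with no applicable rule, take the first listed tag.
                tag = candidates[0]
        tagged.append((token, tag))
    return tagged


print(rule_based_tag(["the", "can"]))
print(rule_based_tag(["can", "a", "book"]))
```

<!– wp:paragraph –>
<p>A real rule-based tagger would carry many such rules and a far larger lexicon; the single rule here only shows where context enters the decision.</p>
<!– /wp:paragraph –>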
<!– wp:paragraph {“className”:”wp-block-heading”} –>
<p class=”wp-block-heading”>This guide covers stochastic parts-of-speech tagging with NLTK: the techniques and methods involved, and how they help assign tags accurately and efficiently.</p>
<!– /wp:paragraph –>
<!– wp:paragraph –>
<p>Another tagging technique is stochastic POS tagging. A model is called stochastic if it incorporates frequency or probability, that is, statistics. Many different approaches to part-of-speech tagging can therefore be described as stochastic taggers. The simplest stochastic taggers use one of the following approaches:</p>
<!– /wp:paragraph –>
<!– wp:heading {“className”:”wp-block-heading”} –>
<h2 class=”wp-block-heading”>Word Frequency Approach</h2>
<!– /wp:heading –>
<!– wp:paragraph –>
<p>In this approach, the stochastic tagger disambiguates a word based on the probability that it occurs with a particular tag. In other words, the tag seen most frequently with the word in the training set is the one assigned to an ambiguous instance of that word.</p>
<!– /wp:paragraph –>
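<!– wp:paragraph –>
<p>As a rough sketch of the word frequency approach, the snippet below counts how often each word appears with each tag in a tiny hand-tagged training set and then assigns every word its most frequent tag. The training sentences, the function names <code>train_unigram</code> and <code>tag</code>, and the noun default for unseen words are assumptions made for illustration.</p>
<!– /wp:paragraph –>

```python
from collections import Counter, defaultdict

# Tiny hand-tagged training set (hypothetical data for illustration).
TRAIN = [
    [("the", "DT"), ("dogs", "NNS"), ("bark", "VBP")],
    [("dogs", "NNS"), ("chase", "VBP"), ("cats", "NNS")],
    [("the", "DT"), ("sailor", "NN"), ("dogs", "VBZ"), ("the", "DT"), ("hatch", "NN")],
]


def train_unigram(tagged_sents):
    """Map each word to the tag it carries most often in the training set."""
    counts = defaultdict(Counter)
    for sent in tagged_sents:
        for word, tag in sent:
            counts[word.lower()][tag] += 1
    return {word: tags.most_common(1)[0][0] for word, tags in counts.items()}


def tag(tokens, model, default="NN"):
    """Assign each token its most frequent training tag, or a default if unseen."""
    return [(token, model.get(token.lower(), default)) for token in tokens]


model = train_unigram(TRAIN)
print(tag(["the", "sailor", "dogs", "the", "hatch"], model))
```

<!– wp:paragraph –>
<p>Because "dogs" is tagged NNS twice and VBZ once in this training set, the sketch always tags it NNS, whatever the surrounding words are. NLTK's <code>UnigramTagger</code> applies the same idea to a tagged training corpus.</p>
<!– /wp:paragraph –>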
<!– wp:heading {“className”:”wp-block-heading”} –>
<h2 class=”wp-block-heading”>Tag Sequence Probabilities</h2>
<!– /wp:heading –>
<!– wp:paragraph –>
<p>In this approach, the tagger calculates the probability of a given sequence of tags occurring. It is also called the n-gram approach, because the best tag for a given word is determined by the probability with which it occurs with the n previous tags.</p>
<!– /wp:paragraph –>
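<!– wp:paragraph –>
<p>A minimal sketch of the n-gram idea with n = 1 (one previous tag) follows. It counts tags conditioned on the previous tag and the current word, tags greedily from left to right, and falls back to plain word frequency when a context was never seen. The training data, the <code>START</code> marker, and the function names are hypothetical; a production tagger would typically search over whole tag sequences rather than decide greedily.</p>
<!– /wp:paragraph –>

```python
from collections import Counter, defaultdict

# Tiny hand-tagged training set (hypothetical data for illustration).
TRAIN = [
    [("the", "DT"), ("dogs", "NNS"), ("bark", "VBP")],
    [("dogs", "NNS"), ("chase", "VBP"), ("cats", "NNS")],
    [("the", "DT"), ("sailor", "NN"), ("dogs", "VBZ"), ("the", "DT"), ("hatch", "NN")],
]

START = "START"  # pseudo-tag marking the beginning of a sentence


def train_bigram(tagged_sents):
    """Count tags per (previous tag, word) context and per word alone."""
    context_counts = defaultdict(Counter)  # (prev_tag, word) -> tag counts
    word_counts = defaultdict(Counter)     # word -> tag counts, used as backoff
    for sent in tagged_sents:
        prev_tag = START
        for word, tag in sent:
            context_counts[(prev_tag, word.lower())][tag] += 1
            word_counts[word.lower()][tag] += 1
            prev_tag = tag
    return context_counts, word_counts


def bigram_tag(tokens, context_counts, word_counts, default="NN"):
    """Greedy tagging: pick the most frequent tag for (previous tag, word)."""
    tagged = []
    prev_tag = START
    for token in tokens:
        word = token.lower()
        if (prev_tag, word) in context_counts:
            # Most frequent tag = highest estimated P(tag | prev_tag, word).
            tag = context_counts[(prev_tag, word)].most_common(1)[0][0]
        elif word in word_counts:
            # Unseen context: back off to the word's most frequent tag.
            tag = word_counts[word].most_common(1)[0][0]
        else:
            tag = default
        tagged.append((token, tag))
        prev_tag = tag
    return tagged


context_counts, word_counts = train_bigram(TRAIN)
print(bigram_tag(["the", "sailor", "dogs", "the", "hatch"], context_counts, word_counts))
print(bigram_tag(["the", "dogs", "bark"], context_counts, word_counts))
```

<!– wp:paragraph –>
<p>With the previous tag as context, "dogs" is tagged VBZ after the noun "sailor" and NNS after the determiner "the". NLTK's <code>BigramTagger</code> uses a comparable context (the previous tag plus the current word) and is usually chained with a unigram backoff tagger.</p>
<!– /wp:paragraph –>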
<!– wp:paragraph –>
<p>Part-of-speech tagging is harder than just having a list of words and their parts of speech, because some words can represent more than one part of speech at different times, and because some parts of speech are complex or unspoken. This is not rare: a large percentage of word-forms are ambiguous. For example, even “dogs”, which is usually thought of as just a plural noun, can also be a verb:</p>
<!– /wp:paragraph –>
<!– wp:paragraph –>
<p><strong>The sailor dogs the hatch</strong></p>
<!– /wp:paragraph –>
<!– wp:paragraph –>
<p>Correct grammatical tagging will reflect that “dogs” is used here as a verb, not as the more common plural noun. Grammatical context is one way to determine this; <a href=”https://en.wikipedia.org/wiki/Semantic_analysis_(linguistics)”>semantic analysis</a> can also be used to infer that “sailor” and “hatch” implicate “dogs” as 1) in the nautical context and 2) an action applied to the object “hatch” (in this context, “dogs” is a <a href=”https://en.wikipedia.org/wiki/Seamanship”>nautical</a> term meaning “fastens (a watertight door) securely”).</p>
<!– /wp:paragraph –>
<!– wp:paragraph –>
<p>The word “can” is ambiguous in a similar way. It can be a modal verb used to form a question, a noun denoting a container for holding food or liquid, or a modal verb denoting the ability to do something.</p>
<!– /wp:paragraph –>
<!– wp:paragraph –>
<p><strong>Let’s learn with an NLTK part-of-speech example</strong>:</p>
<!– /wp:paragraph –>
<!– wp:paragraph –>
<p>POS tag list:</p>
<!– /wp:paragraph –>
<!– wp:paragraph –>
<p>CC coordinating conjunction</p>
<!– /wp:paragraph –>
<!– wp:paragraph –>
<p>CD cardinal number</p>
<!– /wp:paragraph –>
<!– wp:paragraph –>
<p>DT determiner</p>
<!– /wp:paragraph –>
<!– wp:paragraph –>
<p>IN preposition/subordinating conjunction</p>
<!– /wp:paragraph –>
<!– wp:paragraph –>
<p>JJ adjective ‘big’</p>
<!– /wp:paragraph –>
<!– wp:paragraph –>
<p>JJR adjective, comparative ‘bigger’</p>
<!– /wp:paragraph –>
<!– wp:paragraph –>
<p>JJS adjective, superlative ‘biggest’</p>
<!– /wp:paragraph –>
<!– wp:paragraph –>
<p>MD modal ‘could’, ‘will’</p>
<!– /wp:paragraph –>
<!– wp:paragraph –>
<p>NN noun, singular ‘desk’</p>
<!– /wp:paragraph –>
<!– wp:paragraph –>
<p>NNS noun plural ‘desks’</p>
<!– /wp:paragraph –>
<!– wp:paragraph –>
<p>NNP proper noun, singular ‘Harrison’</p>
<!– /wp:paragraph –>
<!– wp:paragraph –>
<p>NNPS proper noun, plural ‘Americans’</p>
<!– /wp:paragraph –>
<!– wp:paragraph –>
<p>PRP personal pronoun ‘I’, ‘he’, ‘she’</p>
<!– /wp:paragraph –>
<!– wp:paragraph –>
<p>PRP$ possessive pronoun ‘my’, ‘his’, ‘her’</p>
<!– /wp:paragraph –>
<!– wp:paragraph –>
<p>RB adverb ‘very’, ‘silently’</p>
<!– /wp:paragraph –>
<!– wp:paragraph –>
<p>RBR adverb, comparative ‘better’</p>
<!– /wp:paragraph –>
<!– wp:paragraph –>
<p>RBS adverb, superlative ‘best’</p>
<!– /wp:paragraph –>
<!– wp:paragraph –>
<p>UH interjection ‘errrrrrrrm’</p>
<!– /wp:paragraph –>
<!– wp:paragraph –>
<p>VB verb, base form ‘take’</p>
<!– /wp:paragraph –>