What am I missing in this code, why is the output not each unique token in the text the user inputs and their frequency?

write a program that takes as input a plain text file containing natural language, and produces a list of each normalized type (unique token) and the number of times it appears in the file, sorted from largest to smallest. The user should be able to type: myfile.txt ()And the results should be printed to the screen. E.g.,$ python normalize_text.py myfile.txt --lower --stemshould produce something like:the 5000dog 1204ran 506...and so on. Options that the user can control must include: lowercasing, one ofeither stemming or lemmatization, stopword removal, and at least one additionaloption you added.

` import argparseimport refrom string import punctuation

def normalize_text(text, lower=True, remove_punctuation=False):    #apply lowercasing    if lower:        text = text.lower()#remove punctuation    if remove_punctuation:        text = text.translate(str.maketrans('', '', punctuation))#tokenization    tokens = re.findall(r'\b\w+\b', text)    return tokensdef simple_stem(tokens):#remove common suffixes    suffixes = ['s', 'es', 'ed', 'ing']    for suffix in suffixes:        if tokens.endswith(suffix):            return tokens[:-len(suffix)]    return tokensdef count_tokens(tokens):#count the occurrences of each word    counts = {}    for token in tokens:        counts[token] = counts.get(token, 0) + 1    return countsdef main():    parser = argparse.ArgumentParser(description='Text Normalization and Frequency Analysis')    parser.add_argument('filename', type=str, help='Input text file')    parser.add_argument('--lower', action='store_true', help='Lowercase the text')    parser.add_argument('--stem', action='store_true', help='Apply stemming to tokens')    parser.add_argument('--remove_stopwords', action='store_true', help='Remove common stopwords')    parser.add_argument('--remove_punctuation', action='store_true', help='Remove punctuation from text')    args = parser.parse_args()    with open(args.filename, 'r', encoding='utf-8') as file:        content = file.read()    tokens = normalize_text(text, lower=args.lower, stem=args.stem, remove_punctuation=args.remove_punctuation, remove_stopwords=args.remove_stopwords)     counts = count_tokens(tokens)#count token frequencies    wordcounts = count_tokens(content)#sort by frequency in descending order    sorted_wordcounts = sorted(wordcounts.items(), key=lambda x: x[1], reverse=True)#print the results    for word, count in sorted_wordcounts:        print(f'{word} {count}')if __name__ == '__main__':    main()`

What am I missing in this code, why is the output not each unique token in the text the user inputs and their frequency?

Trending Articles

RAMAYAMPET Mandal Sarpanch | Upa-Sarpanch | Ward member Mobile Numbers Medak...

लड़कियां सेक्स के दौरान क्यों करती है उह! आह!लड़कियां सेक्स के दौरान क्यों करती...

Neem Baba Extra Questions Answer Class 6 English Poorvi

Throw Back: 4×4 — Sikilitele (Ft Castro) Prod by JQ

Rajasthan Board 10th Result 2016 Roll No wise & Name Wise

Lowe faces four theft charges

Practice Sheet of Right form of verbs for HSC Students

Mafia, Murder & Mayhem In The Motor City: Detroit Mob Hit Timeline (1937-2007)

The 10 Tennessee Cities With The Largest Black Population For 2021

Materials Around Us Class 6 Worksheet Science Chapter 6

デスクトップヒープの枯渇

Best Suvichar in Hindi |बेस्ट सुविचार |शुभ विचार हिंदी में

Kanulanu Thaake Lyrics and translation | Manam (2014)

Korean Sex Porn Videos: XXX Videos & Free Porn Movies

Teen Shot In Miami Drive-By Dies From Injuries

Download: IQ Muzatasha feat Shy D & Pmj – Ulesi NiFertilizer Yamavuto

Mahakal Attitude Status

Property developer set up cannabis factory to help pay off debts...

♡

KB: How to troubleshoot issues when adding a Hyper-V host in System Center...