Channel: Active questions tagged python - Stack Overflow

↧

Get previous sentence while using SpaCy matcher

April 1, 2024, 5:35 pm

≫ Next: How to build and install python file in Dockerfile?

≪ Previous: undetected_chrome_driver headless mode triggers captcha

I am running a SpaCy Matcher line-by-line on a text file. My file has each text entry on a separate line. I am trying to extract 1) the matched instance, 2) the full sentence, and 3) the previous sentence. I am able to get the first two, but I am having trouble getting the previous sentence, given that there isn't a sentence index (from this post). Here is my code:

with open('file.txt', 'r') as f:    for line in iter(f.readline, ''):        doc = nlp(line)        matcher = Matcher(nlp.vocab)        matcher.add("pattern_of_interest", [pattern])        matches = matcher(doc)        for match_id, start, end in matches:            string_id = nlp.vocab.strings[match_id]            span = doc[start:end]         for sent in doc.sents:            if matcher(sent):                instances.append(pd.Series({"instance":str(span.text), "sentence":str(sent.text),"previous_sentence":str(sent[-1].text)}))

I understand that the bolded part is giving me the previous token, not sentence (I tried to get around this with the list, but it doesn't work). Any advice for retrieving the previous sentence would be greatly appreciated. Thank you!

↧

Latest Images

7 clever tricks Primark does to keep you walking & buying more than you need...

7 clever tricks Primark does to keep you walking & buying more than you need...

July 20, 2025, 5:14 am

Art for Everyone! Autism advocacy, local stories, and indigenous pride in one...

Art for Everyone! Autism advocacy, local stories, and indigenous pride in one...

July 20, 2025, 5:06 am

Paintings of English Downs 2

Paintings of English Downs 2

July 20, 2025, 4:30 am

How Kerala Women Rescued a Dying Forest and Turned It Into a Safe Haven for...

How Kerala Women Rescued a Dying Forest and Turned It Into a Safe Haven for...

July 20, 2025, 3:30 am

Met Eireann warns of heavy rain & spot flooding for DAYS before big...

Met Eireann warns of heavy rain & spot flooding for DAYS before big...

July 20, 2025, 1:14 am

Who is Kevin Lerena’s wife Geraldine?

Who is Kevin Lerena’s wife Geraldine?

July 20, 2025, 12:57 am

Man stabs woman, baby to death inside Queens home, police say

Man stabs woman, baby to death inside Queens home, police say

July 19, 2025, 11:00 pm

Ang papel ni whistleblower Julie Patidongan sa kaso ng mga nawawalang sabungero

Ang papel ni whistleblower Julie Patidongan sa kaso ng mga nawawalang sabungero

July 19, 2025, 9:45 pm

Telangana Human Rights Commission (TGHRC) seeks report from revenue dept on...

Telangana Human Rights Commission (TGHRC) seeks report from revenue dept on...

July 19, 2025, 7:29 pm

Crisis-hit NHS fat cats raking in MASSIVE salaries as frontline services cry...

Crisis-hit NHS fat cats raking in MASSIVE salaries as frontline services cry...

July 19, 2025, 2:11 pm

Trending Articles

Download: Apple Choice ft Rich Bizzy & Chinzuze- nkumbabulili (BeatbyBizzy)

June 5, 2017, 4:03 pm

Copied a VM from an enterprise server to VM workstation but networking error

October 29, 2018, 11:34 am

Jessica Jones Episode Descriptions Revealed For The First Season

November 1, 2015, 8:31 pm

The 10 Tennessee Cities With The Largest Black Population For 2021

December 21, 2020, 10:12 am

Elle Duncan’s Husband Omar Abdul Ali

January 28, 2020, 10:35 am

Practice Sheet of Right form of verbs for HSC Students

September 22, 2019, 11:40 pm

Craig Hillman – Barton

September 6, 2013, 2:00 am

“Preachers Of Detroit” Bishop Charles Ellis Comes Close To Being Removed As...

March 22, 2015, 1:00 am

JAILED: Grimsby man who ignored child abduction notice & kept seeing...

July 29, 2014, 11:34 pm

CalCen

June 4, 2020, 6:35 pm

मतलबी दोस्त स्टेट्स | Matlabi Dost Status in Hindi – Selfish Friends Status

February 13, 2020, 3:12 am

Cheating at Statistics 18: Volosovo Vanishing

April 10, 2017, 3:19 pm

Seven men arrested after body found in Warlingham well are released on bail

November 17, 2013, 3:51 pm

Missing woman located, Lawrence Avenue East and East Avenue area, Nicole...

April 22, 2016, 11:49 pm

Students hit streets to save Agriculture College land in city

October 13, 2018, 2:20 am

Detroit Strip Club Mogul Weathers Philly Mob Shakedown Efforts

July 6, 2015, 6:36 am

Windows Update / Microsoft Update の接続先 URL について

February 27, 2017, 12:32 am

Who Is Sisanda Jonas? | Biography| Profile| History Of South African Media...

June 22, 2017, 7:22 pm

It’s Kind of a Funny Story 2010 Dual Audio 720p BRRip [Hindi – English] ESubs

June 8, 2016, 6:15 am

[RELEASE THREAD]--_A-Team_--Cricket_Dream_5G

September 25, 2022, 7:14 pm

Latest Images

7 clever tricks Primark does to keep you walking & buying more than you need...

7 clever tricks Primark does to keep you walking & buying more than you need...

July 20, 2025, 5:14 am

Art for Everyone! Autism advocacy, local stories, and indigenous pride in one...

Art for Everyone! Autism advocacy, local stories, and indigenous pride in one...

July 20, 2025, 5:06 am

Paintings of English Downs 2

Paintings of English Downs 2

July 20, 2025, 4:30 am

How Kerala Women Rescued a Dying Forest and Turned It Into a Safe Haven for...

How Kerala Women Rescued a Dying Forest and Turned It Into a Safe Haven for...

July 20, 2025, 3:30 am

Met Eireann warns of heavy rain & spot flooding for DAYS before big...

Met Eireann warns of heavy rain & spot flooding for DAYS before big...

July 20, 2025, 1:14 am

Who is Kevin Lerena’s wife Geraldine?

Who is Kevin Lerena’s wife Geraldine?

July 20, 2025, 12:57 am

Man stabs woman, baby to death inside Queens home, police say

Man stabs woman, baby to death inside Queens home, police say

July 19, 2025, 11:00 pm

Ang papel ni whistleblower Julie Patidongan sa kaso ng mga nawawalang sabungero

Ang papel ni whistleblower Julie Patidongan sa kaso ng mga nawawalang sabungero

July 19, 2025, 9:45 pm

Telangana Human Rights Commission (TGHRC) seeks report from revenue dept on...

Telangana Human Rights Commission (TGHRC) seeks report from revenue dept on...

July 19, 2025, 7:29 pm

Crisis-hit NHS fat cats raking in MASSIVE salaries as frontline services cry...

Crisis-hit NHS fat cats raking in MASSIVE salaries as frontline services cry...

July 19, 2025, 2:11 pm

© 2025 //www.rssing.com