Question 1

How does it detect sentence boundaries?

Accepted Answer

A sentence ends at a period, question mark, or exclamation point that is followed by whitespace and a capital letter, unless the period belongs to a known abbreviation. 'Dr. Smith said yes.' is one sentence, not two. The exception list covers common abbreviations; rare ones may still cause false splits.
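The rule above can be sketched in a few lines of Python. This is a minimal illustration, not the tool's actual code: the `ABBREVIATIONS` set and the `split_sentences` name are hypothetical, and the pattern only recognizes ASCII capitals.

```python
import re

# Hypothetical exception list; the tool's real list is not published here.
ABBREVIATIONS = {"dr", "mr", "mrs", "ms", "prof", "st", "vs", "etc"}

# Candidate boundary: ., ?, or ! followed by whitespace and a capital letter.
BOUNDARY = re.compile(r"([.?!])\s+(?=[A-Z])")


def split_sentences(text):
    """Split text at candidate boundaries, skipping known abbreviations."""
    sentences = []
    start = 0
    for match in BOUNDARY.finditer(text):
        end = match.end(1)  # index just past the punctuation mark
        words = text[start:end - 1].split()
        last_word = words[-1].lower() if words else ""
        # A period right after an abbreviation is not a sentence end.
        if match.group(1) == "." and last_word in ABBREVIATIONS:
            continue
        sentences.append(text[start:end].strip())
        start = match.end()
    tail = text[start:].strip()
    if tail:
        sentences.append(tail)
    return sentences
```

Calling `split_sentences("Dr. Smith said yes. Then he left.")` keeps "Dr. Smith said yes." together because "Dr" is in the exception set. An abbreviation missing from the set would be split, which is the failure mode described above.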

Question 2

Can I extract sentences by keyword?

Accepted Answer

Yes, using a case-insensitive substring match. Sentences containing the keyword are returned in their original order. This is useful for grep-like extraction from documents, finding every mention of a term in a transcript, or assembling a summary around a phrase.
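As a rough sketch of that matching behavior, assuming the text has already been split into a list of sentences (the function name `extract_by_keyword` is made up for illustration):

```python
def extract_by_keyword(sentences, keyword):
    """Return sentences containing keyword, ignoring case, in original order."""
    needle = keyword.casefold()
    return [s for s in sentences if needle in s.casefold()]
```

Because this is a substring match rather than a whole-word match, a keyword such as "api" also matches a sentence containing "rapid".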

Question 3

Does it handle quoted dialogue?

Accepted Answer

Quoted text inside a larger sentence stays in that sentence. 'He said, "Yes."' is one sentence with embedded quoted material. A multi-sentence quote is split into multiple sentences wherever a period inside the quotes is followed by whitespace and a capital letter.

Question 4

What about lists and bullet points?

Accepted Answer

Bullet items are treated as separate sentences if they end with sentence-ending punctuation. List items without ending punctuation may merge with adjacent items. For cleaner results, pre-process bulleted content so that each item is followed by a paragraph break.
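One way to do that pre-processing is sketched below. The bullet pattern and the `separate_bullets` name are assumptions for illustration; adjust the pattern to whatever markers your content actually uses.

```python
import re

# Assumed bullet markers: -, *, •, or a number followed by . or )
BULLET = re.compile(r"^\s*(?:[-*•]|\d+[.)])\s+")


def separate_bullets(text):
    """Insert a blank line before each bullet item that lacks one."""
    out = []
    for line in text.splitlines():
        # Add a paragraph break unless one is already there.
        if BULLET.match(line) and out and out[-1] != "":
            out.append("")
        out.append(line)
    return "\n".join(out)
```

Running the result through the extractor instead of the raw list gives each item its own paragraph, so unpunctuated items no longer run together.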

Question 5

Can I extract by sentence position?

Accepted Answer

Yes: first, last, second, second-to-last, or any arbitrary index. This is useful for pulling the opening line of each paragraph (often the topic sentence) or the final line (often the conclusion or transition). Position-based extraction is reliable because it doesn't depend on content matching.
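In code, position-based extraction amounts to list indexing. The sketch below assumes the sentences are already in a list and uses Python's convention of negative indices counting from the end; the `sentence_at` helper is hypothetical.

```python
def sentence_at(sentences, index):
    """Return the sentence at index (0 = first, -1 = last), or None if out of range."""
    try:
        return sentences[index]
    except IndexError:
        return None
```

With this convention, `sentence_at(s, 0)` is the first sentence, `sentence_at(s, 1)` the second, `sentence_at(s, -1)` the last, and `sentence_at(s, -2)` the second-to-last.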

Question 6

Does it work for non-English text?

Accepted Answer

Latin-script languages (Spanish, French, German) work well. Chinese, Japanese, and Korean are supported, including the ideographic full stop (。) used in place of the period, but the results are less polished. Right-to-left scripts such as Arabic and Hebrew work. The heuristics are tuned for English, which gets the best results; other languages get adequate but not optimal results.
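Because Chinese and Japanese have no capital letters and usually no space between sentences, the whitespace-and-capital rule cannot apply to them. A simple alternative, shown here only as an assumed approach and not as the tool's documented behavior, is to split directly after the full-width terminators:

```python
import re

# Full-width sentence terminators: 。 (U+3002), ！ (U+FF01), ？ (U+FF1F)
CJK_SENTENCE = re.compile(r"[^。！？]+[。！？]?")


def split_cjk(text):
    """Split text after each full-width terminator, keeping the terminator."""
    return [s.strip() for s in CJK_SENTENCE.findall(text) if s.strip()]
```

This ignores closing quotation marks and mixed-script text, which is one reason results for these languages tend to be rougher than for English.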

Sentence Extractor
