Text Summarization for Urdu: Part 1

Text summarization condenses a large document so that a reader can quickly grasp its main idea. There are two main summarization techniques in NLP.

Extractive Text Summarization: As the name suggests, this approach extracts the most important sentences or phrases from the original text and combines them into a short summary. See the figure for the explanation.

Abstractive Text Summarization: This approach uses more advanced deep learning techniques to generate new sentences after learning from the original text. It is a complex task and requires substantial computing power, such as a GPU.
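To make the extractive idea concrete, here is a minimal sketch that scores each sentence by the average frequency of its words and keeps the top ones. It is a simplified illustration only, not the algorithm used by any particular library. The function names and the choice of sentence-ending marks (the Urdu full stop, question marks, exclamation marks and line breaks) are my own assumptions.

```python
import re
from collections import Counter


def split_sentences(text):
    """Split on the Urdu full stop (U+06D4), question marks, '!' and line breaks."""
    parts = re.split(r"[\u06D4\u061F?!\n]+", text)
    return [p.strip() for p in parts if p.strip()]


def extractive_summary(text, max_sentences=1):
    """Return the highest-scoring sentences, kept in their original order."""
    sentences = split_sentences(text)
    # Count how often each whitespace-separated token occurs in the whole text.
    freq = Counter(word for s in sentences for word in s.split())

    def score(sentence):
        words = sentence.split()
        # Average token frequency, so a sentence is not favoured for length alone.
        return sum(freq[w] for w in words) / len(words)

    # Rank sentence indices by score, best first (ties keep document order).
    ranked = sorted(range(len(sentences)),
                    key=lambda i: score(sentences[i]),
                    reverse=True)
    # Restore document order for the sentences that were selected.
    chosen = sorted(ranked[:max_sentences])
    return [sentences[i] for i in chosen]
```

Calling extractive_summary(text, max_sentences=2) on an Urdu paragraph returns a list with the two sentences whose words occur most often across the paragraph. Real extractive summarizers use stronger signals, such as stop-word removal, stemming and graph-based ranking.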

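Let's dive into the code for generating a text summary. The code uses the summa library, which implements the TextRank algorithm, an extractive approach. I'm passing Arabic as the language parameter because the contributor did an excellent job of handling a lot of things, such as stemming and support for Urdu characters.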

from summa.summarizer import summarize

text = """ اسلام آباد: صدر مملکت ڈاکٹر عارف علوی بھی کورونا وائرس کا شکار ہوگئے۔
سماجی رابطے کی ویب سائٹ ٹویٹر پر ڈاکٹر عارف علوی نے لکھا کہ میرا کورونا ٹیسٹ مثبت آگیا ہے،
 اللہ سب کورونا متاثرین پر رحم فرمائے، ویکسین کی پہلی خوراک لی تھی جب کہ دوسری ڈوز ایک ہفتے
 بعد لگنی تھی جس کے بعد اینٹی باڈیز بننا شروع ہوتی ہیں، برائے مہربانی محتاط رہیں۔"""

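# words=15 asks for a summary of roughly 15 words; when words is given, summa uses it instead of ratio.
summary = summarize(text, ratio=0.2, language="arabic", words=15)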
print(summary)

And here is the output:

سماجی رابطے کی ویب سائٹ ٹویٹر پر ڈاکٹر عارف علوی نے لکھا کہ میرا کورونا ٹیسٹ مثبت آگیا ہے،
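Note that the output above is one line of the input rather than a complete sentence: it stops at the comma where the line breaks. This suggests the text is being segmented at line breaks and not at the Urdu full stop (۔). Assuming that is the cause (an assumption, not something confirmed against summa's internals), a small preprocessing step can put each sentence on its own line before summarizing. The helper name one_sentence_per_line is hypothetical.

```python
import re

# Urdu full stop (U+06D4) and Arabic question mark (U+061F), plus ASCII ? and !
SENTENCE_END = re.compile(r"([\u06D4\u061F?!])")


def one_sentence_per_line(text):
    """Rejoin wrapped lines, then place each sentence on its own line."""
    # Collapse all whitespace, including line breaks, into single spaces.
    flat = " ".join(text.split())
    # Insert a line break after every sentence-ending mark.
    broken = SENTENCE_END.sub(r"\1\n", flat)
    # Trim each line and drop any empty ones.
    return "\n".join(line.strip() for line in broken.split("\n") if line.strip())
```

The result can then be passed to summarize in place of text, for example summarize(one_sentence_per_line(text), language="arabic", words=15).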

Isn't it easy? Let me know if you have any questions.

Comments

ibm, 30 May 2021 at 06:28
Irfan! Sent you an email regarding the development of an Urdu sentiment analysis library.

Unknown, 14 December 2021 at 00:34
Hi Irfan, can you help with Urdu text summarization using spaCy?

Muhammad Irfan, 17 December 2021 at 01:21
spaCy does not provide summarization.

Unknown, 29 January 2022 at 20:18
Which algorithm or technique are you using?

Anonymous, 26 April 2022 at 00:07
This only works with the provided text. If you change the text, it shows nothing.

Muhammad Irfan, 20 September 2022 at 03:56
virtuoso.irfan@gmail.com

Anonymous, 12 July 2023 at 01:03
Hey, it's only working for the given text.
