Original scientific article

A PMI-DRIVEN APPROACH WITH CONVENTIONAL BERT FOR OPTIMIZING TEXT SUMMARIZATION

By

R. Ramesh
Assistant Professor, Department of Computer Applications, Thanthai Periyar Government Arts & Sciences College, Trichy, Tamil Nadu, India

N. Subalakshmi
Assistant Professor, Department of Computer and Information Science, Annamalai University, Chidambaram, Tamil Nadu, India

S. Selvarani
Assistant Professor, Department of Computer Science, Alagappa Government Arts College, Karaikudi, Tamil Nadu, India

K. Kavitha
Assistant Professor (Selection Grade), Department of Electrical & Electronics Engineering, Annamalai University, Annamalai Nagar, Chidambaram, Tamil Nadu, India

M. Jeyakarthic
Assistant Professor, Department of Computer and Information Science, Annamalai University, Annamalai Nagar, Chidambaram, Tamil Nadu, India

Abstract

Text summarization plays a crucial role in natural language processing by condensing large volumes of text into concise, meaningful summaries. With the rapid growth of digital content, existing summarization approaches often struggle to balance contextual understanding with semantic relevance. This paper presents a PMI-driven, BERT-based text summarization framework that integrates Pointwise Mutual Information (PMI) as a statistical pre-processing mechanism with a fine-tuned conventional BERT model to improve summary quality. PMI identifies and ranks semantically significant terms by their co-occurrence patterns, enabling effective keyword and phrase prioritization before summarization. The ranked textual representation is then processed by a summarization-specific decoder layer added on top of the BERT encoder to generate coherent, context-aware summaries. The framework is evaluated on the CNN/Daily Mail dataset, comprising over 300,000 news articles, using the ROUGE-1, ROUGE-2, and ROUGE-L metrics. Experimental results show that the proposed method achieves ROUGE-1, ROUGE-2, and ROUGE-L scores of 46.9, 27.61, and 45.68 respectively, outperforming baseline models such as Seq2Seq, Seq2Sick, and Prefix-Tuning by an average margin of 2–3%. The experiments were conducted in Python with the PyTorch deep learning framework in a CPU-based environment. The results indicate that PMI-based pre-processing significantly improves the contextual relevance and semantic consistency of generated summaries, and that the framework is robust and scalable enough for large-scale document summarization tasks.
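As a rough illustration of the PMI ranking idea described in the abstract, the sketch below scores word pairs by pointwise mutual information estimated from per-sentence co-occurrence counts. This is not the authors' implementation: the toy sentences, the sentence-level co-occurrence window, and the scoring function are all assumptions made for illustration.

```python
import math
from collections import Counter
from itertools import combinations

def pmi_scores(sentences, min_count=1):
    """Score word pairs by pointwise mutual information:
    PMI(x, y) = log( p(x, y) / (p(x) * p(y)) ),
    with probabilities estimated from per-sentence occurrence counts."""
    word_counts = Counter()
    pair_counts = Counter()
    n_sents = len(sentences)
    for sent in sentences:
        words = set(sent.lower().split())       # one count per sentence
        word_counts.update(words)
        pair_counts.update(frozenset(p) for p in combinations(sorted(words), 2))
    scores = {}
    for pair, c_xy in pair_counts.items():
        if c_xy < min_count:
            continue
        x, y = tuple(pair)
        p_xy = c_xy / n_sents
        p_x = word_counts[x] / n_sents
        p_y = word_counts[y] / n_sents
        scores[pair] = math.log(p_xy / (p_x * p_y))
    return scores

# Toy corpus (hypothetical, for illustration only).
sents = [
    "the model generates a summary",
    "the summary preserves key facts",
    "a model encodes the article",
]
scores = pmi_scores(sents)
# Rank pairs by PMI; in the paper's pipeline, such a ranking would feed
# keyword/phrase prioritization before the BERT-based summarizer.
ranked = sorted(scores, key=scores.get, reverse=True)
```

In practice, PMI is usually computed over much larger corpora and often within a fixed token window rather than whole sentences; rare-pair noise is also typically damped with a frequency threshold (the `min_count` parameter above is one simple way to do that).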


License

This is an open access article distributed under the Creative Commons Attribution Non-Commercial (CC BY-NC) License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.


The statements, opinions, and data contained in the journal are solely those of the individual authors and contributors and not of the publisher or the editor(s). The publisher remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.