Original scientific article

CONTEXT-AWARE RULE-BASED MATH EXPRESSION NORMALISER AND VERBALIZER USING LATEX2TEXT FOR ENHANCED DOCUMENT PREPROCESSING

By
J. Joice

Research Scholar, PG and Research Department of Computer Science, Government Arts and Science College, Tiruppur, India

C. Sathya

Assistant Professor, PG and Research Department of Computer Science, Government Arts and Science College, Tiruppur, India

Abstract

Blind students often face substantial barriers when reading and accessing electronic documents, especially documents that are noisy or elaborately designed. Traditional NLP models frequently overlook or misinterpret mathematical expressions, whose meaning is carried by symbolic notation. This is a critical problem in education, accessibility, and report generation, where an accurate understanding of mathematical content is a priority. State-of-the-art document summarisation systems tend to fail on noisy text, disordered document structures, and non-textual content such as equations, images, and charts. This paper introduces a preprocessing model that focuses on improving input quality, semantic coherence, and readability. The process consists of text cleaning, structure-aware organisation, and a content interpretation model. The paper proposes simplifying and verbalising mathematical expressions with a rule-based, context-sensitive language called the Verbalizer Rule (VR). The system translates complex mathematical syntax into human-readable natural-language descriptions by pattern-matching expressions and resolving their semantic meaning from contextual clues. Experiments show that this method achieves substantially higher readability scores and summarisation quality than state-of-the-art models. In the evaluation against other verbalizers, the proposed CARMEN model, assessed with ROUGE-1, ROUGE-2, and ROUGE-L, yields a ROUGE score above 0.8333.
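To make the idea of rule-based, context-sensitive verbalisation concrete, the following is a minimal Python sketch. It is not the authors' Verbalizer Rule (VR) language or the CARMEN implementation, neither of which is specified in the abstract. The rule table `RULES`, the function `verbalize`, and the context cue (the word "set" selecting between "cardinality" and "absolute value") are all hypothetical choices made for illustration, and only the standard `re` module is used.

```python
import re

# Hypothetical rule table (an assumption, not the paper's VR rules):
# each entry is (compiled pattern, replacement), applied in order.
RULES = [
    (re.compile(r"\\frac\{([^{}]+)\}\{([^{}]+)\}"), r"\1 over \2"),
    (re.compile(r"\\sqrt\{([^{}]+)\}"), r"the square root of \1"),
    (re.compile(r"\^(?:\{2\}|2(?!\d))"), " squared"),
    (re.compile(r"\^\{([^{}]+)\}"), r" to the power of \1"),
    (re.compile(r"\^(\w)"), r" to the power of \1"),
    (re.compile(r"\\leq"), " is less than or equal to "),
    (re.compile(r"\\geq"), " is greater than or equal to "),
    (re.compile(r"\\times"), " times "),
    (re.compile(r"="), " equals "),
    (re.compile(r"\+"), " plus "),
    (re.compile(r"-"), " minus "),
]

BARS = re.compile(r"\|([^|]+)\|")


def verbalize(expr: str, context: str = "") -> str:
    """Turn a small LaTeX-like expression into an English phrase.

    `context` is the surrounding sentence; it is used only to decide how
    to read |...| (a hypothetical example of a context-sensitive rule).
    """
    # Context-sensitive rule: |A| is a cardinality when the surrounding
    # text talks about sets, otherwise an absolute value.
    if re.search(r"\bset\b", context.lower()):
        text = BARS.sub(r"the cardinality of \1", expr)
    else:
        text = BARS.sub(r"the absolute value of \1", expr)

    # Apply the pattern rules repeatedly until nothing changes, so that
    # inner constructs (e.g. \sqrt inside \frac) are rewritten first.
    while True:
        previous = text
        for pattern, replacement in RULES:
            text = pattern.sub(replacement, text)
        if text == previous:
            break

    # Collapse the extra spaces introduced by the replacements.
    return re.sub(r"\s+", " ", text).strip()


print(verbalize(r"x^2 + y^2 = z^2"))
# x squared plus y squared equals z squared
print(verbalize(r"\frac{a}{\sqrt{b}}"))
# a over the square root of b
print(verbalize("|A|", context="Let A be a finite set."))
# the cardinality of A
print(verbalize("|x|", context="for any real number x"))
# the absolute value of x
```

The fixed-point loop is one simple way to handle nesting with flat regular expressions. A fuller system would more likely parse the expression into a tree and draw on richer contextual signals than a single keyword.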

Citation

This is an open access article distributed under the Creative Commons Attribution Non-Commercial (CC BY-NC) License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
