Mathematical formulae represent complex semantic information in a concise form. Especially in Science, Technology, Engineering, and Mathematics, mathematical formulae are crucial for communicating information, e.g., in scientific papers, and to perform computations using computer algebra systems. Enabling computers to access the information encoded in mathematical formulae requires machine-readable formats that can represent both the presentation and content, i.e., the semantics, of formulae. Exchanging such information between systems additionally requires conversion methods for mathematical representation formats. We analyze how the semantic enrichment of formulae improves the format conversion process and show that considering the textual context of formulae reduces the error rate of such conversions. Our main contributions are: (1) providing an openly available benchmark dataset for the mathematical format conversion task consisting of a newly created test collection, an extensive, manually curated gold standard and task-specific evaluation metrics; (2) performing a quantitative evaluation of state-of-the-art tools for mathematical format conversions; (3) presenting a new approach that considers the textual context of formulae to reduce the error rate for mathematical format conversions. Our benchmark dataset facilitates future research on mathematical format conversions as well as research on many problems in mathematical information retrieval. Because we annotated and linked all components of formulae, e.g., identifiers, operators and other entities, to Wikidata entries, the gold standard can, for instance, be used to train methods for formula concept discovery and recognition. Such methods can then be applied to improve mathematical information retrieval systems, e.g., for semantic formula search, recommendation of mathematical content, or detection of mathematical plagiarism.

Download full-text PDF

Source
http://www.ncbi.nlm.nih.gov/pmc/articles/PMC8474120PMC
http://dx.doi.org/10.1145/3197026.3197058DOI Listing

Publication Analysis

Top Keywords

mathematical formulae
16
mathematical format
16
mathematical
13
textual context
12
format conversions
12
formulae
9
considering textual
8
format conversion
8
context formulae
8
error rate
8

Similar Publications

The role of oscillations in grid cells' toroidal topology.

PLoS Comput Biol

January 2025

Kavli Institute for Systems Neuroscience and Centre for Algorithms in the Cortex, Faculty of Medicine and Health Sciences, Norwegian University of Science and Technology, Trondheim, Norway.

Persistent homology applied to the activity of grid cells in the Medial Entorhinal Cortex suggests that this activity lies on a toroidal manifold. By analyzing real data and a simple model, we show that neural oscillations play a key role in the appearance of this toroidal topology. To quantitatively monitor how changes in spike trains influence the topology of the data, we first define a robust measure for the degree of toroidality of a dataset.

View Article and Find Full Text PDF

Re-locative guided search optimized self-sparse attention enabled deep learning decoder for quantum error correction.

Sci Rep

January 2025

Department of Mathematics, School of Advanced Sciences, VIT-AP University, Besides AP Secretariate, Amaravati, Andhra Pradesh, 522237, India.

Heavy hexagonal coding is a type of quantum error-correcting coding in which the edges and vertices of a low-degree graph are assigned auxiliary and physical qubits. While many topological code decoders have been presented, it is still difficult to construct the optimal decoder due to leakage errors and qubit collision. Therefore, this research proposes a Re-locative Guided Search optimized self-sparse attention-enabled convolutional Neural Network with Long Short-Term Memory (RlGS2-DCNTM) for performing effective error correction in quantum codes.

View Article and Find Full Text PDF

Soliton solutions of the (2 + 1)-dimensional Jaulent-Miodek evolution equation via effective analytical techniques.

Sci Rep

January 2025

Department of Mathematics, Saveetha School of Engineering, Saveetha Institute of Medical and Technical Sciences, Saveetha University, Chennai, Tamil Nadu, 602105, India.

In this study, we investigate the [Formula: see text]-D Jaulent-Miodek (JM) equation, which is significant due to its energy-based Schrödinger potential and applications in fields such as optics, soliton theory, signal processing, geophysics, fluid dynamics, and plasma physics. Given its broad utility, a rigorous mathematical analysis of the JM equation is essential. The primary objective of this work is to derive exact soliton solutions using the Modified Sub-Equation (MSE) and Modified Auxiliary Equation (MAE) techniques.

View Article and Find Full Text PDF

A simple plan strategy to optimize the biological effective dose delivered in robotic radiosurgery of vestibular schwannomas.

Phys Med Biol

January 2025

Radiotherapy and Radiosurgery department, Iatropolis Clinic, 54 Ethnikis Antistaseos ave., Athens, Attica, 15231, GREECE.

Using the concept of biologically effective dose (BED), the effect of sublethal DNA damage repair (SLR) on the bio-efficacy of prolonged radiotherapy treatments can be quantified (BED). Such treatments, lasting more than 20 min, are typically encountered in stereotactic radiosurgery (SRS) applications using the CyberKnife (CK) and Gamma knife systems. Evaluating the plan data from 45 Vestibular Schwannoma (VS) cases treated with single fraction CK-SRS, this work demonstrates a statistically significant correlation between the marginal BEDSLR delivered to the target (m-BEDSLR) and the ratio of the mean collimator size weighted by the fraction of total beams delivered with each collimator ((_w^m)Cs), to the tumor volume (Tv).

View Article and Find Full Text PDF

Atopic dermatitis (AD) is a chronic inflammatory skin condition characterized by dry skin, severe itching, redness, and inflammation. Its complex etiology, involving genetic, immunological, and environmental factors, necessitates innovative therapeutic approaches. This study investigates nanostructured lipid carriers (NLCs) formulated with traditional fermented coconut (Cocos nucifera L.

View Article and Find Full Text PDF

Want AI Summaries of new PubMed Abstracts delivered to your In-box?

Enter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!