This article presents SUBTLEX-AR, a digital database providing an extensive collection of attributes of Modern Standard Arabic words (Arabic for short). SUBTLEX-AR combines a novel dataset of 120 million word tokens from movie subtitles with 40 million tokens from newspaper articles originally collected in ARALEX (Boudelaa & Marslen-Wilson, Behavior Research Methods, 42, 481-487, 2010), so that both sources contribute to its coverage. SUBTLEX-AR provides information about the statistical properties of Arabic words at the orthographic, phonological, morphological, and semantic levels. The database also includes sub-word properties such as bigram and trigram frequencies, as well as lemmas and part-of-speech information along with their corresponding frequencies. The online interface of SUBTLEX-AR allows users either to upload a set of words and receive their properties, or to retrieve a set of words that match constraints on predefined properties. The properties themselves are easily extensible and will be expanded over time. SUBTLEX-AR is freely accessible at https://subtlexar.uaeu.ac.ae/.
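
To make the sub-word properties mentioned above more concrete, the following is a minimal sketch of how token-weighted letter bigram and trigram frequencies, and a per-million word frequency, could be derived from a table of word counts. It is an illustration only and not the authors' pipeline: the function names `ngram_frequencies` and `per_million` are hypothetical, the three words and their counts are invented toy data, and the sketch assumes unvocalized orthographic forms in which each Unicode code point is one letter.

```python
from collections import Counter


def ngram_frequencies(word_counts, n):
    """Return token-weighted counts of letter n-grams.

    Each n-gram inside a word is credited with that word's corpus count,
    so frequent words contribute more than rare ones.
    """
    totals = Counter()
    for word, count in word_counts.items():
        # Slide a window of length n over the letters of the word.
        for i in range(len(word) - n + 1):
            totals[word[i:i + n]] += count
    return totals


def per_million(count, corpus_size):
    """Normalize a raw count to occurrences per million tokens."""
    return count * 1_000_000 / corpus_size


# Invented toy counts (assumption): not taken from SUBTLEX-AR.
toy_counts = {"كتاب": 30, "كتب": 50, "كاتب": 20}
corpus_size = sum(toy_counts.values())

bigrams = ngram_frequencies(toy_counts, 2)
trigrams = ngram_frequencies(toy_counts, 3)
kitab_per_million = per_million(toy_counts["كتاب"], corpus_size)
```

Weighting each n-gram by the count of the word that contains it is one common choice; counting each word type once regardless of its frequency is another, and the abstract does not say which variant the database reports.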

Source
http://dx.doi.org/10.3758/s13428-024-02560-8
