Short text classification approach to identify child sexual exploitation material.

Sci Rep

Department of Electrical, Systems and Automation Engineering, Universidad de León, León, Spain.

Published: September 2023

Producing or sharing Child Sexual Exploitation Material (CSEM) is a severe crime that Law Enforcement Agencies (LEAs) fight daily. When an LEA seizes a computer from a potential producer or consumer of CSEM, it analyzes the suspect's storage devices for evidence. Manual inspection of CSEM is time-consuming, and Spanish police have limited time to act under a search warrant. Our approach to speeding up the identification of CSEM-related files is to analyze only the file names and their absolute paths rather than their content. The main challenge lies in handling short and sparse texts that file owners deliberately distort using obfuscated words and user-defined naming patterns. We present two approaches to CSEM identification. The first employs two independent classifiers, one for the file name and the other for the file path, and then combines their outputs. The second uses only the file name classifier, iterating over the absolute path. Both operate at the character n-gram level, and novel binary and orthographic features are introduced to enrich the text representation. We benchmarked six classification models based on machine learning and convolutional neural networks. The proposed classifier achieves an F1 score of 0.988, making it a promising tool for LEAs.
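The text representation described in the abstract can be illustrated with a minimal Python sketch. It is not the authors' implementation. The function names, the n-gram range, the particular orthographic features, the weighted average used to combine the two classifier outputs, and the maximum taken over path components are all assumptions chosen for illustration. The inputs are neutral placeholder strings.

```python
from collections import Counter


def char_ngrams(text, n_min=2, n_max=3):
    """Count character n-grams of length n_min..n_max (range is an assumption)."""
    text = text.lower()
    grams = Counter()
    for n in range(n_min, n_max + 1):
        for i in range(len(text) - n + 1):
            grams[text[i:i + n]] += 1
    return grams


def orthographic_features(text):
    """Hypothetical orthographic features: character-class ratios and length."""
    length = len(text) or 1  # avoid division by zero on empty strings
    return {
        "length": len(text),
        "digit_ratio": sum(c.isdigit() for c in text) / length,
        "upper_ratio": sum(c.isupper() for c in text) / length,
        "symbol_ratio": sum(not c.isalnum() for c in text) / length,
    }


def split_path(abs_path):
    """Split an absolute path into components, accepting / or \\ separators."""
    return [part for part in abs_path.replace("\\", "/").split("/") if part]


def combine_scores(name_score, path_score, name_weight=0.5):
    """First approach (assumed rule): weighted average of two classifier scores."""
    return name_weight * name_score + (1.0 - name_weight) * path_score


def iterate_over_path(abs_path, score_fn):
    """Second approach (assumed rule): score every component, keep the maximum."""
    return max(score_fn(part) for part in split_path(abs_path))


if __name__ == "__main__":
    print(char_ngrams("abc"))
    print(orthographic_features("Ab1_"))
    print(split_path("C:\\Users\\docs\\file.txt"))
```

In practice `score_fn` would be a trained file name classifier that returns a probability. Here it is left as a parameter, because the abstract does not say how the six benchmarked models produce or combine their scores.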

Source
PMC: http://www.ncbi.nlm.nih.gov/pmc/articles/PMC10522674
DOI: http://dx.doi.org/10.1038/s41598-023-42902-8
