We apply tree-based classification algorithms, namely classification trees (fitted with the rpart algorithm), random forests, and XGBoost, to detect mood disorder in a group of 2508 lower secondary school students. The dataset presents many challenges, the most important of which are the large amount of missing data and heavy class imbalance (there are few severe mood disorder cases). We find that all algorithms are specific, but only the rpart algorithm is sensitive; i.e., it is able to detect real cases of mood disorder. We conclude that this is because the rpart algorithm uses surrogate variables to handle missing data. The most important social-studies-related result is that adolescents' relationships with their parents are the single most important factor in developing mood disorders, far more important than other factors such as socio-economic status or school success.
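
To make the distinction between "specific" and "sensitive" concrete, the minimal Python sketch below computes both measures from binary labels. The labels are made up purely for illustration and are not taken from the study, and the function name `sensitivity_specificity` is hypothetical. Under these assumptions, it shows how a classifier that never predicts the disorder can look perfectly specific on heavily unbalanced data while detecting no real cases.

```python
def sensitivity_specificity(y_true, y_pred):
    """Return (sensitivity, specificity) for binary labels, where 1 = disorder."""
    pairs = list(zip(y_true, y_pred))
    tp = sum(1 for t, p in pairs if t == 1 and p == 1)  # true positives
    fn = sum(1 for t, p in pairs if t == 1 and p == 0)  # missed cases
    tn = sum(1 for t, p in pairs if t == 0 and p == 0)  # true negatives
    fp = sum(1 for t, p in pairs if t == 0 and p == 1)  # false alarms
    # Sensitivity: share of real cases that are detected.
    sensitivity = tp / (tp + fn) if (tp + fn) else float("nan")
    # Specificity: share of non-cases that are correctly left unflagged.
    specificity = tn / (tn + fp) if (tn + fp) else float("nan")
    return sensitivity, specificity


# Illustrative, invented data: 95 students without and 5 with the disorder.
y_true = [0] * 95 + [1] * 5

# A degenerate classifier that always predicts "no disorder".
always_negative = [0] * len(y_true)

sens, spec = sensitivity_specificity(y_true, always_negative)
accuracy = sum(t == p for t, p in zip(y_true, always_negative)) / len(y_true)
print(f"sensitivity={sens:.2f} specificity={spec:.2f} accuracy={accuracy:.2f}")
```

In this toy setting the accuracy and specificity are high even though no case is detected, which is why sensitivity is the more informative measure when positive cases are rare.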

Source
PMC: http://www.ncbi.nlm.nih.gov/pmc/articles/PMC8468933
DOI: http://dx.doi.org/10.3390/e23091210
