We develop a number of data-driven investment strategies that demonstrate how machine learning and data analytics can be used to guide investments in peer-to-peer loans. We detail the process, from acquiring real data from a peer-to-peer lending platform through to developing and evaluating investment strategies based on a variety of approaches. We focus heavily on how to apply and evaluate the data science methods, and the resulting strategies, in a real-world business setting. The material presented in this article can be used by instructors who teach data science courses at the undergraduate or graduate level. Importantly, we go beyond evaluating the predictive performance of models to assess how well the strategies would actually perform, using real, publicly available data. Our treatment is comprehensive and ranges from qualitative to technical, but it is also modular, which gives instructors the flexibility to focus on specific parts of the case, depending on the topics they want to cover. The learning concepts include the following: data cleaning and ingestion, classification/probability estimation modeling, regression modeling, analytical engineering, calibration curves, data leakage, evaluation of model performance, basic portfolio optimization, evaluation of investment strategies, and using Python for data science.
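The distinction between a model's predictive performance and a strategy's actual performance can be illustrated with a minimal Python sketch. It is not the authors' code: the `Loan` fields, the helper names, the "lowest predicted default risk" rule, and the six toy loans are all hypothetical, chosen only to show a strategy being scored by its realized return rather than by a classification metric.

```python
from dataclasses import dataclass


@dataclass
class Loan:
    amount: float        # principal invested in the loan
    total_repaid: float  # total payments actually received
    p_default: float     # predicted default probability (hypothetical model output)


def realized_return(loans):
    """Return (total repaid - total invested) / total invested for a portfolio."""
    invested = sum(loan.amount for loan in loans)
    repaid = sum(loan.total_repaid for loan in loans)
    return (repaid - invested) / invested


def pick_lowest_risk(loans, k):
    """Hypothetical strategy: invest in the k loans with the lowest predicted risk."""
    return sorted(loans, key=lambda loan: loan.p_default)[:k]


# Toy, made-up loans for illustration only (not data from any lending platform).
loans = [
    Loan(1000, 1150, 0.05),
    Loan(1000, 1200, 0.10),
    Loan(1000, 400, 0.60),
    Loan(1000, 1300, 0.30),
    Loan(1000, 0, 0.80),
    Loan(1000, 1100, 0.15),
]

# Score the strategy by what it would have earned, and compare against
# a naive baseline that invests equally in every available loan.
strategy_return = realized_return(pick_lowest_risk(loans, 3))
baseline_return = realized_return(loans)

print(f"strategy: {strategy_return:.3f}, baseline: {baseline_return:.3f}")
```

In a realistic evaluation the predicted probabilities would come from a model fitted on earlier loans only, so that the outcome being scored never leaks into the prediction.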

Source
PMC: http://www.ncbi.nlm.nih.gov/pmc/articles/PMC6154448
DOI: http://dx.doi.org/10.1089/big.2018.0092
