Deep gradient reinforcement learning for music improvisation in cloud computing framework.

Fadwa Alrowais Munya A Arasi Saud S Alotaibi Mohammed Alonazi Radwa Marzouk Ahmed S Salama

PeerJ Comput Sci

Department of Electrical Engineering, Faculty of Engineering & Technology, Future University in Egypt, New Cairo, New Cairo, Egypt.

Published: January 2025

Artificial intelligence (AI) in music improvisation offers promising new avenues for developing human creativity. The difficulty of writing dynamic, flexible musical compositions in real time is discussed in this article. We explore using reinforcement learning (RL) techniques to create more interactive and responsive music creation systems. Here, the musical structures train an RL agent to navigate the complex space of musical possibilities to provide improvisations. The melodic framework in the input musical data is initially identified using bi-directional gated recurrent units. The lyrical concepts such as notes, chords, and rhythms from the recognised framework are transformed into a format suitable for RL input. The deep gradient-based reinforcement learning technique used in this research formulates a reward system that directs the agent to compose aesthetically intriguing and harmonically cohesive musical improvisations. The improvised music is further rendered in the MIDI format. The Bach Chorales dataset with six different attributes relevant to musical compositions is employed in implementing the present research. The model was set up in a containerised cloud environment and controlled for smooth load distribution. Five different parameters, such as pitch frequency (PF), standard pitch delay (SPD), average distance between peaks (ADP), note duration gradient (NDG) and pitch class gradient (PCG), are leveraged to assess the quality of the improvised music. The proposed model obtains +0.15 of PF, -0.43 of SPD, -0.07 of ADP and 0.0041 NDG, which is a better value than other improvisation methods.

Download full-text PDF	Source
http://www.ncbi.nlm.nih.gov/pmc/articles/PMC11784531	PMC
http://dx.doi.org/10.7717/peerj-cs.2265	DOI Listing

Publication Analysis

Top Keywords

reinforcement learning

music improvisation

musical compositions

improvised music

musical

music

deep gradient

gradient reinforcement

learning music

improvisation cloud

Similar Publications

Exploring the Evolution of Altruistic Punishment with a PDE Model of Cultural Multilevel Selection.

Bull Math Biol

March 2025

Department of Mathematics, University of Illinois Urbana-Champaign, Urbana, IL, USA.

Daniel B Cooney

Two mechanisms that have been used to study the evolution of cooperative behavior are altruistic punishment, in which cooperative individuals pay additional costs to punish defection, and multilevel selection, in which competition between groups can help to counteract individual-level incentives to cheat. Boyd, Gintis, Bowles, and Richerson have used simulation models of cultural evolution to suggest that altruistic punishment and pairwise group-level competition can work in concert to promote cooperation, even when neither mechanism can do so on its own. In this paper, we formulate a PDE model for multilevel selection motivated by the approach of Boyd and coauthors, modeling individual-level birth-death competition with a replicator equation based on individual payoffs and describing group-level competition with pairwise conflicts based on differences in the average payoffs of the competing groups.

View Article and Find Full Text PDF

Similar Publications

Of rats and robots: A mutual learning paradigm.

J Exp Anal Behav

March 2025

Behavioral Neuroscience Laboratory, Department of Psychology, Boğaziçi University, Istanbul, Turkey.

Oguzcan Nas Defne Albayrak Gunes Unal

Robots are increasingly used alongside Skinner boxes to train animals in operant conditioning tasks. Similarly, animals are being employed in artificial intelligence research to train various algorithms. However, both types of experiments rely on unidirectional learning, where one partner-the animal or the robot-acts as the teacher and the other as the student.

View Article and Find Full Text PDF

Similar Publications

Designing Baby-Friendly Hospital Step 2 Training A Hospital Case Study.

J Hum Lact

March 2025

Jersey Shore University Medical Center, Hackensack Meridian Health, Neptune, NJ, USA.

Cassandra P Leahy

Maintaining Baby-Friendly Hospital Initiative (BFHI) standards within a complex healthcare system presents unique challenges. This case study from a regional perinatal center in the northeast United States details the design and implementation of a program to address BFHI Step 2, which requires ongoing competency assessment and team member training to ensure breastfeeding support. The shift of BFHI competencies to continuous professional development introduced logistical challenges, compounded by staff turnover and budget constraints.

View Article and Find Full Text PDF

Similar Publications

Advancements in cache management: a review of machine learning innovations for enhanced performance and security.

Front Artif Intell

February 2025

Department of Computer Science & Engineering, Indian Institute of Technology Ropar, Rupnagar, India.

Keshav Krishna

Machine learning techniques have emerged as a promising tool for efficient cache management, helping optimize cache performance and fortify against security threats. The range of machine learning is vast, from reinforcement learning-based cache replacement policies to Long Short-Term Memory (LSTM) models predicting content characteristics for caching decisions. Diverse techniques such as imitation learning, reinforcement learning, and neural networks are extensively useful in cache-based attack detection, dynamic cache management, and content caching in edge networks.

View Article and Find Full Text PDF

Similar Publications

Control-Informed Reinforcement Learning for Chemical Processes.

Ind Eng Chem Res

March 2025

Department of Chemical Engineering, Imperial College London, London, South Kensington SW7 2AZ, U.K.

Maximilian Bloor Akhil Ahmed Niki Kotecha Mehmet Mercangöz Calvin Tsay

This work proposes a control-informed reinforcement learning (CIRL) framework that integrates proportional-integral-derivative (PID) control components into the architecture of deep reinforcement learning (RL) policies, incorporating prior knowledge from control theory into the learning process. CIRL improves performance and robustness by combining the best of both worlds: the disturbance-rejection and set point-tracking capabilities of PID control and the nonlinear modeling capacity of deep RL. Simulation studies conducted on a continuously stirred tank reactor system demonstrate the improved performance of CIRL compared to both conventional model-free deep RL and static PID controllers.

View Article and Find Full Text PDF

Similar Publications

Want AI Summaries of new PubMed Abstracts delivered to your In-box?

Enter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!