MidiScale

Piano Knowledge Representation and Generalization

We don’t know how piano knowledge is represented in the mind; how does learning on one piece transfer to learning on another piece; Can we use AI to generate practice exercises to get the user from learning state A to desired learning state B.

These ideas are the forefront of research and will modernize the learning experience. So thank you again for coming on this journey with me and paving the way to make learning a more equitable experience.

Lebron James, one of the best basketball players of all time, and Jacob Collier, a highly awarded musician, are at the edge of human ability in their respective fields. Critical to accomplishing their striking feats are their finely tuned motor skills that have been honed through rigorous practice. This prompts two questions, are some forms of practice better than others and how do representations of knowledge change over time? Piano playing provides an ideal case study for investigating how humans learn a complex motor skill. To play moving melodies and crisp chords learners must train their mind and hands to accomplish this goal. What is the optimal training regime to get to this point? There are several hypotheses that may achieve this goal: 1) The low-generalization hypothesis assumes practice only benefits learning on the practiced exercise or piece at hand. 2) The high-generalization hypothesis assumes that all practice is equally beneficial. 3) On the other hand, a skill hypothesis assumes that there are specific skills that can be trained and generalized to enhance future learning. In the simplest form, these skills take the form of rhythmicity, playing notes on time, and tonal accuracy, playing the correct note. Concretely, these different forms of practice may look like 1) someone who practices on a variety of piano pieces to practice as many scenarios as possible, 2) someone who only practices on one piano piece, 3) someone who practices on specific skills like rhythmicity and timing in a variety of contexts. Currently, there has been no systematic study probing different training styles that tap into humans’ cognitive architecture to learn the piano. Furthering our understanding of how our mind’s cognitive architecture digests information learned during practice is key to enhancing pedagogy and figuring out how the mind represents knowledge. In Aim 1, we will address a fundamental question of how piano knowledge is represented over time and the structure of this knowledge. Aim 2 will address how prior learning generalizes and affects future learning. Aim 3 will investigate how personalized exercises can be generated to prevent learners from generating future errors. All findings will be shared through publications and conference presentations. Data, code, and the pre-registration of the study will be uploaded to a public repository such as OSF.

Aim 1: Changes in piano knowledge representation. As a learner increases their skill in piano, how they represent knowledge about playing will change and understanding these changes will lay foundational work for understanding how to optimize the future practice of a learner. We can understand how knowledge representation changes over time by examining the errors learners make. Take the simple example of playing a C-major chord. The first time a learner plays it, they could make an error due to not fully knowing what notes make up the C-major chord. Overtime, errors due to this nature significantly decrease in likelihood. Other possible errors may arise due to selecting the wrong hand configuration to play the chord or through a motor execution error. Learning must occur in all of these cases to play the C-major chord. The non-generalization hypothesis predicts that learning C-major in one exercise will fail to transfer to a future exercise. The high-generalization hypothesis predicts that learning C-major in one exercise will generalize to all situations and to future learning of similar chords. The skill hypothesis predicts that learning C-major in one context will generalize only to contexts in which the tonal position and timing are similar across exercises. This simple example can be expanded to other necessary skills like learning a finger tuck, a motor skill necessary for playing scales. Using a dataset from Melodics, a software that teaches piano using exercises, we will categorize types of errors as either tonal (e.g. playing E instead of C) or as timing errors. In Melodics, users complete diverse exercises that train different motor and musical skills. The figure shows what participants see while playing an exercise. The learner has a physical keyboard while they look at virtual notes that move in time towards them. The learner must play current notes while looking ahead to prepare to play future notes. Inevitably while learning new exercises, the learner will make errors. For new exercises that employ priorly learned chords, the low-generalization hypothesis predicts errors will be high at the beginning of each exercise because prior learning will not transfer to new exercises. The high-generalization hypothesis predicts that as the amount of overall practice time increases, errors will decrease for all pieces of the same difficulty. The skill hypothesis predicts two possible outcomes. 1) tonal and timing learning increase in parallel and both timing and tonal errors reduce over time. 2) tonal learning must occur before timing learning and once tonal errors have sufficiently reduced, timing errors will then decrease. These findings will reveal how changes in errors are reflective of internal representations of piano skill.

Using the categorized errors, we will examine how prior experience and practice affect future performance. To do this, we will train an artificial neural network (ANN) to predict learners’ future errors on a novel exercise. The ANN will train on learners’ previously practiced exercises. Once the ANN is trained on the practice history of individuals, it will be used to predict future behavior of learners on novel exercises: 1) Tonal errors, 2) timing errors. Further, based on a learner’s performance on a novel exercise, we can ask the ANN to predict, 3) how long has the learner been practicing, and 4) have they practiced on a specific exercise. Successful prediction on the first two metrics will support the conclusions that learning sub-skills like tonal and timing, are essential for piano learning, supporting the skill-hypothesis. Successful prediction of the last two metrics would support the high-generalization hypothesis. This will be the first characterization of piano learning across thousands of people and these analyses will enable future insights on piano pedagogy. Further, this work will be an initial step to creating a robot that will learn to play the piano like humans do.

Aim 2: Learning generalization. For the past century, piano learners have been using Hanon exercises that are supposed to train essential piano motor skills2. However, these exercises have not been validated and it is unclear if/what specific sub-skills they are training. Taking the simple example from earlier of learning how to play a C-major chord, does learning how to play this chord help with learning to play other chords like D-major or B-minor? Using the ANN trained in aim 1, we can investigate what exercises and learned information are necessary for predicting future performance. Concretely, to predict future tonal errors, what information did the ANN use to make that prediction? How the ANN represents information cannot be interpreted directly so we must infer how it makes decisions by changing the information it was trained on. We will ablate timing errors by artificially correcting them in the dataset the ANN is trained on. If this information is important for the ANN to make predictions about tonal errors, this will indicate that tonal knowledge is necessary to learn timing information. We will use this same methodology to probe timing knowledge as well. There are a couple of possible outcomes: 1) timing errors influence the ANN’s ability to predict future tonal errors and vice versa. 2) Timing errors fail to influence the ANN’s ability to predict future tonal errors and vice versa. If the first outcome is true, this would suggest that these two skills may be part of a hierarchical structure of skill learning. However, if the second outcome is true, this would suggest that learning timing and tonal skill learning are orthogonal to each other and develop independently. Ultimately, these findings will further our understanding of what information is essential for superior future performance.

Aim 3: Generating Optimal Practice. An essential part of practice is ensuring that errors are not repeated in the future. What would be the optimal practice to achieve this goal? Ideally, personalized tutoring and practice for each learner is superior to general curricula. Using our findings from aims 1 & 2, we will create an ANN that recommends exercises that will rectify prevalent errors in a learner. This recommender system is similar to the one that video streaming services like Netflix use to recommend new movies based on what the users have watched in the past. For example, if a learner is generating many errors in exercises that are in the scale of C-major, the ANN will recognize this and assign exercises to rectify these errors. However, in piano learning, the recommender system would be limited to an existing set of exercises and would not take into account musical preferences of the learner. To further expand personalized training, we will build an ANN that can generate an infinite number of exercises that develop learning of specific skills. Further, the exercises’ style will be to the learner’s preference. For example, if a learner likes jazz and would prefer to have all of their exercises in a jazz style, we could transform exercises like the Hanon exercises to fit their preference: jazz Hanon.

References: [1] Krumhansl, C. L. (2000). Psychological bulletin.[2] Hanon (1900). New York: G. Schirmer.

Join the mailing list to gain access to our bi-weekly newsletter and software updates!

This site is protected by reCAPTCHA and the Google Privacy Policy and Terms of Service apply.