Innovations in Cover Song Detection: A Lyrics-Based Approach

Read original: arXiv:2406.04384 - Published 6/10/2024 by Maximilian Balluff, Peter Mandl, Christian Wolff

Innovations in Cover Song Detection: A Lyrics-Based Approach

Overview

This paper presents a novel approach to cover song detection using lyrics-based features.
The researchers developed a dataset of cover songs and their original versions, and used this to train machine learning models to identify cover songs.
The key innovation is the focus on lyrics-based features, rather than the more common approach of using audio-based features.

Plain English Explanation

The researchers have come up with a new way to automatically detect when a song is a cover version of another song. Instead of looking at the musical features of the songs, they look at the lyrics.

To do this, the researchers first created a dataset of cover songs and their original versions. This gave them examples to train their machine learning models on.

The idea behind their approach is that even if the music is different, the lyrics of a cover song will be very similar to the original. So by analyzing the lyrics, the models can learn to recognize when a song is a cover.

This is different from most previous work, which has focused on using the musical features of the songs to detect covers. The researchers think their lyrics-based approach could be more effective, especially for certain types of music where the melodies and arrangements change a lot between the original and the cover.

Technical Explanation

The key technical components of this paper are:

Dataset Construction: The researchers compiled a dataset of cover songs and their original versions, drawing from online databases and user-contributed tagging. This provided a corpus of examples to train and evaluate their models on.
Lyrics-Based Feature Extraction: Rather than relying on audio features, the researchers extracted features from the song lyrics, including lexical, syntactic, and semantic properties. This included things like word frequencies, part-of-speech tags, and sentiment analysis.
Machine Learning Models: The researchers trained several machine learning models, including logistic regression, support vector machines, and neural networks, to classify whether a given song was a cover or an original based on the lyrics-derived features.
Evaluation: The models were tested on held-out portions of the dataset, and their performance was compared against baseline audio-based approaches. The lyrics-based models showed promising results, particularly for certain genres where cover versions tend to diverge significantly from the original in musical style.

Critical Analysis

The researchers acknowledge several limitations in their work. The dataset, while large, may not be fully representative of the diversity of cover songs in the real world. Additionally, the lyrics-based approach could struggle with covers that heavily modify or rewrite the original lyrics.

Some open questions remain around how the lyrics-based features interact with audio-based features, and whether a hybrid approach could further improve performance. The researchers also note that their models do not provide explanations for their decisions, which could limit their interpretability and real-world applicability.

Overall, this work represents an interesting and novel direction in cover song detection. By shifting the focus to lyrics, the researchers have opened up new avenues for research and potential practical applications. However, further validation and development will be needed to fully realize the potential of this approach.

Conclusion

This paper presents a novel approach to cover song detection that relies on analyzing the lyrics of songs rather than their audio features. The researchers developed a dataset of cover songs and their original versions, and used this to train machine learning models to identify covers based on lexical, syntactic, and semantic properties of the lyrics.

The results show promise, particularly for genres where cover versions tend to diverge significantly from the original in musical style. This lyrics-based approach represents an interesting new direction in this research area, with potential applications in music information retrieval, copyright enforcement, and music recommendation systems.

While the work has some limitations, it lays the groundwork for further exploration of how textual features can complement or even replace audio-based approaches for certain music analysis tasks. As the field continues to evolve, this type of innovative thinking will be crucial for driving progress.

This summary was produced with help from an AI and may contain inaccuracies - check out the links to read the original source documents!

Follow @aimodelsfyi on 𝕏 →

Related Papers

Innovations in Cover Song Detection: A Lyrics-Based Approach

Maximilian Balluff, Peter Mandl, Christian Wolff

Cover songs are alternate versions of a song by a different artist. Long being a vital part of the music industry, cover songs significantly influence music culture and are commonly heard in public venues. The rise of online music platforms has further increased their prevalence, often as background music or video soundtracks. While current automatic identification methods serve adequately for original songs, they are less effective with cover songs, primarily because cover versions often significantly deviate from the original compositions. In this paper, we propose a novel method for cover song detection that utilizes the lyrics of a song. We introduce a new dataset for cover songs and their corresponding originals. The dataset contains 5078 cover songs and 2828 original songs. In contrast to other cover song datasets, it contains the annotated lyrics for the original song and the cover song. We evaluate our method on this dataset and compare it with multiple baseline approaches. Our results show that our method outperforms the baseline approaches.

6/10/2024

🤯

Detecting Synthetic Lyrics with Few-Shot Inference

Yanis Labrak, Gabriel Meseguer-Brocal, Elena V. Epure

In recent years, generated content in music has gained significant popularity, with large language models being effectively utilized to produce human-like lyrics in various styles, themes, and linguistic structures. This technological advancement supports artists in their creative processes but also raises issues of authorship infringement, consumer satisfaction and content spamming. To address these challenges, methods for detecting generated lyrics are necessary. However, existing works have not yet focused on this specific modality or on creative text in general regarding machine-generated content detection methods and datasets. In response, we have curated the first dataset of high-quality synthetic lyrics and conducted a comprehensive quantitative evaluation of various few-shot content detection approaches, testing their generalization capabilities and complementing this with a human evaluation. Our best few-shot detector, based on LLM2Vec, surpasses stylistic and statistical methods, which are shown competitive in other domains at distinguishing human-written from machine-generated content. It also shows good generalization capabilities to new artists and models, and effectively detects post-generation paraphrasing. This study emphasizes the need for further research on creative content detection, particularly in terms of generalization and scalability with larger song catalogs. All datasets, pre-processing scripts, and code are available publicly on GitHub and Hugging Face under the Apache 2.0 license.

6/24/2024

LyCon: Lyrics Reconstruction from the Bag-of-Words Using Large Language Models

Haven Kim, Kahyun Choi

This paper addresses the unique challenge of conducting research in lyric studies, where direct use of lyrics is often restricted due to copyright concerns. Unlike typical data, internet-sourced lyrics are frequently protected under copyright law, necessitating alternative approaches. Our study introduces a novel method for generating copyright-free lyrics from publicly available Bag-of-Words (BoW) datasets, which contain the vocabulary of lyrics but not the lyrics themselves. Utilizing metadata associated with BoW datasets and large language models, we successfully reconstructed lyrics. We have compiled and made available a dataset of reconstructed lyrics, LyCon, aligned with metadata from renowned sources including the Million Song Dataset, Deezer Mood Detection Dataset, and AllMusic Genre Dataset, available for public access. We believe that the integration of metadata such as mood annotations or genres enables a variety of academic experiments on lyrics, such as conditional lyric generation.

8/28/2024

SONICS: Synthetic Or Not -- Identifying Counterfeit Songs

Md Awsafur Rahman, Zaber Ibn Abdul Hakim, Najibul Haque Sarker, Bishmoy Paul, Shaikh Anowarul Fattah

The recent surge in AI-generated songs presents exciting possibilities and challenges. While these tools democratize music creation, they also necessitate the ability to distinguish between human-composed and AI-generated songs for safeguarding artistic integrity and content curation. Existing research and datasets in fake song detection only focus on singing voice deepfake detection (SVDD), where the vocals are AI-generated but the instrumental music is sourced from real songs. However, this approach is inadequate for contemporary end-to-end AI-generated songs where all components (vocals, lyrics, music, and style) could be AI-generated. Additionally, existing datasets lack lyrics-music diversity, long-duration songs, and open fake songs. To address these gaps, we introduce SONICS, a novel dataset for end-to-end Synthetic Song Detection (SSD), comprising over 97k songs with over 49k synthetic songs from popular platforms like Suno and Udio. Furthermore, we highlight the importance of modeling long-range temporal dependencies in songs for effective authenticity detection, an aspect overlooked in existing methods. To capture these patterns, we propose a novel model, SpecTTTra, that is up to 3 times faster and 6 times more memory efficient compared to popular CNN and Transformer-based models while maintaining competitive performance. Finally, we offer both AI-based and Human evaluation benchmarks, addressing another deficiency in current research.

8/28/2024