UTAR Institutional Repository

Audio files comparator using wavelet transform and similarity metrics

Lee, Da Long (2025) Audio files comparator using wavelet transform and similarity metrics. Final Year Project, UTAR.

[img]
Preview
PDF
Download (2171Kb) | Preview

    Abstract

    This project is a development-based project revolving around signal processing. The aim of this project is to develop a program that utilizes continuous wavelet transform (CWT) for audio similarity recognition. Its primary objective is to identify the similarities among audio files with different information such as file names or formats. In today’s diverse musical landscape, songs undergo various interpretations, covered in different languages, or rendered using a myriad of instruments. Compositions may span the spectrum, ranging from performances with real musical instruments to those composed solely of synthesized sounds, typically electronic dance music (EDM). Furthermore, songs exhibit versatility in their presentation, ranging from vocal renditions accompanied by instruments to whistling, humming or acapella performances. The evolution of music has also fostered the emergence of mashups and remixes, where distinct tracks seamlessly blend together to create new compositions. Despite these variations, the tunes or pitches of songs remain recognizable to the human ear and even audio detection algorithms. With the proliferation of digital music, people download songs from music applications or the internet, whether for personal listening in vehicles or to play in parties. However, these downloaded songs may vary depending on their file names and formats. Consequently, this project aims to identify identical or akin songs with various information and display out the percentage of differences between the audio files. The project’s methodology centres on Python programming, where comparisons of audio similarities will be conducted. Area of Study (Minimum 1 and Maximum 2): Signal Processing Keywords (Minimum 5 and Maximum 10): Audio Comparison, Music Cover Detection, Wavelet Transform, MFCC Analysis, Python Programming, User-Friendly Application

    Item Type: Final Year Project / Dissertation / Thesis (Final Year Project)
    Subjects: T Technology > T Technology (General)
    Divisions: Faculty of Information and Communication Technology > Bachelor of Information Technology (Honours) Computer Engineering
    Depositing User: ML Main Library
    Date Deposited: 28 Aug 2025 15:46
    Last Modified: 28 Aug 2025 15:46
    URI: http://eprints.utar.edu.my/id/eprint/7204

    Actions (login required)

    View Item