This research aims to develop an automatic gamelan musical note player that can naturally play musical note as human does. A musician estimates time to hit an instrument button in an approximate time which is as close as to the target time. The tolerated time to play a note was identified based on the human play. A gamelan musician was selected to play five note sequences of songs, and the play was recorded to be analyzed. Execution time in hitting instrument buttons in human play was identified using time-frequency analysis and peak detection to define time range which can be tolerated as time value that not too fast or not too late in hitting buttons, and then the result of the analysis was used as parameters to randomize approximate time to play a note. The evaluation shows that the program played all note sequences in the approximate time as human does and the program played more natural and better than another program which played a note as exact as its time target.
Junior thesis: Automatic tempo identification from audio signal of bollywood ...Parthe Pandit
The study presented here outlines a general structure for a tempo indentifica-
tion algorithms. A few existing methods found in literature for each of the building blocks
were tried out and simulations were performed on a few excerpts from Bollywood songs
which have the most commonly observed 4/4 beat. Based on the results of the simulations,
comparisons were made for these techniques. The main challenge of octave error in tempo
estimation is also discussed for focussed efforts in the future
The ancient Greeks, particularly Pythagoras, were the first to analyze music mathematically rather than just appreciating it as art. Pythagoras discovered that the pitch of a note was related to the length of the string producing it, and that this depended on how the string vibrated. This led to the science of acoustics. Pythagoras also developed one of the first mathematical musical scales by considering intervals of octaves and fifths. Later, it was shown that every musical pitch has a distinct frequency and wavelength that can be represented mathematically. Rhythm, tempo, note duration, and other elements of music also have mathematical representations involving fractions, ratios, and other relationships.
Pythagoras viewed music as a branch of mathematics and discovered relationships between the pitch of notes and string lengths. He developed one of the first mathematical scales based on octave and fifth intervals. Mathematics is involved in many aspects of music including frequency, duration, time signatures, rhythm, intervals, chords, counting, conducting, and contemporary music forms like 12-tone music which are based on mathematical matrices. Fractions represent note values in musical notation and show the relationship between notes and rhythmic movement in music.
This document is the first section of a student project report on using machine learning techniques for polyphonic music transcription. It introduces the topics of Fourier analysis, spectrograms, and scalograms to analyze musical instrument sounds. Sparse autoencoders are proposed to learn instrument timbre and identify individual notes. The objective is to understand these time-frequency analysis techniques and use them as features to train autoencoders to recognize instruments and transcribe notes in polyphonic music.
Sounding the Keys: A Comprehensive FFT-Based Approach for Piano Note DetectionIRJET Journal
This document presents a system that uses digital signal processing techniques like the Fast Fourier Transform (FFT) algorithm and thresholding methods to provide real-time feedback and assessment of piano performances. The system takes an audio file of a piano performance as input. It uses FFT to convert the audio from time domain to frequency domain, allowing it to detect the frequencies of individual piano notes played. Thresholding methods are used to separate notes from noise. The system assesses the performance against sheet music to calculate a correctness score. It was tested on performances of "Ode to Joy" and showed high accuracy compared to human assessments, demonstrating the potential for such systems to make piano learning more accessible through automated feedback.
Abstract :
Although solutions to the challenge of binaural artificial recreation of audio specializations exist in the Computer Music domain, a review of the area suggests that a comprehensive, generic, accurate and efficient tool set is required; hence this paper/thesis will deal with automated sound creation using sound synthesis. The entire setup was implemented using Csound. Csound implements many synthesis techniques unavailable through other means, such as user-controlled physical modeling.
This document discusses the use of artificial intelligence in organized sound as surveyed in the journal Organised Sound. It provides an overview of key AI technologies like Auto-Tune audio processing that can correct pitch and organize sound. Applications discussed include general sound classification, open sound control for music networking, and time-frequency representations for sound analysis and resynthesis. The document also outlines recent research on intelligent composer assistants, responsive instruments, and recognition of musical sounds. Finally, it discusses the future of AI in organizing sound through planning and machine learning.
This document introduces key concepts for analyzing musical rhythm, including onset, tempo, meter, bar, and rhythm. It discusses how humans perceive and group temporal musical events, and describes typical onset durations for different instruments. Accuracy for detecting onsets ranges from 2-300ms, depending on factors like tempo and instrumentation. Musical works represent these temporal concepts through notation of tempo, time signature, bars, and note values.
Junior thesis: Automatic tempo identification from audio signal of bollywood ...Parthe Pandit
The study presented here outlines a general structure for a tempo indentifica-
tion algorithms. A few existing methods found in literature for each of the building blocks
were tried out and simulations were performed on a few excerpts from Bollywood songs
which have the most commonly observed 4/4 beat. Based on the results of the simulations,
comparisons were made for these techniques. The main challenge of octave error in tempo
estimation is also discussed for focussed efforts in the future
The ancient Greeks, particularly Pythagoras, were the first to analyze music mathematically rather than just appreciating it as art. Pythagoras discovered that the pitch of a note was related to the length of the string producing it, and that this depended on how the string vibrated. This led to the science of acoustics. Pythagoras also developed one of the first mathematical musical scales by considering intervals of octaves and fifths. Later, it was shown that every musical pitch has a distinct frequency and wavelength that can be represented mathematically. Rhythm, tempo, note duration, and other elements of music also have mathematical representations involving fractions, ratios, and other relationships.
Pythagoras viewed music as a branch of mathematics and discovered relationships between the pitch of notes and string lengths. He developed one of the first mathematical scales based on octave and fifth intervals. Mathematics is involved in many aspects of music including frequency, duration, time signatures, rhythm, intervals, chords, counting, conducting, and contemporary music forms like 12-tone music which are based on mathematical matrices. Fractions represent note values in musical notation and show the relationship between notes and rhythmic movement in music.
This document is the first section of a student project report on using machine learning techniques for polyphonic music transcription. It introduces the topics of Fourier analysis, spectrograms, and scalograms to analyze musical instrument sounds. Sparse autoencoders are proposed to learn instrument timbre and identify individual notes. The objective is to understand these time-frequency analysis techniques and use them as features to train autoencoders to recognize instruments and transcribe notes in polyphonic music.
Sounding the Keys: A Comprehensive FFT-Based Approach for Piano Note DetectionIRJET Journal
This document presents a system that uses digital signal processing techniques like the Fast Fourier Transform (FFT) algorithm and thresholding methods to provide real-time feedback and assessment of piano performances. The system takes an audio file of a piano performance as input. It uses FFT to convert the audio from time domain to frequency domain, allowing it to detect the frequencies of individual piano notes played. Thresholding methods are used to separate notes from noise. The system assesses the performance against sheet music to calculate a correctness score. It was tested on performances of "Ode to Joy" and showed high accuracy compared to human assessments, demonstrating the potential for such systems to make piano learning more accessible through automated feedback.
Abstract :
Although solutions to the challenge of binaural artificial recreation of audio specializations exist in the Computer Music domain, a review of the area suggests that a comprehensive, generic, accurate and efficient tool set is required; hence this paper/thesis will deal with automated sound creation using sound synthesis. The entire setup was implemented using Csound. Csound implements many synthesis techniques unavailable through other means, such as user-controlled physical modeling.
This document discusses the use of artificial intelligence in organized sound as surveyed in the journal Organised Sound. It provides an overview of key AI technologies like Auto-Tune audio processing that can correct pitch and organize sound. Applications discussed include general sound classification, open sound control for music networking, and time-frequency representations for sound analysis and resynthesis. The document also outlines recent research on intelligent composer assistants, responsive instruments, and recognition of musical sounds. Finally, it discusses the future of AI in organizing sound through planning and machine learning.
This document introduces key concepts for analyzing musical rhythm, including onset, tempo, meter, bar, and rhythm. It discusses how humans perceive and group temporal musical events, and describes typical onset durations for different instruments. Accuracy for detecting onsets ranges from 2-300ms, depending on factors like tempo and instrumentation. Musical works represent these temporal concepts through notation of tempo, time signature, bars, and note values.
Antescofo Syncronous Languages for Musical Composition nazlitemu
Antescofo is a syncronous language that is composed of some properties that Esterel and Lustral Languages..
Antesfoco is a project that is dedicated to generate intelligent contribution of the computers to the live music syntesis.
Design and Analysis System of KNN and ID3 Algorithm for Music Classification ...IJECEIAES
Each of music which has been created, has its own mood which is emitted, therefore, there has been many researches in Music Information Retrieval (MIR) field that has been done for recognition of mood to music. This research produced software to classify music to the mood by using K-Nearest Neighbor and ID3 algorithm. In this research accuracy performance comparison and measurement of average classification time is carried out which is obtained based on the value produced from music feature extraction process. For music feature extraction process it uses 9 types of spectral analysis, consists of 400 practicing data and 400 testing data. The system produced outcome as classification label of mood type those are contentment, exuberance, depression and anxious. Classification by using algorithm of KNN is good enough that is 86.55% at k value = 3 and average processing time is 0.01021. Whereas by using ID3 it results accuracy of 59.33% and average of processing time is 0.05091 second.
The kusc classical music dataset for audio key findingijma
In this paper, we present a benchmark dataset based on the KUSC classical music collection and provide
baseline key-finding comparison results. Audio key finding is a basic music information retrieval task; it
forms an essential component of systems for music segmentation, similarity assessment, and mood
detection. Due to copyright restrictions and a labor-intensive annotation process, audio key finding
algorithms have only been evaluated using small proprietary datasets to date. To create a common base for
systematic comparisons, we have constructed a dataset comprising of more than 3,000 excerpts of classical
music. The excerpts are made publicly accessible via commonly used acoustic features such as pitch-based
spectrograms and chromagrams. We introduce a hybrid annotation scheme that combines the use of title
keys with expert validation and correction of only the challenging cases. The expert musicians also provide
ratings of key recognition difficulty. Other meta-data include instrumentation. As demonstration of use of
the dataset, and to provide initial benchmark comparisons for evaluating new algorithms, we conduct a
series of experiments reporting key determination accuracy of four state-of-the-art algorithms. We further
show the importance of considering factors such as estimated tuning frequency, key strength or confidence
value, and key recognition difficulty in key finding. In the future, we plan to expand the dataset to include
meta-data for other music information retrieval tasks.
The document describes an audio player component of the Human-Computer Music Performance (HCMP) system. The audio player allows HCMP to accompany live music performances using audio files rather than just MIDI. Key aspects of the audio player include using a phase vocoder to change the playback speed of audio files without altering pitch, and using a "label track" to map beats in the audio file to a shared clock for synchronization. The audio player operates in different play modes (tempo track, override, conductor-controlled) and uses various "beat layers" to account for differences between beats in the audio, score, and live performance.
The Influence of perceptual Attack times in
Networked Music performance
Pilot Study conducted at CCRMA, Stanford University (2011)
44th AES conference (2011 - San Diego)
Feature Extraction of Musical Instrument Tones using FFT and Segment AveragingTELKOMNIKA JOURNAL
A feature extraction for musical instrument tones that based on a transform domain approach was
proposed in this paper. The aim of the proposed feature extraction was to get the lower feature extraction
coefficients. In general, the proposed feature extraction was carried out as follow. Firstly, the input signal
was transformed using FFT (Fast Fourier Transform). Secondly, the left half of the transformed signal was
divided into a number of segments. Finally, the averaging results of that segments, was the feature
extraction of the input signal. Based on the test results, the proposed feature extraction was highly efficient
for the tones, which have many significant local peaks in the Fourier transform domain, because it only
required at least four feature extraction coefficients, in order to represent every tone.
Class 12th presentation on maths in musicSakshiBisht48
Math's Project on the topic of "Math in Music" is summarized as follows:
1) The project introduces the connections between math and music, such as counting, rhythm, scales, intervals, patterns, and harmonies that relate to mathematical concepts.
2) It provides historical background on figures like Pythagoras who first investigated musical scales in terms of numerical ratios and frequency.
3) Key mathematical elements in music are discussed like the Fibonacci sequence, golden ratio, set theory, abstract algebra, and how composers have incorporated mathematical principles and patterns.
4) Relationships are examined between elements like rhythm and numbers, frequency and pitch, and how patterns are fundamental to both math and musical structure
Graphical visualization of musical emotionsPranay Prasoon
The document discusses graphical visualization of musical emotions using artificial neural networks. 13 audio features are extracted from Hindustani classical music clips labeled as happy or sad. An ANN model with backpropagation algorithm is trained on 70% of data, validated on 15% and tested on 15%. The model correctly classified 15 of 17 happy clips and 21 of 22 sad clips. Testing was repeated 10 times with over 90% accuracy each time, showing the model effectively recognizes musical emotions. Future work involves expanding the model to recognize additional emotions and incorporating physiological features.
This document discusses a system for controlling the emotional content of pre-composed music to express an intended emotion. The system uses a knowledge base mapping music features like rhythm, melody, and instrumentation to emotional dimensions of valence and arousal. Music is selected and transformed based on these mappings. The system was evaluated and shown to be effective, correctly selecting music with correlations of 87.4% for valence and 84.8% for arousal, and transforming music with correlations of 67.8% for valence and 69.4% for arousal. The system allows customizing music to express different emotions through automated selection and transformation of pre-existing music.
This document summarizes an article from the International Journal of Computer Engineering and Technology (IJCET) that proposes a system for identifying ragas in North Indian classical music. It begins with background on ragas and their identifying musical elements. The system first uses a pitch detection tool to convert an audio file into a note sequence. It then applies two pakad matching methods - n-gram matching and delta occurrence with alpha-bounded gaps - to identify which raga's pakad most closely matches the note sequence. This identifies the raga of the input music. The document provides details on each step of the system to enable accurate raga identification from an audio file.
Audio descriptive analysis of singer and musical instrument identification in...eSAT Journals
Abstract Music information retrieval (MIR) has reached to a reasonably stable state after advancement in the Low Level audio Descriptors (LLDs) and feature extraction techniques. The analysis of sound has now become simple by the continuous efforts and research of MIR community in the field of signal processing from last two decades. In north Indian classical music, a singer is accompanied by some instruments such as harmonium, violin or flute. These instruments are tuned in the same musical scale (pitch range) in which the singer is signing. Separate researches have been made in recent past to identify a musical instrument and a singer. In this paper, we have analyzed the low level audio descriptors, for singing voice and musical instrument sound together, that appears to human ear as similar with respect to ‘timbre’, to see if we could treat them same and use identification/ classification routines to classify them into their classes. We have used Hybrid Selection algorithm from wrapper technique(the one that uses classifier also in feature selection process) to identify and extract the features and K-Means and K nearest neighbor classifiers to classify and cross verify the accuracy of classification. The accuracy of classification achieved was 91.1% which clearly proves that musical instruments and singing voice that sounds similar in timbral aspect can be grouped together and classification is possible with mixed database of instruments and singing voices. Keywords: Music Information Retrieval (MIR), Timbre, Singing Voice, Low level Descriptors (LLD, North Indian Classical music. MIRTOOL BOX
The document discusses an improvisation support system called JamSketch that allows musical novices to enjoy creating music. JamSketch uses a simple drawing interface where users draw melodic outlines, and a genetic algorithm then generates a complete melody in real-time based on the outline. An experiment found the system-generated melodies were comparable to inexperienced human players but lacked human-likeness. Ongoing work aims to improve rhythm variety and allow input on smartphones or with eye tracking. The author believes systems should act as ghostwriters to support but not replace human musical ideas.
The document outlines Olmo Cornelis' research on opportunities for symbiosis between Western and non-Western musical idioms from 2008-2014. It discusses his background, previous work digitizing an ethnomusicological archive, challenges in accurately describing non-Western music, and a proposed methodology using music information retrieval techniques to objectively analyze ethnic music and inform new compositions blending musical elements from different cultures.
Amazon products reviews classification based on machine learning, deep learni...TELKOMNIKA JOURNAL
In recent times, the trend of online shopping through e-commerce stores and websites has grown to a huge extent. Whenever a product is purchased on an e-commerce platform, people leave their reviews about the product. These reviews are very helpful for the store owners and the product’s manufacturers for the betterment of their work process as well as product quality. An automated system is proposed in this work that operates on two datasets D1 and D2 obtained from Amazon. After certain preprocessing steps, N-gram and word embedding-based features are extracted using term frequency-inverse document frequency (TF-IDF), bag of words (BoW) and global vectors (GloVe), and Word2vec, respectively. Four machine learning (ML) models support vector machines (SVM), logistic regression (RF), logistic regression (LR), multinomial Naïve Bayes (MNB), two deep learning (DL) models convolutional neural network (CNN), long-short term memory (LSTM), and standalone bidirectional encoder representations (BERT) are used to classify reviews as either positive or negative. The results obtained by the standard ML, DL models and BERT are evaluated using certain performance evaluation measures. BERT turns out to be the best-performing model in the case of D1 with an accuracy of 90% on features derived by word embedding models while the CNN provides the best accuracy of 97% upon word embedding features in the case of D2. The proposed model shows better overall performance on D2 as compared to D1.
Design, simulation, and analysis of microstrip patch antenna for wireless app...TELKOMNIKA JOURNAL
In this study, a microstrip patch antenna that works at 3.6 GHz was built and tested to see how well it works. In this work, Rogers RT/Duroid 5880 has been used as the substrate material, with a dielectric permittivity of 2.2 and a thickness of 0.3451 mm; it serves as the base for the examined antenna. The computer simulation technology (CST) studio suite is utilized to show the recommended antenna design. The goal of this study was to get a more extensive transmission capacity, a lower voltage standing wave ratio (VSWR), and a lower return loss, but the main goal was to get a higher gain, directivity, and efficiency. After simulation, the return loss, gain, directivity, bandwidth, and efficiency of the supplied antenna are found to be -17.626 dB, 9.671 dBi, 9.924 dBi, 0.2 GHz, and 97.45%, respectively. Besides, the recreation uncovered that the transfer speed side-lobe level at phi was much better than those of the earlier works, at -28.8 dB, respectively. Thus, it makes a solid contender for remote innovation and more robust communication.
Design and simulation an optimal enhanced PI controller for congestion avoida...TELKOMNIKA JOURNAL
This document describes using a snake optimization algorithm to tune the gains of an enhanced proportional-integral controller for congestion avoidance in a TCP/AQM system. The controller aims to maintain a stable and desired queue size without noise or transmission problems. A linearized model of the TCP/AQM system is presented. An enhanced PI controller combining nonlinear gain and original PI gains is proposed. The snake optimization algorithm is then used to tune the parameters of the enhanced PI controller to achieve optimal system performance and response. Simulation results are discussed showing the proposed controller provides a stable and robust behavior for congestion control.
Improving the detection of intrusion in vehicular ad-hoc networks with modifi...TELKOMNIKA JOURNAL
Vehicular ad-hoc networks (VANETs) are wireless-equipped vehicles that form networks along the road. The security of this network has been a major challenge. The identity-based cryptosystem (IBC) previously used to secure the networks suffers from membership authentication security features. This paper focuses on improving the detection of intruders in VANETs with a modified identity-based cryptosystem (MIBC). The MIBC is developed using a non-singular elliptic curve with Lagrange interpolation. The public key of vehicles and roadside units on the network are derived from number plates and location identification numbers, respectively. Pseudo-identities are used to mask the real identity of users to preserve their privacy. The membership authentication mechanism ensures that only valid and authenticated members of the network are allowed to join the network. The performance of the MIBC is evaluated using intrusion detection ratio (IDR) and computation time (CT) and then validated with the existing IBC. The result obtained shows that the MIBC recorded an IDR of 99.3% against 94.3% obtained for the existing identity-based cryptosystem (EIBC) for 140 unregistered vehicles attempting to intrude on the network. The MIBC shows lower CT values of 1.17 ms against 1.70 ms for EIBC. The MIBC can be used to improve the security of VANETs.
Conceptual model of internet banking adoption with perceived risk and trust f...TELKOMNIKA JOURNAL
Understanding the primary factors of internet banking (IB) acceptance is critical for both banks and users; nevertheless, our knowledge of the role of users’ perceived risk and trust in IB adoption is limited. As a result, we develop a conceptual model by incorporating perceived risk and trust into the technology acceptance model (TAM) theory toward the IB. The proper research emphasized that the most essential component in explaining IB adoption behavior is behavioral intention to use IB adoption. TAM is helpful for figuring out how elements that affect IB adoption are connected to one another. According to previous literature on IB and the use of such technology in Iraq, one has to choose a theoretical foundation that may justify the acceptance of IB from the customer’s perspective. The conceptual model was therefore constructed using the TAM as a foundation. Furthermore, perceived risk and trust were added to the TAM dimensions as external factors. The key objective of this work was to extend the TAM to construct a conceptual model for IB adoption and to get sufficient theoretical support from the existing literature for the essential elements and their relationships in order to unearth new insights about factors responsible for IB adoption.
Efficient combined fuzzy logic and LMS algorithm for smart antennaTELKOMNIKA JOURNAL
The smart antennas are broadly used in wireless communication. The least mean square (LMS) algorithm is a procedure that is concerned in controlling the smart antenna pattern to accommodate specified requirements such as steering the beam toward the desired signal, in addition to placing the deep nulls in the direction of unwanted signals. The conventional LMS (C-LMS) has some drawbacks like slow convergence speed besides high steady state fluctuation error. To overcome these shortcomings, the present paper adopts an adaptive fuzzy control step size least mean square (FC-LMS) algorithm to adjust its step size. Computer simulation outcomes illustrate that the given model has fast convergence rate as well as low mean square error steady state.
Design and implementation of a LoRa-based system for warning of forest fireTELKOMNIKA JOURNAL
This paper presents the design and implementation of a forest fire monitoring and warning system based on long range (LoRa) technology, a novel ultra-low power consumption and long-range wireless communication technology for remote sensing applications. The proposed system includes a wireless sensor network that records environmental parameters such as temperature, humidity, wind speed, and carbon dioxide (CO2) concentration in the air, as well as taking infrared photos.The data collected at each sensor node will be transmitted to the gateway via LoRa wireless transmission. Data will be collected, processed, and uploaded to a cloud database at the gateway. An Android smartphone application that allows anyone to easily view the recorded data has been developed. When a fire is detected, the system will sound a siren and send a warning message to the responsible personnel, instructing them to take appropriate action. Experiments in Tram Chim Park, Vietnam, have been conducted to verify and evaluate the operation of the system.
Wavelet-based sensing technique in cognitive radio networkTELKOMNIKA JOURNAL
Cognitive radio is a smart radio that can change its transmitter parameter based on interaction with the environment in which it operates. The demand for frequency spectrum is growing due to a big data issue as many Internet of Things (IoT) devices are in the network. Based on previous research, most frequency spectrum was used, but some spectrums were not used, called spectrum hole. Energy detection is one of the spectrum sensing methods that has been frequently used since it is easy to use and does not require license users to have any prior signal understanding. But this technique is incapable of detecting at low signal-to-noise ratio (SNR) levels. Therefore, the wavelet-based sensing is proposed to overcome this issue and detect spectrum holes. The main objective of this work is to evaluate the performance of wavelet-based sensing and compare it with the energy detection technique. The findings show that the percentage of detection in wavelet-based sensing is 83% higher than energy detection performance. This result indicates that the wavelet-based sensing has higher precision in detection and the interference towards primary user can be decreased.
A novel compact dual-band bandstop filter with enhanced rejection bandsTELKOMNIKA JOURNAL
In this paper, we present the design of a new wide dual-band bandstop filter (DBBSF) using nonuniform transmission lines. The method used to design this filter is to replace conventional uniform transmission lines with nonuniform lines governed by a truncated Fourier series. Based on how impedances are profiled in the proposed DBBSF structure, the fractional bandwidths of the two 10 dB-down rejection bands are widened to 39.72% and 52.63%, respectively, and the physical size has been reduced compared to that of the filter with the uniform transmission lines. The results of the electromagnetic (EM) simulation support the obtained analytical response and show an improved frequency behavior.
More Related Content
Similar to Natural automatic musical note player using time-frequency analysis on human play
Antescofo Syncronous Languages for Musical Composition nazlitemu
Antescofo is a syncronous language that is composed of some properties that Esterel and Lustral Languages..
Antesfoco is a project that is dedicated to generate intelligent contribution of the computers to the live music syntesis.
Design and Analysis System of KNN and ID3 Algorithm for Music Classification ...IJECEIAES
Each of music which has been created, has its own mood which is emitted, therefore, there has been many researches in Music Information Retrieval (MIR) field that has been done for recognition of mood to music. This research produced software to classify music to the mood by using K-Nearest Neighbor and ID3 algorithm. In this research accuracy performance comparison and measurement of average classification time is carried out which is obtained based on the value produced from music feature extraction process. For music feature extraction process it uses 9 types of spectral analysis, consists of 400 practicing data and 400 testing data. The system produced outcome as classification label of mood type those are contentment, exuberance, depression and anxious. Classification by using algorithm of KNN is good enough that is 86.55% at k value = 3 and average processing time is 0.01021. Whereas by using ID3 it results accuracy of 59.33% and average of processing time is 0.05091 second.
The kusc classical music dataset for audio key findingijma
In this paper, we present a benchmark dataset based on the KUSC classical music collection and provide
baseline key-finding comparison results. Audio key finding is a basic music information retrieval task; it
forms an essential component of systems for music segmentation, similarity assessment, and mood
detection. Due to copyright restrictions and a labor-intensive annotation process, audio key finding
algorithms have only been evaluated using small proprietary datasets to date. To create a common base for
systematic comparisons, we have constructed a dataset comprising of more than 3,000 excerpts of classical
music. The excerpts are made publicly accessible via commonly used acoustic features such as pitch-based
spectrograms and chromagrams. We introduce a hybrid annotation scheme that combines the use of title
keys with expert validation and correction of only the challenging cases. The expert musicians also provide
ratings of key recognition difficulty. Other meta-data include instrumentation. As demonstration of use of
the dataset, and to provide initial benchmark comparisons for evaluating new algorithms, we conduct a
series of experiments reporting key determination accuracy of four state-of-the-art algorithms. We further
show the importance of considering factors such as estimated tuning frequency, key strength or confidence
value, and key recognition difficulty in key finding. In the future, we plan to expand the dataset to include
meta-data for other music information retrieval tasks.
The document describes an audio player component of the Human-Computer Music Performance (HCMP) system. The audio player allows HCMP to accompany live music performances using audio files rather than just MIDI. Key aspects of the audio player include using a phase vocoder to change the playback speed of audio files without altering pitch, and using a "label track" to map beats in the audio file to a shared clock for synchronization. The audio player operates in different play modes (tempo track, override, conductor-controlled) and uses various "beat layers" to account for differences between beats in the audio, score, and live performance.
The Influence of perceptual Attack times in
Networked Music performance
Pilot Study conducted at CCRMA, Stanford University (2011)
44th AES conference (2011 - San Diego)
Feature Extraction of Musical Instrument Tones using FFT and Segment AveragingTELKOMNIKA JOURNAL
A feature extraction for musical instrument tones that based on a transform domain approach was
proposed in this paper. The aim of the proposed feature extraction was to get the lower feature extraction
coefficients. In general, the proposed feature extraction was carried out as follow. Firstly, the input signal
was transformed using FFT (Fast Fourier Transform). Secondly, the left half of the transformed signal was
divided into a number of segments. Finally, the averaging results of that segments, was the feature
extraction of the input signal. Based on the test results, the proposed feature extraction was highly efficient
for the tones, which have many significant local peaks in the Fourier transform domain, because it only
required at least four feature extraction coefficients, in order to represent every tone.
Class 12th presentation on maths in musicSakshiBisht48
Math's Project on the topic of "Math in Music" is summarized as follows:
1) The project introduces the connections between math and music, such as counting, rhythm, scales, intervals, patterns, and harmonies that relate to mathematical concepts.
2) It provides historical background on figures like Pythagoras who first investigated musical scales in terms of numerical ratios and frequency.
3) Key mathematical elements in music are discussed like the Fibonacci sequence, golden ratio, set theory, abstract algebra, and how composers have incorporated mathematical principles and patterns.
4) Relationships are examined between elements like rhythm and numbers, frequency and pitch, and how patterns are fundamental to both math and musical structure
Graphical visualization of musical emotionsPranay Prasoon
The document discusses graphical visualization of musical emotions using artificial neural networks. 13 audio features are extracted from Hindustani classical music clips labeled as happy or sad. An ANN model with backpropagation algorithm is trained on 70% of data, validated on 15% and tested on 15%. The model correctly classified 15 of 17 happy clips and 21 of 22 sad clips. Testing was repeated 10 times with over 90% accuracy each time, showing the model effectively recognizes musical emotions. Future work involves expanding the model to recognize additional emotions and incorporating physiological features.
This document discusses a system for controlling the emotional content of pre-composed music to express an intended emotion. The system uses a knowledge base mapping music features like rhythm, melody, and instrumentation to emotional dimensions of valence and arousal. Music is selected and transformed based on these mappings. The system was evaluated and shown to be effective, correctly selecting music with correlations of 87.4% for valence and 84.8% for arousal, and transforming music with correlations of 67.8% for valence and 69.4% for arousal. The system allows customizing music to express different emotions through automated selection and transformation of pre-existing music.
This document summarizes an article from the International Journal of Computer Engineering and Technology (IJCET) that proposes a system for identifying ragas in North Indian classical music. It begins with background on ragas and their identifying musical elements. The system first uses a pitch detection tool to convert an audio file into a note sequence. It then applies two pakad matching methods - n-gram matching and delta occurrence with alpha-bounded gaps - to identify which raga's pakad most closely matches the note sequence. This identifies the raga of the input music. The document provides details on each step of the system to enable accurate raga identification from an audio file.
Audio descriptive analysis of singer and musical instrument identification in...eSAT Journals
Abstract Music information retrieval (MIR) has reached to a reasonably stable state after advancement in the Low Level audio Descriptors (LLDs) and feature extraction techniques. The analysis of sound has now become simple by the continuous efforts and research of MIR community in the field of signal processing from last two decades. In north Indian classical music, a singer is accompanied by some instruments such as harmonium, violin or flute. These instruments are tuned in the same musical scale (pitch range) in which the singer is signing. Separate researches have been made in recent past to identify a musical instrument and a singer. In this paper, we have analyzed the low level audio descriptors, for singing voice and musical instrument sound together, that appears to human ear as similar with respect to ‘timbre’, to see if we could treat them same and use identification/ classification routines to classify them into their classes. We have used Hybrid Selection algorithm from wrapper technique(the one that uses classifier also in feature selection process) to identify and extract the features and K-Means and K nearest neighbor classifiers to classify and cross verify the accuracy of classification. The accuracy of classification achieved was 91.1% which clearly proves that musical instruments and singing voice that sounds similar in timbral aspect can be grouped together and classification is possible with mixed database of instruments and singing voices. Keywords: Music Information Retrieval (MIR), Timbre, Singing Voice, Low level Descriptors (LLD, North Indian Classical music. MIRTOOL BOX
The document discusses an improvisation support system called JamSketch that allows musical novices to enjoy creating music. JamSketch uses a simple drawing interface where users draw melodic outlines, and a genetic algorithm then generates a complete melody in real-time based on the outline. An experiment found the system-generated melodies were comparable to inexperienced human players but lacked human-likeness. Ongoing work aims to improve rhythm variety and allow input on smartphones or with eye tracking. The author believes systems should act as ghostwriters to support but not replace human musical ideas.
The document outlines Olmo Cornelis' research on opportunities for symbiosis between Western and non-Western musical idioms from 2008-2014. It discusses his background, previous work digitizing an ethnomusicological archive, challenges in accurately describing non-Western music, and a proposed methodology using music information retrieval techniques to objectively analyze ethnic music and inform new compositions blending musical elements from different cultures.
Similar to Natural automatic musical note player using time-frequency analysis on human play (13)
Amazon products reviews classification based on machine learning, deep learni...TELKOMNIKA JOURNAL
In recent times, the trend of online shopping through e-commerce stores and websites has grown to a huge extent. Whenever a product is purchased on an e-commerce platform, people leave their reviews about the product. These reviews are very helpful for the store owners and the product’s manufacturers for the betterment of their work process as well as product quality. An automated system is proposed in this work that operates on two datasets D1 and D2 obtained from Amazon. After certain preprocessing steps, N-gram and word embedding-based features are extracted using term frequency-inverse document frequency (TF-IDF), bag of words (BoW) and global vectors (GloVe), and Word2vec, respectively. Four machine learning (ML) models support vector machines (SVM), logistic regression (RF), logistic regression (LR), multinomial Naïve Bayes (MNB), two deep learning (DL) models convolutional neural network (CNN), long-short term memory (LSTM), and standalone bidirectional encoder representations (BERT) are used to classify reviews as either positive or negative. The results obtained by the standard ML, DL models and BERT are evaluated using certain performance evaluation measures. BERT turns out to be the best-performing model in the case of D1 with an accuracy of 90% on features derived by word embedding models while the CNN provides the best accuracy of 97% upon word embedding features in the case of D2. The proposed model shows better overall performance on D2 as compared to D1.
Design, simulation, and analysis of microstrip patch antenna for wireless app...TELKOMNIKA JOURNAL
In this study, a microstrip patch antenna that works at 3.6 GHz was built and tested to see how well it works. In this work, Rogers RT/Duroid 5880 has been used as the substrate material, with a dielectric permittivity of 2.2 and a thickness of 0.3451 mm; it serves as the base for the examined antenna. The computer simulation technology (CST) studio suite is utilized to show the recommended antenna design. The goal of this study was to get a more extensive transmission capacity, a lower voltage standing wave ratio (VSWR), and a lower return loss, but the main goal was to get a higher gain, directivity, and efficiency. After simulation, the return loss, gain, directivity, bandwidth, and efficiency of the supplied antenna are found to be -17.626 dB, 9.671 dBi, 9.924 dBi, 0.2 GHz, and 97.45%, respectively. Besides, the recreation uncovered that the transfer speed side-lobe level at phi was much better than those of the earlier works, at -28.8 dB, respectively. Thus, it makes a solid contender for remote innovation and more robust communication.
Design and simulation an optimal enhanced PI controller for congestion avoida...TELKOMNIKA JOURNAL
This document describes using a snake optimization algorithm to tune the gains of an enhanced proportional-integral controller for congestion avoidance in a TCP/AQM system. The controller aims to maintain a stable and desired queue size without noise or transmission problems. A linearized model of the TCP/AQM system is presented. An enhanced PI controller combining nonlinear gain and original PI gains is proposed. The snake optimization algorithm is then used to tune the parameters of the enhanced PI controller to achieve optimal system performance and response. Simulation results are discussed showing the proposed controller provides a stable and robust behavior for congestion control.
Improving the detection of intrusion in vehicular ad-hoc networks with modifi...TELKOMNIKA JOURNAL
Vehicular ad-hoc networks (VANETs) are wireless-equipped vehicles that form networks along the road. The security of this network has been a major challenge. The identity-based cryptosystem (IBC) previously used to secure the networks suffers from membership authentication security features. This paper focuses on improving the detection of intruders in VANETs with a modified identity-based cryptosystem (MIBC). The MIBC is developed using a non-singular elliptic curve with Lagrange interpolation. The public key of vehicles and roadside units on the network are derived from number plates and location identification numbers, respectively. Pseudo-identities are used to mask the real identity of users to preserve their privacy. The membership authentication mechanism ensures that only valid and authenticated members of the network are allowed to join the network. The performance of the MIBC is evaluated using intrusion detection ratio (IDR) and computation time (CT) and then validated with the existing IBC. The result obtained shows that the MIBC recorded an IDR of 99.3% against 94.3% obtained for the existing identity-based cryptosystem (EIBC) for 140 unregistered vehicles attempting to intrude on the network. The MIBC shows lower CT values of 1.17 ms against 1.70 ms for EIBC. The MIBC can be used to improve the security of VANETs.
Conceptual model of internet banking adoption with perceived risk and trust f...TELKOMNIKA JOURNAL
Understanding the primary factors of internet banking (IB) acceptance is critical for both banks and users; nevertheless, our knowledge of the role of users’ perceived risk and trust in IB adoption is limited. As a result, we develop a conceptual model by incorporating perceived risk and trust into the technology acceptance model (TAM) theory toward the IB. The proper research emphasized that the most essential component in explaining IB adoption behavior is behavioral intention to use IB adoption. TAM is helpful for figuring out how elements that affect IB adoption are connected to one another. According to previous literature on IB and the use of such technology in Iraq, one has to choose a theoretical foundation that may justify the acceptance of IB from the customer’s perspective. The conceptual model was therefore constructed using the TAM as a foundation. Furthermore, perceived risk and trust were added to the TAM dimensions as external factors. The key objective of this work was to extend the TAM to construct a conceptual model for IB adoption and to get sufficient theoretical support from the existing literature for the essential elements and their relationships in order to unearth new insights about factors responsible for IB adoption.
Efficient combined fuzzy logic and LMS algorithm for smart antennaTELKOMNIKA JOURNAL
The smart antennas are broadly used in wireless communication. The least mean square (LMS) algorithm is a procedure that is concerned in controlling the smart antenna pattern to accommodate specified requirements such as steering the beam toward the desired signal, in addition to placing the deep nulls in the direction of unwanted signals. The conventional LMS (C-LMS) has some drawbacks like slow convergence speed besides high steady state fluctuation error. To overcome these shortcomings, the present paper adopts an adaptive fuzzy control step size least mean square (FC-LMS) algorithm to adjust its step size. Computer simulation outcomes illustrate that the given model has fast convergence rate as well as low mean square error steady state.
Design and implementation of a LoRa-based system for warning of forest fireTELKOMNIKA JOURNAL
This paper presents the design and implementation of a forest fire monitoring and warning system based on long range (LoRa) technology, a novel ultra-low power consumption and long-range wireless communication technology for remote sensing applications. The proposed system includes a wireless sensor network that records environmental parameters such as temperature, humidity, wind speed, and carbon dioxide (CO2) concentration in the air, as well as taking infrared photos.The data collected at each sensor node will be transmitted to the gateway via LoRa wireless transmission. Data will be collected, processed, and uploaded to a cloud database at the gateway. An Android smartphone application that allows anyone to easily view the recorded data has been developed. When a fire is detected, the system will sound a siren and send a warning message to the responsible personnel, instructing them to take appropriate action. Experiments in Tram Chim Park, Vietnam, have been conducted to verify and evaluate the operation of the system.
Wavelet-based sensing technique in cognitive radio networkTELKOMNIKA JOURNAL
Cognitive radio is a smart radio that can change its transmitter parameter based on interaction with the environment in which it operates. The demand for frequency spectrum is growing due to a big data issue as many Internet of Things (IoT) devices are in the network. Based on previous research, most frequency spectrum was used, but some spectrums were not used, called spectrum hole. Energy detection is one of the spectrum sensing methods that has been frequently used since it is easy to use and does not require license users to have any prior signal understanding. But this technique is incapable of detecting at low signal-to-noise ratio (SNR) levels. Therefore, the wavelet-based sensing is proposed to overcome this issue and detect spectrum holes. The main objective of this work is to evaluate the performance of wavelet-based sensing and compare it with the energy detection technique. The findings show that the percentage of detection in wavelet-based sensing is 83% higher than energy detection performance. This result indicates that the wavelet-based sensing has higher precision in detection and the interference towards primary user can be decreased.
A novel compact dual-band bandstop filter with enhanced rejection bandsTELKOMNIKA JOURNAL
In this paper, we present the design of a new wide dual-band bandstop filter (DBBSF) using nonuniform transmission lines. The method used to design this filter is to replace conventional uniform transmission lines with nonuniform lines governed by a truncated Fourier series. Based on how impedances are profiled in the proposed DBBSF structure, the fractional bandwidths of the two 10 dB-down rejection bands are widened to 39.72% and 52.63%, respectively, and the physical size has been reduced compared to that of the filter with the uniform transmission lines. The results of the electromagnetic (EM) simulation support the obtained analytical response and show an improved frequency behavior.
Deep learning approach to DDoS attack with imbalanced data at the application...TELKOMNIKA JOURNAL
A distributed denial of service (DDoS) attack is where one or more computers attack or target a server computer, by flooding internet traffic to the server. As a result, the server cannot be accessed by legitimate users. A result of this attack causes enormous losses for a company because it can reduce the level of user trust, and reduce the company’s reputation to lose customers due to downtime. One of the services at the application layer that can be accessed by users is a web-based lightweight directory access protocol (LDAP) service that can provide safe and easy services to access directory applications. We used a deep learning approach to detect DDoS attacks on the CICDDoS 2019 dataset on a complex computer network at the application layer to get fast and accurate results for dealing with unbalanced data. Based on the results obtained, it is observed that DDoS attack detection using a deep learning approach on imbalanced data performs better when implemented using synthetic minority oversampling technique (SMOTE) method for binary classes. On the other hand, the proposed deep learning approach performs better for detecting DDoS attacks in multiclass when implemented using the adaptive synthetic (ADASYN) method.
The appearance of uncertainties and disturbances often effects the characteristics of either linear or nonlinear systems. Plus, the stabilization process may be deteriorated thus incurring a catastrophic effect to the system performance. As such, this manuscript addresses the concept of matching condition for the systems that are suffering from miss-match uncertainties and exogeneous disturbances. The perturbation towards the system at hand is assumed to be known and unbounded. To reach this outcome, uncertainties and their classifications are reviewed thoroughly. The structural matching condition is proposed and tabulated in the proposition 1. Two types of mathematical expressions are presented to distinguish the system with matched uncertainty and the system with miss-matched uncertainty. Lastly, two-dimensional numerical expressions are provided to practice the proposed proposition. The outcome shows that matching condition has the ability to change the system to a design-friendly model for asymptotic stabilization.
Implementation of FinFET technology based low power 4×4 Wallace tree multipli...TELKOMNIKA JOURNAL
Many systems, including digital signal processors, finite impulse response (FIR) filters, application-specific integrated circuits, and microprocessors, use multipliers. The demand for low power multipliers is gradually rising day by day in the current technological trend. In this study, we describe a 4×4 Wallace multiplier based on a carry select adder (CSA) that uses less power and has a better power delay product than existing multipliers. HSPICE tool at 16 nm technology is used to simulate the results. In comparison to the traditional CSA-based multiplier, which has a power consumption of 1.7 µW and power delay product (PDP) of 57.3 fJ, the results demonstrate that the Wallace multiplier design employing CSA with first zero finding logic (FZF) logic has the lowest power consumption of 1.4 µW and PDP of 27.5 fJ.
Evaluation of the weighted-overlap add model with massive MIMO in a 5G systemTELKOMNIKA JOURNAL
The flaw in 5G orthogonal frequency division multiplexing (OFDM) becomes apparent in high-speed situations. Because the doppler effect causes frequency shifts, the orthogonality of OFDM subcarriers is broken, lowering both their bit error rate (BER) and throughput output. As part of this research, we use a novel design that combines massive multiple input multiple output (MIMO) and weighted overlap and add (WOLA) to improve the performance of 5G systems. To determine which design is superior, throughput and BER are calculated for both the proposed design and OFDM. The results of the improved system show a massive improvement in performance ver the conventional system and significant improvements with massive MIMO, including the best throughput and BER. When compared to conventional systems, the improved system has a throughput that is around 22% higher and the best performance in terms of BER, but it still has around 25% less error than OFDM.
Reflector antenna design in different frequencies using frequency selective s...TELKOMNIKA JOURNAL
In this study, it is aimed to obtain two different asymmetric radiation patterns obtained from antennas in the shape of the cross-section of a parabolic reflector (fan blade type antennas) and antennas with cosecant-square radiation characteristics at two different frequencies from a single antenna. For this purpose, firstly, a fan blade type antenna design will be made, and then the reflective surface of this antenna will be completed to the shape of the reflective surface of the antenna with the cosecant-square radiation characteristic with the frequency selective surface designed to provide the characteristics suitable for the purpose. The frequency selective surface designed and it provides the perfect transmission as possible at 4 GHz operating frequency, while it will act as a band-quenching filter for electromagnetic waves at 5 GHz operating frequency and will be a reflective surface. Thanks to this frequency selective surface to be used as a reflective surface in the antenna, a fan blade type radiation characteristic at 4 GHz operating frequency will be obtained, while a cosecant-square radiation characteristic at 5 GHz operating frequency will be obtained.
Reagentless iron detection in water based on unclad fiber optical sensorTELKOMNIKA JOURNAL
A simple and low-cost fiber based optical sensor for iron detection is demonstrated in this paper. The sensor head consist of an unclad optical fiber with the unclad length of 1 cm and it has a straight structure. Results obtained shows a linear relationship between the output light intensity and iron concentration, illustrating the functionality of this iron optical sensor. Based on the experimental results, the sensitivity and linearity are achieved at 0.0328/ppm and 0.9824 respectively at the wavelength of 690 nm. With the same wavelength, other performance parameters are also studied. Resolution and limit of detection (LOD) are found to be 0.3049 ppm and 0.0755 ppm correspondingly. This iron sensor is advantageous in that it does not require any reagent for detection, enabling it to be simpler and cost-effective in the implementation of the iron sensing.
Impact of CuS counter electrode calcination temperature on quantum dot sensit...TELKOMNIKA JOURNAL
In place of the commercial Pt electrode used in quantum sensitized solar cells, the low-cost CuS cathode is created using electrophoresis. High resolution scanning electron microscopy and X-ray diffraction were used to analyze the structure and morphology of structural cubic samples with diameters ranging from 40 nm to 200 nm. The conversion efficiency of solar cells is significantly impacted by the calcination temperatures of cathodes at 100 °C, 120 °C, 150 °C, and 180 °C under vacuum. The fluorine doped tin oxide (FTO)/CuS cathode electrode reached a maximum efficiency of 3.89% when it was calcined at 120 °C. Compared to other temperature combinations, CuS nanoparticles crystallize at 120 °C, which lowers resistance while increasing electron lifetime.
In place of the commercial Pt electrode used in quantum sensitized solar cells, the low-cost CuS cathode is created using electrophoresis. High resolution scanning electron microscopy and X-ray diffraction were used to analyze the structure and morphology of structural cubic samples with diameters ranging from 40 nm to 200 nm. The conversion efficiency of solar cells is significantly impacted by the calcination temperatures of cathodes at 100 °C, 120 °C, 150 °C, and 180 °C under vacuum. The fluorine doped tin oxide (FTO)/CuS cathode electrode reached a maximum efficiency of 3.89% when it was calcined at 120 °C. Compared to other temperature combinations, CuS nanoparticles crystallize at 120 °C, which lowers resistance while increasing electron lifetime.
A progressive learning for structural tolerance online sequential extreme lea...TELKOMNIKA JOURNAL
This article discusses the progressive learning for structural tolerance online sequential extreme learning machine (PSTOS-ELM). PSTOS-ELM can save robust accuracy while updating the new data and the new class data on the online training situation. The robustness accuracy arises from using the householder block exact QR decomposition recursive least squares (HBQRD-RLS) of the PSTOS-ELM. This method is suitable for applications that have data streaming and often have new class data. Our experiment compares the PSTOS-ELM accuracy and accuracy robustness while data is updating with the batch-extreme learning machine (ELM) and structural tolerance online sequential extreme learning machine (STOS-ELM) that both must retrain the data in a new class data case. The experimental results show that PSTOS-ELM has accuracy and robustness comparable to ELM and STOS-ELM while also can update new class data immediately.
Electroencephalography-based brain-computer interface using neural networksTELKOMNIKA JOURNAL
This study aimed to develop a brain-computer interface that can control an electric wheelchair using electroencephalography (EEG) signals. First, we used the Mind Wave Mobile 2 device to capture raw EEG signals from the surface of the scalp. The signals were transformed into the frequency domain using fast Fourier transform (FFT) and filtered to monitor changes in attention and relaxation. Next, we performed time and frequency domain analyses to identify features for five eye gestures: opened, closed, blink per second, double blink, and lookup. The base state was the opened-eyes gesture, and we compared the features of the remaining four action gestures to the base state to identify potential gestures. We then built a multilayer neural network to classify these features into five signals that control the wheelchair’s movement. Finally, we designed an experimental wheelchair system to test the effectiveness of the proposed approach. The results demonstrate that the EEG classification was highly accurate and computationally efficient. Moreover, the average performance of the brain-controlled wheelchair system was over 75% across different individuals, which suggests the feasibility of this approach.
Adaptive segmentation algorithm based on level set model in medical imagingTELKOMNIKA JOURNAL
For image segmentation, level set models are frequently employed. It offer best solution to overcome the main limitations of deformable parametric models. However, the challenge when applying those models in medical images stills deal with removing blurs in image edges which directly affects the edge indicator function, leads to not adaptively segmenting images and causes a wrong analysis of pathologies wich prevents to conclude a correct diagnosis. To overcome such issues, an effective process is suggested by simultaneously modelling and solving systems’ two-dimensional partial differential equations (PDE). The first PDE equation allows restoration using Euler’s equation similar to an anisotropic smoothing based on a regularized Perona and Malik filter that eliminates noise while preserving edge information in accordance with detected contours in the second equation that segments the image based on the first equation solutions. This approach allows developing a new algorithm which overcome the studied model drawbacks. Results of the proposed method give clear segments that can be applied to any application. Experiments on many medical images in particular blurry images with high information losses, demonstrate that the developed approach produces superior segmentation results in terms of quantity and quality compared to other models already presented in previeous works.
Automatic channel selection using shuffled frog leaping algorithm for EEG bas...TELKOMNIKA JOURNAL
Drug addiction is a complex neurobiological disorder that necessitates comprehensive treatment of both the body and mind. It is categorized as a brain disorder due to its impact on the brain. Various methods such as electroencephalography (EEG), functional magnetic resonance imaging (FMRI), and magnetoencephalography (MEG) can capture brain activities and structures. EEG signals provide valuable insights into neurological disorders, including drug addiction. Accurate classification of drug addiction from EEG signals relies on appropriate features and channel selection. Choosing the right EEG channels is essential to reduce computational costs and mitigate the risk of overfitting associated with using all available channels. To address the challenge of optimal channel selection in addiction detection from EEG signals, this work employs the shuffled frog leaping algorithm (SFLA). SFLA facilitates the selection of appropriate channels, leading to improved accuracy. Wavelet features extracted from the selected input channel signals are then analyzed using various machine learning classifiers to detect addiction. Experimental results indicate that after selecting features from the appropriate channels, classification accuracy significantly increased across all classifiers. Particularly, the multi-layer perceptron (MLP) classifier combined with SFLA demonstrated a remarkable accuracy improvement of 15.78% while reducing time complexity.
AI in customer support Use cases solutions development and implementation.pdfmahaffeycheryld
AI in customer support will integrate with emerging technologies such as augmented reality (AR) and virtual reality (VR) to enhance service delivery. AR-enabled smart glasses or VR environments will provide immersive support experiences, allowing customers to visualize solutions, receive step-by-step guidance, and interact with virtual support agents in real-time. These technologies will bridge the gap between physical and digital experiences, offering innovative ways to resolve issues, demonstrate products, and deliver personalized training and support.
https://www.leewayhertz.com/ai-in-customer-support/#How-does-AI-work-in-customer-support
Open Channel Flow: fluid flow with a free surfaceIndrajeet sahu
Open Channel Flow: This topic focuses on fluid flow with a free surface, such as in rivers, canals, and drainage ditches. Key concepts include the classification of flow types (steady vs. unsteady, uniform vs. non-uniform), hydraulic radius, flow resistance, Manning's equation, critical flow conditions, and energy and momentum principles. It also covers flow measurement techniques, gradually varied flow analysis, and the design of open channels. Understanding these principles is vital for effective water resource management and engineering applications.
Sri Guru Hargobind Ji - Bandi Chor Guru.pdfBalvir Singh
Sri Guru Hargobind Ji (19 June 1595 - 3 March 1644) is revered as the Sixth Nanak.
• On 25 May 1606 Guru Arjan nominated his son Sri Hargobind Ji as his successor. Shortly
afterwards, Guru Arjan was arrested, tortured and killed by order of the Mogul Emperor
Jahangir.
• Guru Hargobind's succession ceremony took place on 24 June 1606. He was barely
eleven years old when he became 6th Guru.
• As ordered by Guru Arjan Dev Ji, he put on two swords, one indicated his spiritual
authority (PIRI) and the other, his temporal authority (MIRI). He thus for the first time
initiated military tradition in the Sikh faith to resist religious persecution, protect
people’s freedom and independence to practice religion by choice. He transformed
Sikhs to be Saints and Soldier.
• He had a long tenure as Guru, lasting 37 years, 9 months and 3 days
Digital Twins Computer Networking Paper Presentation.pptxaryanpankaj78
A Digital Twin in computer networking is a virtual representation of a physical network, used to simulate, analyze, and optimize network performance and reliability. It leverages real-time data to enhance network management, predict issues, and improve decision-making processes.
Levelised Cost of Hydrogen (LCOH) Calculator ManualMassimo Talia
The aim of this manual is to explain the
methodology behind the Levelized Cost of
Hydrogen (LCOH) calculator. Moreover, this
manual also demonstrates how the calculator
can be used for estimating the expenses associated with hydrogen production in Europe
using low-temperature electrolysis considering different sources of electricity
This presentation is about Food Delivery Systems and how they are developed using the Software Development Life Cycle (SDLC) and other methods. It explains the steps involved in creating a food delivery app, from planning and designing to testing and launching. The slide also covers different tools and technologies used to make these systems work efficiently.
Blood finder application project report (1).pdfKamal Acharya
Blood Finder is an emergency time app where a user can search for the blood banks as
well as the registered blood donors around Mumbai. This application also provide an
opportunity for the user of this application to become a registered donor for this user have
to enroll for the donor request from the application itself. If the admin wish to make user
a registered donor, with some of the formalities with the organization it can be done.
Specialization of this application is that the user will not have to register on sign-in for
searching the blood banks and blood donors it can be just done by installing the
application to the mobile.
The purpose of making this application is to save the user’s time for searching blood of
needed blood group during the time of the emergency.
This is an android application developed in Java and XML with the connectivity of
SQLite database. This application will provide most of basic functionality required for an
emergency time application. All the details of Blood banks and Blood donors are stored
in the database i.e. SQLite.
This application allowed the user to get all the information regarding blood banks and
blood donors such as Name, Number, Address, Blood Group, rather than searching it on
the different websites and wasting the precious time. This application is effective and
user friendly.
Determination of Equivalent Circuit parameters and performance characteristic...pvpriya2
Includes the testing of induction motor to draw the circle diagram of induction motor with step wise procedure and calculation for the same. Also explains the working and application of Induction generator
Accident detection system project report.pdfKamal Acharya
The Rapid growth of technology and infrastructure has made our lives easier. The
advent of technology has also increased the traffic hazards and the road accidents take place
frequently which causes huge loss of life and property because of the poor emergency facilities.
Many lives could have been saved if emergency service could get accident information and
reach in time. Our project will provide an optimum solution to this draw back. A piezo electric
sensor can be used as a crash or rollover detector of the vehicle during and after a crash. With
signals from a piezo electric sensor, a severe accident can be recognized. According to this
project when a vehicle meets with an accident immediately piezo electric sensor will detect the
signal or if a car rolls over. Then with the help of GSM module and GPS module, the location
will be sent to the emergency contact. Then after conforming the location necessary action will
be taken. If the person meets with a small accident or if there is no serious threat to anyone’s
life, then the alert message can be terminated by the driver by a switch provided in order to
avoid wasting the valuable time of the medical rescue team.
2. ISSN: 1693-6930
TELKOMNIKA Vol. 17, No. 1, February 2019: 235-245
236
a virtual music orchestra, and it becomes a motivation in this research to develop an automatic
musical note player for traditional music orchestra called gamelan.
Song in gamelan music known as gendhing consists of a bar sequence. A bar called
gatra contains four beats, in which each beat is initially represented with dot note called pin. The
dot note then can be filled with a pitch or remain in the form of dot note. Rhythm in gamelan
music is represented with a number of notes in a beat, and the speed in playing the musical
note [5]. There are five levels of rhythms in gamelan music, which are lancar (1/1) where a beat
contains one note, tanggung (1/2) where a beat contains two notes, dados (1/4) where a beat
contains four notes, wiled (1/8) where a beat contains eight notes and rangkep (1/16) where a
beat contains sixteen notes. The tempo to play is defined based on a constant interval time
value. For example, the rhythm of lancar, the time interval value is about 1000 ms between two
consecutive beats. Figure 1 shows an illustration of a note sequence consisting of 32 beats in a
gamelan song entitled Suwe Ora Jamu.
Figure 1. Example of a note sequence of a gamelan song
entitled Suwe Ora Jamu
Instead of using a time interval value for the system to play a note, a random technique
parameterized based on an observation on how a gamelan musician playing an instruments
was proposed in this research. This was to obtain natural touch in playing a note sequence as
human does. The parameterization for randomizing time as an approximate time to hit a button
was defined by a set of time range value. Given a name P for a function to hit a button to play a
note in which P identifies a set of time range RT to define an approximate time AT to execute
the play, so the notation is P: RT→AT. Time range RT is defined based on human play using
time-frequency analysis and peak detection.
Time-frequency analysis is commonly used for music signal problems, such as to
analyze the sound of musical instruments [6], to analyze the interaction of musical rhythm with
melodic structure [7], to analyze various frequency representation of the tone of a traditional
instrument from Nusa Tenggara Timur, Indonesia called gong [8]. Time-frequency analysis
known as spectrograms handles changes in frequency content over time, provides a
time-frequency portrait of musical sounds, can describe quick transitions between notes
problems [6]. The function in fourier analysis can be represented in both time and frequency
domain known as time-frequency analysis [9, 10]. Fourier transform and peak detection were
used in time-frequency analysis by [11] for audio segmentation. Peak detection is to identify a
time value when human hitting an instrument button. Tempo and beat estimation research was
conducted by [12-14]. Spectral analysis, spectral energy flux and peak detection function were
used by [12], while k-NN regression was used by [13] and utilization of harmonic separation and
periodicity analysis were used by [14] to estimate tempo and beat. These researches aim to
define a tempo and to track beats of a music audio source, in which the result can be used to
identify a music genre. The methods used in these researches were quite similar with the
procedures needed in this work. In this work, tempo and beat estimation were used to observe
the way a musicion estimating a tempo in playing a music instrument. Fourier transform and
peak detection used in time-frequency analysis by [11, 12] for audio segmentation were used in
this research. Peak detection is to identify a time value when human hitting an instrument
button, and then the time value was calculated based on a time target to set a tolerated range of
time (RT) used for randomizing approximate time (AT) to play a note.
2. Research Method
Natural automatic musical notes player was developed by analyzing the way a musician
estimating execution time referred to the time target of the tempo. Execution time estimation
3. TELKOMNIKA ISSN: 1693-6930
Natural automatic musical note player using time-frequency... (Khafiizh Hastuti)
237
was represented to approximate time containing a time range value which can be tolerated by
time target of the tempo, and the program executes the play of a music instrument in an
approximate time. An illustration of a relation between notes sequence NS containing notes S,
time target TT containing time value T, and execution time ET containing time value E is shown
by Figure 2. The first note (S1) has time target first time target value (T1), and the first execution
time (E1) will have time value which is to T1 more or less, and so on. Further, the value of ET
was used to define approximate time AT.
Figure 2. Illustration of relation between notes sequence, time target,
execution time, and approximate time in playing music instrument
Execution time was measured by analyzing recording audio of musicians playing a
music instrument. The problem in this phase came up with selection of a method to accurately
identify execution time of keystroke of hitting an instrument button. The analysis was to obtain
data of pitch and time of musical notes played by a musician. Time-frequency analysis was
used to accurately find execution time of an interaction of a musician and his instrument music
in playing notes sequence. Time-frequency analysis is a method that represents visualization of
sound in form of spectrogram. In musical sound analysis domain, spectrogram describes
transition between notes including its time. Figure 3 shows the model diagram proposed to
develop natural automatic musical notes player.
Figure 3. Model diagram of the development of automatic
musical notes player
4. ISSN: 1693-6930
TELKOMNIKA Vol. 17, No. 1, February 2019: 235-245
238
The experiment in this research was conducted by developing an automatic gamelan
musical note player limited into the rhythm of lancar and melodic abstraction note played using
melodic abstraction instruments such as saron, demung, peking and slenthem. The proposed
method was divided into data acquisition, pitch and time analysis and implementation.
2.1. Data Acquisition
The experiment used instrument of gamelan music for developing automatic musical
notes player. There are two types of musical scale in gamelan music, which are slendro which
consists of five pitches of (1, 2, 3, 5, 6) and pelog which consist of seven pitches
of (1, 2, 3, 4, 5, 6, 7). The frequency of pitches of these types of musical scale is different, and
the frequency of pitches of gamelan music is different with that in western music. A set of
gamelan orchestra can contain both gamelan slendro and gamelan pelog, or only one of these
types. Both types of gamelan musical scale consist of instruments classified based on their
function, which are a group of instruments that plays notes sequence of melody skeleton of
songs, such as demung, saron, peking, and slenthem; a group of instruments that plays notes
of songs, such as gender and rebab; a group of instruments that plays notes which have
function to define the structure of a song, such as kenong and gong.
The development of automatic musical notes player was limited to demung, an
instrument of gamelan pelog that plays notes sequence of melody skeleton of songs. Demung is
a type of xylophone instrument that has a register containing range of notes which
are (1, 2, 3, , 5, 6, 7). Figure 4 shows an illustration of demung.
Figure 4. Demung, a gamelan music instrument used to play
notes sequences of melody skeleton
Notes sequence of a melody skeleton are played in a constant interval time, such as a
beat per second for slow tempo, or a beat per half a second for tempo that is faster than slow.
For an example, a song entitled “Suwe Ora Jamu” (Figure 1) consists of 32 beats filled with dot
notes and number notes. The use of slow tempo with one beat per second defines the first beat
filled with a dot note has time target at 1
st
second, the second beat filled with a note of 2 has
time target at 2
nd
seconds, the third beat filled with a dote note has time target at 3
rd
seconds,
the fourth beat filled with a note of 3 has time target at 4
th
seconds, and so on. This makes time
target to play all beats in the song is 32 seconds.
Data acquisition was to obtain audio signal data of gamelan song played by a musician
in the rhythm of lancar where the time interval value is about 1000ms between two consecutive
beats. The acquisition was conducted by recording a gamelan musician playing saron, one of
melodic abstraction instruments. The play was recorded into .wav file format with sample rate of
48 khz and 16 bit mono. The musician played a note sequence of five gamelan songs collected
from www.gamelanbvg.com. The gamelan songs used as dataset were described in Table 1.
The musical note structure of gamelan song can be seen in Figure 5 that depicts the musical
note of the first song in the dataset.
Table 1. Dataset of Gamelan Songs
No Song Title Number of Beats
1 Suwe Ora Jamu 32
2 Kebo Giro 80
3 Gugur Gunung 64
4 Lasem 64
5 Manyarmilar 72
5. TELKOMNIKA ISSN: 1693-6930
Natural automatic musical note player using time-frequency... (Khafiizh Hastuti)
239
2.2. Pitch and Time Analysis
Time interval value was used as target time for a gamelan musician knowing when a
button should be hit to play a note. Human plays it based on intuition or musical sense. This
behavior show the way a musician estimating time to play in an approximate time which is as
close as to the target time. Time-frequency analysis was used to identify the approximate time
from a gamelan musician while playing a note sequence. The analysis was conducted by
performing fast fourier transform (FFT) technique to remove noise, and then followed by
performing peak detection technique to find the exactly time value of an approximate time from
the gamelan musician play. The distance of execution time resulted from peak detection to the
time target was measured to set range time used to parameterize a random value for
approximate time value. Figure 5 shows an example of data input and data after noise removal
using FFT which are then used to identify peaks values.
(a)
(b)
Figure 5. Signal data input (a), after noise removal using FFT (b)
The distance of execution time resulted from peak detection to the time target was
measured to set range time used to parameterize a random value for approximate time value.
For example, given a time interval value at 1000ms (rhythm of lancar) and the play was started
from time value at 0, so the target time for each note can be formulated as:
𝑇𝑇 = ∑ [𝑇𝐼 𝑥 𝐾]
𝑘=𝐿−1
𝑘=0
6. ISSN: 1693-6930
TELKOMNIKA Vol. 17, No. 1, February 2019: 235-245
240
where:
TT=Time target
TI=Time interval
L=Number of beats
Execution time was identified based on the peak value resulted from peak detection.
The formula to find execution time value is shown as follow:
𝐸𝑇 = ∑ [𝑇𝑇𝑘 − 𝑃𝑇𝑘]
𝑘=𝐿−1
𝑘=0
where:
ET=Execution time
PT=Peak time
L=Number of beats
Time range setting was used as parameters to random a time value to play a note. The
data of execution time are sorted to find the lowest and the highest approximate time value. The
lowest value will be negative value when the gamelan musician playing a note before target
time. The highest value will be positive value when a gamelan musician playing a note after
target time. The lowest and highest execution time value were used to parameterize a random
value for the system playing a note in every time target, in which each value of time target was
summed with the lowest value and the highest value. Below is the formula of time range setting.
𝐷𝑆 = 𝑆𝑜𝑟𝑡 (𝐸𝑇)
𝑅𝑇 = ∑ [(𝑇𝑇𝑘 + 𝐷𝑆[0]), (𝑇𝑇𝑘 + 𝐷𝑆[𝐿 − 1])]
𝑘=𝐿−1
𝑘=0
𝐴𝑇 = 𝑅𝑎𝑛𝑑𝑜𝑚 (𝑅𝑇)
where:
DS=Data sorted from ET
RT=Random value parameter.
AT=Approximate time
For example, given a time interval value is at 1000 ms, and the approximate time to
play a note in the second segment is:
TI=1000ms.
ET=[0, 11,-31, 150, 87].
DS=[-31, 0, 11, 87, 150].
TT3=2000ms.
RT3=((2000+ (-31)), (2000 + 150))
AT3=Random (1969, 2150)
Based on the example above, the approximate time AT to play the third note in a note
sequence with time target of 2000 ms is a time in a time range RT of 1969 to 2150 ms, thus the
system will randomize the approximate time value based on this time range.
The formulas above were implemented in the experiment. A gamelan musician who has
27 years experience in playing gamelan instruments was selected to play note sequences of
five gamelan songs in the form of melodic abstraction using a melodic abstraction instrument
called saron. The play of each song was separately recorded, and then the data were analyzed
using Matlab to conduct time-frequency analysis and peak detection. Table 2 shows the result
of time-frequency analysis for five songs used in the dataset in which each song was played in
the rhythm of lancar with time interval for two consecutive beats at 1000 ms. The columns in the
table are labeled with GS which stands for index of gamelan songs in the dataset, IN for index
of notes, NS for note sequence, TT for time target, PT for peak time and ET for execution time.
There were 312 peaks values of execution time obtained from five songs. The
execution time values from all samples were concatenated and sorted to find lowest and highest
values of execution time as seen below with DA stands for result of data of time-frequency
analysis, DC stands for result of data concatenation, and DS stands for result of data sorting.
7. TELKOMNIKA ISSN: 1693-6930
Natural automatic musical note player using time-frequency... (Khafiizh Hastuti)
241
Table 2. Result of Time-frequency Analysis
GS IN NS
TT PT
ET
(TT-PT)
(in milliseconds)
1 1 . 0 0 0
2 2 1000 985 15
3 . 2000 2236 -236
… … … … …
32 6 32000 32378 -378
2 1 . 0 0 0
2 6 1000 1074 -74
3 . 2000 2133 -133
… … … … …
80 5 80000 79987 13
3 1 . 0 0 0
2 6 1000 1142 -142
3 . 2000 1947 53
… … … … …
64 2 64000 63892 108
4 1 3 0 0 0
2 5 1000 891 109
3 6 2000 2212 -212
… … … … …
64 6 64000 64231 -231
5 1 . 0 0 0
2 2 1000 1081 -81
3 . 2000 2143 -212
… … … … …
72 7 72000 71967 33
DA = [ [0, 15, -236, …, -378] [0, -74, -133, …, 13] [0, -174, -142, …, 108] [0, 109,
-212, …, -231] [0, -81, -212, …, 33] ]
DC = [0, 15, -236, …, -378, 0, -74, -133, …, 13, 0, -174, -142, …, 108, 0, 109, -212,
…, -231, 0, -81, -212, …, 33]
DS = [-328, …, 0, …, 246]
The result of the lowest value was -328, and the highest value was 246. The result was
used for the range time as parameters to randomize approximate time for playing a note
sequence.
2.3. System Design
An automatic musical notes player system was developed to play notes sequence of
melody skeleton using demung instrument of gamelan music. Notes sequences data were
inputted into the system. The system works by playing a note sequence data with output of
sound and animation of a note selected by order. Figure 6 shows workflow diagram of automatic
musical notes player system.
Automatic musical notes player system was developed on mobile application platform.
System development was divided into four phases, which were note sequence audio-visual,
notes sequences data input, programming and testing, and interface design. Each phase was
described in following discussions. Outputs of automatic musical notes player were sound of
notes and its visual displayed using its instrument button picture. Sounds of seven demung
instrument buttons were separately recorded and saved into .wav format file, which were:
sound1.wav, sound2.wav, sound3.wav, sound4.wav, sound5.wav, sound6.wav and
sound7.wav.
The instrument images were manipulated by adding its mallet picture to visualize the
play of notes sequence. There were eight pictures to visualize execution of silent/dot note, and
note 1 until note 7. The mallet was positioned in the bottom-center of the instrument to visualize
silent and execution of dot notes, while for other notes visualization, the mallet was positioned
above the each instrument button. Each picture was saved into .png format file and labeled with
pic0.png, pic1.png, pic2.png, pic3.png, pic4.png, pic5.png, pic6.png, and pic7.png. Figure 7
shows some examples of pictures of instrument buttons visualization.
8. ISSN: 1693-6930
TELKOMNIKA Vol. 17, No. 1, February 2019: 235-245
242
Figure 6. Workflow diagram of automatic musical notes player systems
Figure 7. Instrument buttons visualization with
(a) for silent and dot note visualization, (b) for note 1 execution, (c) for note 4 execution
Sounds of notes of instrument buttons and pictures of instrument buttons visualization
were embedded into the program. The program will call each sound and each picture according
to a note which was played on the execution time. The program was designed to automatically
play notes sequences inputted into the system. Total 30 notes sequences of melody skeleton
including notes sequences used as dataset were used for the songs library in the program. All
notes sequences were formatted as array data, and the dot notation was translated into value 0.
Figure 8 shows the illustration of a notes sequence input formatted as array data.
Figure 8. Note sequence formatted as array data
9. TELKOMNIKA ISSN: 1693-6930
Natural automatic musical note player using time-frequency... (Khafiizh Hastuti)
243
Array data of notes sequences used in songs library of the program were described in
Table 3.
Table 3. Notes Sequences Data Set
ID Notes Sequences
ID_1 [0, 2, 0, 3, 0, 2, 0, 3, 0, 1, 0, 2, 0, 3, 0, 2, 0, 3, 0, 5, 0, 6, 0, 5, 0, 4, 0, 2, 0, 1, 0, 6]
ID_2
[0, 6, 0, 5, 0, 3, 0, 2, 0, 3, 0, 2, 0, 6, 0, 5, 0, 6, 0, 5, 0, 3, 0, 2, 0, 3, 0, 2, 0, 6, 0, 5,
0, 6, 0, 5, 0, 6, 0, 7, 0, 6, 0, 7, 0, 6, 0, 5, 0, 6, 0, 5, 0, 6, 0, 7, 0, 6, 0, 7, 0, 6, 0, 5, 0,
7, 0, 6, 0, 3, 0, 2, 0, 3, 0, 2, 0, 6, 0, 5]
ID_3
[0, 6, 0, 7, 0, 6, 0, 7, 0, 3, 0, 5, 0, 7, 0, 6, 0, 2, 0, 7, 0, 2, 0, 7, 0, 6, 0, 5, 0, 2, 0, 3,
0, 5, 0, 6, 0, 5, 0, 6, 0, 2, 0, 3, 0, 6, 0, 5, 0, 2, 0, 3, 0, 2, 0, 3, 0, 6, 0, 5, 0, 3, 0, 2]
ID_4
[3, 5, 6, 5, 3, 5, 3, 2, 5, 3, 2, 3, 5, 6, 7, 6, 3, 5, 6, 5, 3, 5, 3, 2, 5, 3, 2, 3, 5, 6, 7, 6,
0, 2, 6, 2, 2, 6, 0, 2, 6, 0, 2, 6, 3, 2, 7, 6, 0, 2, 6, 0, 2, 6, 0, 2, 6, 0, 2, 6, 3, 2, 7, 6]
ID_5
[0, 2, 0, 7, 0, 2, 0, 6, 0, 2, 0, 6, 0, 2, 0, 7, 0, 2, 0, 7, 0, 2, 0, 6, 0, 3, 0, 5, 0, 3, 0, 2,
0, 3, 0, 2, 0, 3, 0, 5, 0, 7, 0, 6, 0, 3, 0, 2, 0, 6, 0, 6, 0, 6, 0, 6, 0, 0, 0, 0, 3, 5, 6, 7]
Phase of programming and testing was conducted by writing codes for automatic
musical notes player program application, and testing to make sure that there were no errors
found in the program, and all functions in the program can properly works. Below is the
algorithm to automatically play a notes sequence:
// Songs library = [id_1, id_2, …, id_30];
// Sounds library = [sound1.wav, sound2.wav, …, sound7.wav]
// Pictures library = [pic0.png, pic1.png, …, pic7.png];
// Tempo = 1000 ms;
// Timer = 0 millisecond
(1) Select a notes sequence from songs library.
(2) Set duration by multiplying number of notes in selected notes sequence with tempo.
(4) Set execution time for all notes by randomizing approximate time value.
(5) Set timer to 0 milliseconds.
(6) Play sound and display picture for every note by order in selected notes sequence when
timer reach time value of execution time data.
(7) Stop the play when timer value equals to duration value.
3. Results and Analysis
An automatic gamelan musical note player mobile application was developed using
Adobe Flash program. The interface design of automatic musical notes player consisted of two
main parts, which were songs library which contained title songs and its notes sequence, and
play section which automatically presented the play of selected musical notes sequence
including outputs of sound and instrument visualization. Figure 9 shows the interface design of
automatic musical notes player for gamelan instrument named demung.
Figure 9. Interface design, (a) songs library (b) automatic play
10. ISSN: 1693-6930
TELKOMNIKA Vol. 17, No. 1, February 2019: 235-245
244
For an addition, the program was designed to display a note sequence data and play it
instrument animation and sounds as outputs, and three melodic abstraction instruments were
used in the program which were saron, demung and slentem. Figure 10 shows the screenshot
of the automatic gamelan musical note player application.
Figure 10. Screenshot of automatic gamelan musical note player program
The evaluation was simply conducted by recording the automatic play by the program
using note sequences of five song samples. The recording audio files were analyzed using
procedures implemented in the phase of data analysis which were time-frequency analysis and
peak detection. The expectation was that all notes were played by the program in approximate
time as the range time obtained from observing a gamelan musician play which were random
value between (-328, 246) as addition for time target value. The result shows that all notes were
played in approximate time as expected. Table 4 shows the execution times in playing the
instrument by the program developed in this research denoted with Program I, and by the
program developed by [4] denoted as Program II as the comparation. The columns in the table
are labeled with IN which stands for index of notes, NS for note sequence, TT for time target
and ET for execution time.
Table 4. Example of Execution Time Results for Suwe Ora Jamu
IN NS TT
RT
Random
(-328, 246)
ET
Program I
(TT + RT)
Program II
(TT)
1 . 0 151 151 0
2 2 1000 -302 698 1000
3 . 2000 -153 1847 2000
4 3 3000 -114 2886 3000
5 . 4000 -155 3845 4000
6 2 5000 -25 4975 5000
7 . 6000 51 6051 6000
8 3 7000 -111 6889 7000
… … … … … …
32 6 31000 124 31124 31000
Another evaluation was conducted by asking three experts to compare the 5 plays
recorded by program I (the program developed in this research) with the plays recorded by
Program II (Smart Gamelan Player developed by [4]). The task for the experts was to choose
the plays that sound natural. All experts stated that the plays recorded by Program II were not
natural and all plays were straight and very clean since all the instruments exactly play a note in
the same time, while all plays recorded by Program I sound more natural.
4. Conclusion
This research aims to develop an automatic gamelan musical note player that can
naturally play a musical note as human does. A function P to hit a button to play a note in a
11. TELKOMNIKA ISSN: 1693-6930
Natural automatic musical note player using time-frequency... (Khafiizh Hastuti)
245
tolerated time was formulated into P: RT→AT. The value of time range (RT) was defined based
on human play using time-frequency analysis. The time range was used as parameters to
random approximate time for playing a note. After observing plays by a musician and
conducting pitch and time analysis, the approximate time AT value was a random value of time
range RT which was a value between-328 until 246 milliseconds from the time target value. An
automatic gamelan musical note player program was developed based on this function. The
evaluation shows that the program played all note sequences in the approximate time as human
does. Based on the expert opinions, the program developed in this research played sounds
more natural and better than Smart Gamelan Player developed by [4].
For the next works, the results of this research will be used to develop a model of
musical notes reader, a program that can read play a musical sheet, and to develop an
interactive music instrument program which can detect and measure the tempo of the play by
the user.
References
[1] S Keith, Bridging The Gap: Thoughts On Computer Music And Contemporary (Popular)
Electronic Music. Proceedings of the 2010 Australasian Computer Music Conference,
Canberra. 2010: 37-42.
[2] D Keislar. A historical view of computer music technology. In: R.T. Dean. The Oxford
Handbook of Computer Music. New York: Oxford University Press. 2009: 11-43.
[3] C Dannen, iPhone Design Award Winning Project, Apress, New York. 2009
[4] K Hastuti, AZ Fanani, AM Syarif. Virtual Player of Melodic Abstraction Instruments for
Automatic Gamelan Orchestra, 2
nd
International Conference on Science in Information
Technology (ICSITech), Balikpapan. 2016: 30-34.
[5] Martopangrawit, Pengetahuan Karawitan, ASKI, Surakarta. 1969.
[6] JF Alm, JS Walker, Time-Frequency Analysis of Musical Instruments, Society for Industrial
and Applied Mathematics (SIAM). 2002; 44 (3): 457-476.
[7] X Cheng, JV Hart, and JS Walker. Timefrequency analysis of musical rhythm, Notices of
the American Mathematical Society. 2009; 56:344-60, available at
http://www.ams.org/notices/200903/
[8] YCH Siki, NMR Mamulak. Time-frequency analysis on gong timor music using short-time
fourier transform and continuous wavelet transform. International Journal of Advances in
Intelligent Informatics. 2017; 3 (3): 146-153.
[9] N Lenssen, D Needell. An Introduction to Fourier Analysis with Applications to Music,
Journal of Humanistic Mathematics. 2014; 4 (1): 72-89.
[10] A Degani, M Dalai, R Leonardi, P Migliorati, Time-Frequency Analysis of Musical Signals
Using the Phase Coherence, Proc. of the 16
th
Int. Conference on Digital Audio Effects
(DAFx-13), Maynooth. 2013: 2-5.
[11] C Meyer, M Spiertz. Audio Segmentation Using Different Time-Frequency Representations,
Poster 2008: 12
th
International Student Conference on Electrical Engineering, Prague.
2008: 1-5.
[12] Miquel Alonso, Bertrand David, Gael Richard, Tempo and Beat Estimation of Musical
Signals, ISMIR 2004 5
th
International Conference on Music Information Retrieval,
Barcelona. 2004: 32-37.
[13] Antti Eronen and Anssi Klapuri. Music Tempo Estimation with K-NN Regression, IEEE
Transactions on Audio, Speech, and Language Processing. 2010; 18 (1): 50-57.
[14] Aggelos Gkiokas, Vassilis Katsouros, George Carayannis and Themos Stafylakis. Music
Tempo Estimation and Beat Tracking by Applying Source Separation and Metrical
Relations. 2012 IEEE International Conference on Acoustics, Speech and Signal
Processing (ICASSP), Kyoto. 2012: 421-424.