The dataset is complex. The dataste used is combination of free available datasets in the internet, dataset provided by research papers, data collected by my team and some contribution from public. The audio data present in the dataset is not enough. So we used VAE for data augmentation. The new augmented data contains melspectrogram representation of all classes with required number of samples. Dataset will be provided on request.