1.深度學習PyTorch入門-語音分類
https://blog.csdn.net/sinat_41787040/article/details/129795496
https://github.com/musikalkemist/pytorchforaudio
https://github1s.com/musikalkemist/pytorchforaudio/blob/main/04%20Creating%20a%20custom%20dataset/urbansounddataset.py
https://github.com/musikalkemist/DeepLearningForAudioWithPython
https://github.com/musikalkemist/AudioSignalProcessingForML
https://github.com/musikalkemist/generating-sound-with-neural-networks
https://github.com/musikalkemist/Deep-Learning-Audio-Application-From-Design-to-Deployment
https://github.com/yaokaishile/AudioClassifier
GitHub - carl03q/AudioClassifier: A CNN audio classifier via spectrogram images.
2.基于Pytorch實現聲音的分類
https://blog.csdn.net/weixin_45277161/article/details/135185613
https://blog.csdn.net/qq_33200967/article/details/119830756
https://github.com/yeyupiaoling/AudioClassification-Pytorch
3.基于深度學習的各種音頻分類pytorch實戰
https://blog.csdn.net/weixin_45717863/article/details/136237419
https://github.com/ethanyhzhang/AudioClassification-Pytorch
4.Urbansound8k聲音分類深度學習實戰
GitHub - musikalkemist/pytorchforaudio: Code for the "PyTorch for Audio + Music Processing" series on The Sound of AI YouTube channel.
pytorchforaudio/09 Training urban sound classifier at main · musikalkemist/pytorchforaudio · GitHub
UrbanSound8K數據集
#wget命令下載
wget https://zenodo.org/record/1203745/files/UrbanSound8K.tar.gz
5.語音識別 - PyTorch實現
https://juejin.cn/post/7107844968957345805
6.音頻識別(Audio Classification)學習筆記
https://blog.csdn.net/Killer_kali/article/details/122798462
7.基于Pytorch的聲音事件檢測分類系統
https://blog.csdn.net/weixin_43509698/article/details/145432599
https://github.com/Shybert-AI/AudioClassificationModelZoo-Pytorch
8.使用PyTorch進行城市聲音分類
https://blog.csdn.net/gitblog_00081/article/details/139518564
https://github.com/ksanjeevan/crnn-audio-classification
9.torch入門:音頻分類任務(pytorch+librosa)
https://blog.csdn.net/Ceverymxt7/article/details/146146361
10.PyTorch音頻分類實戰(完整代碼)
https://blog.csdn.net/SoulmateY/article/details/143689699
https://blog.csdn.net/python1234_/article/details/145307536
https://zhuanlan.zhihu.com/p/6235637735
- Github:github.com/Zeyi-Lin/Py…
- 數據集:pan.baidu.com/s/14CTI_9MD…?提取碼: 1a9e
- SwanLab實驗日志:swanlab.cn/@ZeyiLin/Py…
- 更多實驗日志:swanlab.cn/@ZeyiLin/Py…
Github:https://github.com/Zeyi-Lin/PyTorch-Audio-Classification
數據集:https://pan.baidu.com/s/14CTI_9MD1vXCqyVxmAbeMw?pwd=1a9e 提取碼: 1a9e
SwanLab實驗日志:https://swanlab.cn/@ZeyiLin/PyTorch_Audio_Classification-simple/charts
更多實驗日志:https://swanlab.cn/@ZeyiLin/PyTorch_Audio_Classification/charts
11.使用深度學習進行音頻分類的端到端示例和解釋
https://blog.csdn.net/deephub/article/details/114997017
12.深度學習實現50種環境聲音分類
https://blog.csdn.net/Mind_programmonkey/article/details/121024085
13.esc50
https://github.com/PaddlePaddle/PaddleSpeech/tree/develop/examples/esc50
https://github.com/PaddlePaddle/PaddleSpeech/blob/develop/docs/tutorial/cls/cls_tutorial.ipynb
14.CBAM與DenseNet121融合進行鳥類聲音識別
https://blog.csdn.net/weixin_34945060/article/details/143473683
https://github.com/CarrieX6/-Xeno-Canto-
融合了CBAM與DenseNet121,使用中心損失函數及鳥聲融合特征進行數據預處理、鳥聲識別等任務
15.基于梅爾頻譜的信號分類和識別
https://github.com/Friedrich-M/Audio-signal-classification-and-identification
數據集Urbansound8K
Urbansound8K
是目前應用較為廣泛的用于自動城市環境聲分類研究的公共數據集, 包含10個分類:空調聲、汽車鳴笛聲、兒童玩耍聲、狗叫聲、鉆孔聲、引擎空轉聲、槍聲、手提鉆、警笛聲和街道音樂聲。- 數據集下載
16.使用BirdNET-V2.4訓練自己的數據集
https://blog.csdn.net/weixin_53238517/article/details/144215320
https://github.com/birdnet-team/BirdNET-Analyzer/tree/training-with-test-data
https://github.com/birdnet-team/BirdNET-Analyzer
https://github.com/joeweiss/birdnetlib
15.鳥音頻數據文件下載資源
https://hf-mirror.com/datasets/sakei/Bird_audio_in_China
https://hf-mirror.com/datasets/Trelis/bird-songs
https://github.com/DBD-research-group/BirdSet
A benchmark dataset collection for bird sound classification
https://hf-mirror.com/datasets/DBD-research-group/BirdSet
從huggingface上下載數據集具體步驟_huggingface 下載數據集-CSDN博客
eg:
? ? ? git clone https://hf-mirror.com/datasets/DBD-research-group/BirdSet
? ? ? git clone https://hf-mirror.com/datasets/sakei/Bird_audio_in_China
? ? ? git clone https://hf-mirror.com/datasets/Trelis/bird-songs
https://zhuanlan.zhihu.com/p/701195734
使用ffmpeg提取視頻中的音頻并保存為單聲道wav_ffmpeg wav-CSDN博客
https://zhuanlan.zhihu.com/p/452368276
birdData: 大批量,多線程下載鳥聲文件
https://github.com/realzza/xenopy/tree/birdData
https://github.com/AgaMiko/xeno-canto-download
GitHub - AgaMiko/xeno-canto-download: Easy and ready-to-use script to download bird sound files and jsons from www.xeno-canto.org based on search terms.
鳥綱 Aves|iPlant 植物智——植物物種信息系統