音頻分類的學習

1.深度學習PyTorch入門-語音分類

https://blog.csdn.net/sinat_41787040/article/details/129795496

https://github.com/musikalkemist/pytorchforaudio

https://github1s.com/musikalkemist/pytorchforaudio/blob/main/04%20Creating%20a%20custom%20dataset/urbansounddataset.py

https://github.com/musikalkemist/DeepLearningForAudioWithPython

https://github.com/musikalkemist/AudioSignalProcessingForML

https://github.com/musikalkemist/generating-sound-with-neural-networks

https://github.com/musikalkemist/Deep-Learning-Audio-Application-From-Design-to-Deployment

https://github.com/yaokaishile/AudioClassifier

GitHub - carl03q/AudioClassifier: A CNN audio classifier via spectrogram images.

2.基于Pytorch實現聲音的分類

https://blog.csdn.net/weixin_45277161/article/details/135185613

https://blog.csdn.net/qq_33200967/article/details/119830756

https://github.com/yeyupiaoling/AudioClassification-Pytorch

3.基于深度學習的各種音頻分類pytorch實戰

https://blog.csdn.net/weixin_45717863/article/details/136237419

https://github.com/ethanyhzhang/AudioClassification-Pytorch

4.Urbansound8k聲音分類深度學習實戰

GitHub - musikalkemist/pytorchforaudio: Code for the "PyTorch for Audio + Music Processing" series on The Sound of AI YouTube channel.

pytorchforaudio/09 Training urban sound classifier at main · musikalkemist/pytorchforaudio · GitHub

UrbanSound8K數據集

#wget命令下載

wget https://zenodo.org/record/1203745/files/UrbanSound8K.tar.gz

5.語音識別 - PyTorch實現

https://juejin.cn/post/7107844968957345805

6.音頻識別(Audio Classification)學習筆記

https://blog.csdn.net/Killer_kali/article/details/122798462

7.基于Pytorch的聲音事件檢測分類系統

https://blog.csdn.net/weixin_43509698/article/details/145432599

https://github.com/Shybert-AI/AudioClassificationModelZoo-Pytorch

8.使用PyTorch進行城市聲音分類

https://blog.csdn.net/gitblog_00081/article/details/139518564

https://github.com/ksanjeevan/crnn-audio-classification

9.torch入門：音頻分類任務(pytorch+librosa)

https://blog.csdn.net/Ceverymxt7/article/details/146146361

10.PyTorch音頻分類實戰（完整代碼）

https://blog.csdn.net/SoulmateY/article/details/143689699

https://blog.csdn.net/python1234_/article/details/145307536

https://zhuanlan.zhihu.com/p/6235637735

Github：github.com/Zeyi-Lin/Py…
數據集：pan.baidu.com/s/14CTI_9MD…?提取碼: 1a9e
SwanLab實驗日志：swanlab.cn/@ZeyiLin/Py…
更多實驗日志：swanlab.cn/@ZeyiLin/Py…

Github：https://github.com/Zeyi-Lin/PyTorch-Audio-Classification
數據集：https://pan.baidu.com/s/14CTI_9MD1vXCqyVxmAbeMw?pwd=1a9e 提取碼: 1a9e
SwanLab實驗日志：https://swanlab.cn/@ZeyiLin/PyTorch_Audio_Classification-simple/charts
更多實驗日志：https://swanlab.cn/@ZeyiLin/PyTorch_Audio_Classification/charts

11.使用深度學習進行音頻分類的端到端示例和解釋

https://blog.csdn.net/deephub/article/details/114997017

12.深度學習實現50種環境聲音分類

https://blog.csdn.net/Mind_programmonkey/article/details/121024085

13.esc50

https://github.com/PaddlePaddle/PaddleSpeech/tree/develop/examples/esc50

https://github.com/PaddlePaddle/PaddleSpeech/blob/develop/docs/tutorial/cls/cls_tutorial.ipynb

14.CBAM與DenseNet121融合進行鳥類聲音識別

https://blog.csdn.net/weixin_34945060/article/details/143473683

https://github.com/CarrieX6/-Xeno-Canto-

融合了CBAM與DenseNet121，使用中心損失函數及鳥聲融合特征進行數據預處理、鳥聲識別等任務

15.基于梅爾頻譜的信號分類和識別

https://github.com/Friedrich-M/Audio-signal-classification-and-identification

數據集Urbansound8K

Urbansound8K是目前應用較為廣泛的用于自動城市環境聲分類研究的公共數據集，包含10個分類：空調聲、汽車鳴笛聲、兒童玩耍聲、狗叫聲、鉆孔聲、引擎空轉聲、槍聲、手提鉆、警笛聲和街道音樂聲。
數據集下載

16.使用BirdNET-V2.4訓練自己的數據集

https://blog.csdn.net/weixin_53238517/article/details/144215320

https://github.com/birdnet-team/BirdNET-Analyzer/tree/training-with-test-data

https://github.com/birdnet-team/BirdNET-Analyzer

https://github.com/joeweiss/birdnetlib

15.鳥音頻數據文件下載資源

https://hf-mirror.com/datasets/sakei/Bird_audio_in_China

https://hf-mirror.com/datasets/Trelis/bird-songs

https://github.com/DBD-research-group/BirdSet

A benchmark dataset collection for bird sound classification

https://hf-mirror.com/datasets/DBD-research-group/BirdSet

從huggingface上下載數據集具體步驟_huggingface 下載數據集-CSDN博客

eg:

? ? ? git clone https://hf-mirror.com/datasets/DBD-research-group/BirdSet

? ? ? git clone https://hf-mirror.com/datasets/sakei/Bird_audio_in_China

? ? ? git clone https://hf-mirror.com/datasets/Trelis/bird-songs

https://zhuanlan.zhihu.com/p/701195734

使用ffmpeg提取視頻中的音頻并保存為單聲道wav_ffmpeg wav-CSDN博客

https://zhuanlan.zhihu.com/p/452368276

birdData: 大批量，多線程下載鳥聲文件

https://github.com/realzza/xenopy/tree/birdData

https://github.com/AgaMiko/xeno-canto-download

GitHub - AgaMiko/xeno-canto-download: Easy and ready-to-use script to download bird sound files and jsons from www.xeno-canto.org based on search terms.

鳥綱 Aves|iPlant 植物智——植物物種信息系統

本文來自互聯網用戶投稿，該文觀點僅代表作者本人，不代表本站立場。本站僅提供信息存儲空間服務，不擁有所有權，不承擔相關法律責任。
如若轉載，請注明出處：http://www.pswp.cn/bicheng/80976.shtml
繁體地址，請注明出處：http://hk.pswp.cn/bicheng/80976.shtml
英文地址，請注明出處：http://en.pswp.cn/bicheng/80976.shtml

如若內容造成侵權/違法違規/事實不符，請聯系多彩編程網進行投訴反饋email:809451989@qq.com，一經查實，立即刪除！