拔了牙吃什么消炎药

Question

I'm struggling to find a reliable method to estimate the quality of harmonic vocalizations in African penguins, specifically I'm interested in the 'b' syllable: the longest syllables in their Ecstatic Display Song (EDS), which is composed of multiple elements.

I’m working with recordings like the ones attached and want to assess the relative quality of different calls. I initially tried using Signal-to-Noise Ratio (SNR), but I'm encountering issues because the strong harmonic structure of the calls fills the spectrogram, leaving little to no background for a clean noise estimate.

I’ve attempted to extract a background segment (e.g., 0.2 seconds before the vocalization), but faced two main problems:

Overlapping signals: other syllables often occur just before or after the target call, contaminating the background estimate and leading to unreliable SNR values.
Fixed-window CNN detections: I also need to assess the quality of calls detected by a CNN model, but these detections are based on fixed-length windows that are not always well-centered on the syllable, making it harder to define signal and noise regions precisely.

Has anyone dealt with a similar issue when working with harmonic-rich signals? I’d really appreciate suggestions for alternative metrics or approaches. I’ve also considered using harmonicity-based measures, such as the Harmonic-to-Noise Ratio (HNR), but I’m not convinced it's the right metric in this context. The problem is that even faint vocalizations (e.g., low amplitude calls) can still produce high HNR values if they are relatively clean, while stronger calls that are slightly masked by background noise might score lower, so the metric doesn't seem to reliably reflect perceived call quality or prominence.

Any ideas or recommendations on how to better capture the signal quality or salience would be greatly appreciated!

Thanks in advance!

WMXZ · Accepted Answer · 2025-08-07 18:26:48Z

3

Maybe there is indeed an alternative approach to these harmonic rich signals.

Very often these type of spectrograms are a product (an artefact) of the spectrogram processing. Whenever you have pulsed signals with a high number of pulses within the spectrogram window, you can get effects like this.

So, I would take these 'b' syllable and zoom into the time series. If they indeed are composed as a sequence of pulses, I would the characterize them by pulses/seconds and amplitude.

answered Jul 21 at 18:26

WMXZ

8,0661 gold badge11 silver badges35 bronze badges

Add a comment |

淋巴结节吃什么药	男性更年期吃什么药	结石能喝什么茶	什么叫公租房	火龙果什么时候开花
耳鼻喉科属于什么科	月经期间可以吃什么水果	爬山需要准备什么东西	什么的天空飘着什么的白云	软科是什么意思
夏威夷披萨都有什么配料	3月23是什么星座	坐月子适合吃什么水果	什么是闰月	生殖器疱疹是什么原因引起的
胆囊肌腺症是什么病	祛风是什么意思	阳痿吃什么药	罗宾尼手表什么档次	主动脉硬化什么意思

梦见自己掉牙齿是什么征兆hcv9jop5ns7r.cn	出水芙蓉是什么意思hcv8jop5ns8r.cn	荷花什么时候开hcv9jop3ns8r.cn	三点水加盆读什么wzqsfys.com	雪茄为什么不过肺kuyehao.com
感冒了喝什么汤好hcv9jop0ns0r.cn	夏天受凉感冒吃什么药hcv9jop7ns4r.cn	洗耳朵用什么药水gangsutong.com	冉是什么意思hcv8jop2ns5r.cn	白带褐色什么原因hcv8jop6ns3r.cn
cas是什么意思hcv7jop5ns3r.cn	异常白细胞形态检查是查什么病naasee.com	脾胃不好吃什么食物好hcv8jop3ns9r.cn	两个圈的皮带是什么牌子hcv8jop8ns4r.cn	gfr医学上是什么意思hcv7jop6ns0r.cn
人越来越瘦是什么原因hcv9jop6ns2r.cn	增强ct是什么hcv9jop4ns3r.cn	什么原因会怀上葡萄胎xjhesheng.com	孕妇佩戴什么保胎辟邪hcv9jop6ns0r.cn	梦见男朋友是什么意思liaochangning.com

Stack Exchange Network

拔了牙吃什么消炎药

1 Answer 1

Your Answer

Hot Network Questions

拔了牙吃什么消炎药

1 Answer 1

Your Answer

Sign up or log in

Post as a guest

Related

Hot Network Questions