Hello, I was able to get evaluate_locally.py to successfully process the provided training data but wanted to explore using my own.
I followed what I believed to be the exact format (including making sure the bit depth and sample rate were identical) of the example files (such as from Al James - Schoolboy Facination).
I created a new folder and put each of the stems from my source into it and labeled them to match the examples but I’m receiving an error indicating an array problem.
Traceback (most recent call last):
File "/home/user/sdx-2023/sdx-2023-music-demixing-track-starter-kit/evaluate_locally.py", line 98, in <module>
evaluate(LocalEvalConfig)
File "/home/user/sdx-2023/sdx-2023-music-demixing-track-starter-kit/evaluate_locally.py", line 73, in evaluate
all_metrics[fname] = calculate_metrics(ground_truth_path, prediction_path)
File "/home/user/sdx-2023/sdx-2023-music-demixing-track-starter-kit/evaluate_locally.py", line 38, in calculate_metrics
gt = np.stack(gt) # shape: n_sources x n_samples x n_channels
File "<__array_function__ internals>", line 200, in stack
File "/home/user/sdx-2023/lib/python3.10/site-packages/numpy/core/shape_base.py", line 464, in stack
raise ValueError('all input arrays must have the same shape')
ValueError: all input arrays must have the same shape
The files I’m expecting to check out are:
sdx-2023-music-demixing-track-starter-kit/public_dataset/test/db-ld$ ls
accompaniment.wav bass.wav drums.wav mixture.wav other.wav vocals.wav
And mediainfo for vocals (but it’s identical across)
General
Complete name : vocals.wav
Format : Wave
File size : 49.5 MiB
Duration : 4 min 54 s
Overall bit rate mode : Constant
Overall bit rate : 1 411 kb/s
Writing application : Lavf58.76.100
Audio
Format : PCM
Format settings : Little / Signed
Codec ID : 1
Duration : 4 min 54 s
Bit rate mode : Constant
Bit rate : 1 411.2 kb/s
Channel(s) : 2 channels
Sampling rate : 44.1 kHz
Bit depth : 16 bits
Stream size : 49.5 MiB (100%)
Is there a maximum length for samples, or other guidance on how to prepare my data for evaluation?
Thanks in advance.