SynTex

SynTex collection of synthetic audio textures

Each dataset has a unique set of synthesis parameters that are grid-sampled at the specified resolutions to produce the data files described for each entry below. A representative set of audio samples from each dataset can be auditioned from the 'Audio samples' drop-down menu. Example sounds are named by concatenating the parameter names, each followed by =L, =M, or =H to indicate that a low, medium, or high value of that parameter was used to synthesize the sample.

Creating the default dataset on your local system is a one-step process. You can also customize the dataset by editing the JSON configuration file (see the accompanying README file).
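As a rough illustration of what grid sampling means here, the sketch below enumerates the parameter grid for DS_BasicAM_1.1 using the ranges and resolutions listed in its entry. The helper name grid, and the assumption that values are evenly spaced with both endpoints included, are illustrative and not taken from the SynTex code; the actual sampling is controlled by the configuration file.

```python
from itertools import product

def grid(lo, hi, count):
    """Return `count` evenly spaced values from lo to hi (endpoints included).

    Hypothetical helper: even spacing is an assumption, not the SynTex API.
    """
    if count == 1:
        return [lo]
    step = (hi - lo) / (count - 1)
    return [lo + i * step for i in range(count)]

# Ranges and resolutions as listed in the DS_BasicAM_1.1 entry.
params = {
    "cf_exp": grid(0, 1, 13),
    "mf_exp": grid(0, 20, 21),
    "mI": grid(0, 1, 11),
}

# One file per combination of parameter values.
settings = list(product(*params.values()))

# Each file is a 2.0 second chunk.
total_seconds = len(settings) * 2.0
```

Under these assumptions the grid has 13 * 21 * 11 = 3003 settings and 6006 seconds of audio, which matches the 1 Hrs:40 Min:6.0 Sec listed for that dataset.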

DS_BasicAM_1.1
sound_id 6784f6aad5fe931d3f317536

Description: Amplitude modulation, (.5+.5*mI*cos(2π mf t))*sin(2π cf t): the amplitude of a sine wave with carrier frequency cf in [330, 660] Hz is modulated by a raised cosine wave with frequency mf in [0, 55] Hz, ranging from slow, shallow tremolo to the edge of the perception of a timbre.

Parameters:

cf_exp : 13 values mapped to n in [0, 1], where n -> 330*2^n, carrier frequency in Hz
mf_exp : 21 values mapped to n in [0, 20], where n -> (1.22185)^n, modulator frequency in Hz (n=20 -> 55 Hz).
mI : 11 values mapped to n in [0, 1], where n -> modulation index

Key Features:
1 audio variations per parameter setting for 3003 Parameters (3003 Files)
1 Hrs:40 Min:6.0 Sec of Audio segmented as 2.0 sec chunks
Total Size of Dataset: 183.29 MB @ SR 16Khz
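The AM formula and parameter mappings in this entry can be sketched in plain Python as follows. This is a minimal sketch under stated assumptions, not the generator used to build the dataset: the function name am_tone is hypothetical, and details such as initial phase and output scaling are guesses.

```python
import math

SR = 16000  # sample rate stated in the entry

def am_tone(cf_exp, mf_exp, mI, dur=2.0, sr=SR):
    """Raised-cosine amplitude modulation of a sine carrier (hypothetical helper)."""
    cf = 330.0 * 2.0 ** cf_exp   # carrier frequency in Hz
    mf = 1.22185 ** mf_exp       # modulator frequency in Hz
    two_pi = 2.0 * math.pi
    return [
        (0.5 + 0.5 * mI * math.cos(two_pi * mf * i / sr))
        * math.sin(two_pi * cf * i / sr)
        for i in range(int(dur * sr))
    ]

# Highest modulator frequency on the grid, full modulation depth.
tone = am_tone(cf_exp=0.0, mf_exp=20.0, mI=1.0)
```

Because mI is in [0, 1], the envelope stays in [0, 1] and the output never exceeds unit amplitude.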

DS_BasicFM_1.1
sound_id 67853eaf78c25f751abb1f5b

Description: Frequency modulation, sin(2pi*cf*t + mI*sin(2pi*mf*t)): a sine wave with cf in [330, 660] Hz is modulated by a sine wave with mf in [0, 55] Hz. The parameters span the range from a slow perceptual vibrato to modulation fast enough to just be heard as a timbre.

Parameters:

cf_exp : 13 values mapped to n in [0, 1], where n -> 330*2^n, carrier frequency in Hz
mf_exp : 21 values mapped to n in [0, 20], where n -> (1.22185)^n, modulation frequency in Hz (n=20 -> 55 Hz).
mI : 11 values mapped to n in [0, 25], where n -> modulation index

Key Features:
1 audio variations per parameter setting for 3003 Parameters (3003 Files)
1 Hrs:40 Min:6.0 Sec of Audio segmented as 2.0 sec chunks
Total Size of Dataset: 183.29 MB @ SR 16Khz
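A comparable sketch for the FM formula in this entry is shown below. The function name fm_tone is hypothetical and the code is only an illustration of the stated formula. It also returns the peak frequency deviation mI*mf, which follows from differentiating the phase 2pi*cf*t + mI*sin(2pi*mf*t): the instantaneous frequency is cf + mI*mf*cos(2pi*mf*t).

```python
import math

def fm_tone(cf_exp, mf_exp, mI, dur=2.0, sr=16000):
    """Sine carrier with sinusoidal phase modulation (hypothetical helper)."""
    cf = 330.0 * 2.0 ** cf_exp   # carrier frequency in Hz
    mf = 1.22185 ** mf_exp       # modulation frequency in Hz
    two_pi = 2.0 * math.pi
    samples = [
        math.sin(two_pi * cf * i / sr + mI * math.sin(two_pi * mf * i / sr))
        for i in range(int(dur * sr))
    ]
    # Peak deviation of the instantaneous frequency from cf, in Hz.
    peak_deviation_hz = mI * mf
    return samples, peak_deviation_hz

samples, deviation = fm_tone(cf_exp=0.0, mf_exp=0.0, mI=25.0)
```

At the top of the grid (mI = 25, mf near 55 Hz) the deviation is larger than the carrier frequency itself, which is consistent with the description of the sound moving from vibrato toward a timbre.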

DS_Chirp_1.1
sound_id 678552e89df7a0b03d0ebad6

Description: A sequence of sine wave chirps rising or falling (nocts) with various rates and regularity in event timing.

Parameters:

irreg_exp : 3 values mapped to n in [0, 2], where n -> (n/event-per-second) as standard deviation of gaussian around regularly spaced events
rate_exp : 5 values mapped to n in [1, 4], where n -> 2^(n) events per second
nocts : 4 values mapped to n in [-3, 3], where n -> number of octaves spanned by chirp centered at center frequency
evdur : 5 values mapped to n in [0.04, 0.2], where n -> duration of each chirp in seconds
cf_exp : 7 values mapped to n in [0, 1], where n -> 440*2^(n), center frequency in Hz

Key Features:
1 audio variations per parameter setting for 2100 Parameters (2100 Files)
2 Hrs:20 Min:0.0 Sec of Audio segmented as 4.0 sec chunks
Total Size of Dataset: 256.35 MB @ SR 16Khz
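The event timing implied by rate_exp and irreg_exp can be sketched as regularly spaced events with Gaussian jitter. This is an illustrative reading of the parameter descriptions, not the SynTex implementation: the function name event_times, the fixed seed, and the choice to drop events that fall outside the chunk are all assumptions.

```python
import random

def event_times(rate_exp, irreg_exp, dur=4.0, seed=0):
    """Regularly spaced event times with Gaussian jitter (hypothetical helper)."""
    eps = 2.0 ** rate_exp      # events per second
    sd = irreg_exp / eps       # jitter standard deviation in seconds
    rng = random.Random(seed)
    n_events = int(dur * eps)
    times = [i / eps + rng.gauss(0.0, sd) for i in range(n_events)]
    # Assumption: events jittered outside the chunk are simply dropped.
    return sorted(t for t in times if 0.0 <= t < dur)

regular = event_times(rate_exp=1, irreg_exp=0.0)   # evenly spaced, 2 per second
jittered = event_times(rate_exp=3, irreg_exp=1.0)  # 8 per second, sd = 1/8 s
```

Dividing by events per second keeps the jitter proportional to the spacing between events, so a given irreg_exp produces a similar degree of irregularity at every rate.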

DS_Pops_3.1
sound_id 67856f837444c89e806a3daa

Description: Short pops constructed of 3 noise samples followed by narrow band-pass filters are generated with a range of center frequencies, rates, and regularity.

Parameters:

irreg_exp : 11 values mapped to n in [0, 1], where n -> (n/event-per-second) as standard deviation of gaussian around regularly spaced events normalized by events-per-second.
cf : 13 values mapped to n in [440, 880], where n -> Center frequency in Hz.
rate_exp : 10 values mapped to n in [1, 4], where n -> 2**n events per second.

Key Features:
1 audio variations per parameter setting for 1430 Parameters (1430 Files)
1 Hrs:35 Min:20.0 Sec of Audio segmented as 4.0 sec chunks
Total Size of Dataset: 174.56 MB @ SR 16Khz

DS_TokWotalDuet_2.1
sound_id 678572c412b10df76ab31331

Description: Event times of varying temporal regularity are occupied by either a wood or a metal tok in varying ratios. The metal sound has a variable range of center frequencies.

Parameters:

hitratio : 6 values mapped to n in [0.5, 1], where n -> proportion of event times that will have a sound event
wmratio : 9 values mapped to n in [0, 1], where n -> proportion of sound events that will be wood as opposed to metal
rate_exp : 7 values mapped to n in [1, 3], where n -> 2^(n) events per second

Key Features:
1 audio variations per parameter setting for 378 Parameters (378 Files)
0 Hrs:37 Min:48.0 Sec of Audio segmented as 6.0 sec chunks
Total Size of Dataset: 69.21 MB @ SR 16Khz

DS_Tapping1.2_1.1
sound_id 6785759dcac32a7847df11bd

Description: Regularly spaced 2-strike tapping. Similar to 'Tapping 1-2.wav' from McDermott and Simoncelli (2011 Neuron paper), but with systematic variation of rate and phase of 2nd tap.

Parameters:

rate_exp : 8 values mapped to n in [0.25, 2.25], where n -> 2**n events per second.
phaserel : 10 values mapped to n in [0.05, 0.5], where n -> phase (in [0,1]) of the second tap in the cycle

Key Features:
1 audio variations per parameter setting for 80 Parameters (80 Files)
0 Hrs:13 Min:20.0 Sec of Audio segmented as 10.0 sec chunks
Total Size of Dataset: 24.41 MB @ SR 16Khz
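One possible reading of rate_exp and phaserel is sketched below: each cycle starts with a first tap, and the second tap follows at the fraction phaserel of the cycle period. The function name tap_times is hypothetical, and treating one "event" as one two-strike cycle is an assumption that the entry does not confirm.

```python
def tap_times(rate_exp, phaserel, dur=10.0):
    """Onset times for regularly spaced two-strike taps (hypothetical helper)."""
    period = 1.0 / (2.0 ** rate_exp)   # seconds per cycle
    times = []
    k = 0
    while k * period < dur:
        start = k * period
        times.append(start)                    # first tap of the cycle
        second = start + phaserel * period     # second tap, phase in [0, 1]
        if second < dur:
            times.append(second)
        k += 1
    return times

example = tap_times(rate_exp=1.0, phaserel=0.25, dur=1.0)
```

With phaserel = 0.5 the two taps are evenly spaced, and smaller values move the second tap closer to the first.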

DS_Pistons_1.1
sound_id 67861b91e0981b588ccd4ec4

Description: An 8-cycle 'engine.' The irregular timing of a cycle of 8 pistons is repeated exactly, but the piston burst is random noise. Different samples have different irregular timing patterns for the same rate parameter.

Parameters:

rate_exp : 5 values mapped to n in [4, 8], where n -> 2^n events per second
irreg_exp : 4 values mapped to n in [0.3, 0.9], where n -> normalized sd of gaussian around regularly spaced events: .1*n*np.power(10,n) = irregularity, gaussian sd=irregularity/eps

Key Features:
10 audio variations per parameter setting for 20 Parameters (200 Files)
0 Hrs:13 Min:20.0 Sec of Audio segmented as 4.0 sec chunks
Total Size of Dataset: 24.41 MB @ SR 16Khz
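The irreg_exp mapping in this entry can be evaluated directly. The sketch below computes the Gaussian standard deviation from the stated formula, reading eps as events per second (2^rate_exp); the function name piston_jitter_sd is hypothetical.

```python
def piston_jitter_sd(rate_exp, irreg_exp):
    """Jitter standard deviation in seconds from the stated mapping (hypothetical helper)."""
    eps = 2.0 ** rate_exp                               # events per second
    irregularity = 0.1 * irreg_exp * 10.0 ** irreg_exp  # .1*n*np.power(10,n)
    return irregularity / eps

# Lowest and highest irregularity at the slowest rate on the grid.
sd_low = piston_jitter_sd(rate_exp=4, irreg_exp=0.3)
sd_high = piston_jitter_sd(rate_exp=4, irreg_exp=0.9)
```

The factor 10^n makes the irregularity grow much faster than linearly, so the top of the [0.3, 0.9] range is roughly twelve times more irregular than the bottom.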

Description: Feedback comb-filtered noise: y[n] = (1-alpha)*x[n] + alpha*y[n-K]. Intended for testing models that perform differently depending on how pitched a signal is.

Parameters:

pitchedness : 11 values mapped to n in [0, 1], where n -> 1-1/2^(4n) = alpha, the feedback coefficient. Thus n in [0, 1] -> [white noise, pitched]

Key Features:
3 audio variations per parameter setting for 11 Parameters (33 Files)
0 Hrs:01 Min:22.5 Sec of Audio segmented as 2.5 sec chunks
Total Size of Dataset: 2.52 MB @ SR 16Khz
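The comb filter recurrence in this entry can be sketched in a few lines. The sketch reads the exponent in the pitchedness mapping as 4n, so that n = 0 gives alpha = 0 (unfiltered noise). The function name comb_noise, the delay K = 40 samples (a 400 Hz pitch at 16 kHz), and the use of uniform noise as input are all illustrative assumptions; the entry does not state the delay or the noise distribution.

```python
import random

def comb_noise(pitchedness, K=40, dur=2.5, sr=16000, seed=0):
    """Feedback comb-filtered noise, y[n] = (1-alpha)*x[n] + alpha*y[n-K]."""
    alpha = 1.0 - 1.0 / 2.0 ** (4.0 * pitchedness)  # feedback coefficient
    rng = random.Random(seed)
    n_samples = int(dur * sr)
    y = [0.0] * n_samples
    for i in range(n_samples):
        x = rng.uniform(-1.0, 1.0)                   # assumed noise source
        feedback = y[i - K] if i >= K else 0.0       # zero initial state
        y[i] = (1.0 - alpha) * x + alpha * feedback
    return alpha, y

alpha_noise, noise = comb_noise(0.0)     # alpha = 0: plain noise
alpha_pitched, pitched = comb_noise(1.0)  # alpha = 0.9375: strongly pitched
```

Since each output sample is a weighted average of an input sample and an earlier output sample, the signal stays within [-1, 1] for any alpha in [0, 1).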

DS_Chua_1.1
sound_id 67862548a2038d80c56fc511

Description: Chua Oscillator varying alpha, beta, gamma of canonical dimensionless equations (see, for example, Ch 3 of Bilotta, E., & Pantano, P. (2008). A Gallery Of Chua Attractors, World Scientific. This dataset is derived from the DE64 example in that book). This set could be challenging to model because of the nonlinear relationship between parameters and audio. Although the parameters are grid sampled, the audio data characteristics are not balanced. The parameters explore the space between pitched and noisy sounds. A few sounds are not strictly textures because they are bistable and make a single switch from one attractor to the other. If you look at DSChua.py, you'll see a data structure with 150 parameter sets for different Chua attractors. You can generate datasets for them by editing the config_file.

Parameters:

alpha : 9 values mapped to n in [6.2, 7], where n -> coefficient
beta : 9 values mapped to n in [10.25, 11], where n -> coefficient
gamma : 9 values mapped to n in [-0.2, -0.045], where n -> coefficient

Key Features:
1 audio variations per parameter setting for 729 Parameters (729 Files)
0 Hrs:36 Min:27.0 Sec of Audio segmented as 3.0 sec chunks
Total Size of Dataset: 66.74 MB @ SR 16Khz

DS_Mosquito_4.1
sound_id 6786367b08e6c1b9d711d678

Description: A mosquito- or fly-like sound that moves in space. Uses simplex noise for smooth modulations (e.g., distance) affecting frequency and amplitude at longish time scales. See the configuration to explore a wealth of synth parameters that go well beyond the mosquito-like sound.

Parameters:

cf_exp : 7 values mapped to n in [-1.5, 0], where n -> 440*2**n, the center frequency of the beating wings
motion_freq : 3 values mapped to n in [1, 2], where n -> the lp filter cutoff frequency used to filter the noise that produces the smoothly-varying distance of the bug to you.
busybodyFreq : 3 values mapped to n in [1, 15], where n -> synchronized noisy AM & FM average modulation frequency

Key Features:
1 audio variations per parameter setting for 63 Parameters (63 Files)
0 Hrs:08 Min:24.0 Sec of Audio segmented as 8.0 sec chunks
Total Size of Dataset: 15.38 MB @ SR 16Khz

DS_WindChimes_1.1
sound_id 67863924adbb8b78b7bad677

Description: Wind chimes that can be sized and blown in wind of varying strength. Wind blows in the background; change this in the config file if you wish.

Parameters:

strength : 11 values mapped to n in [0, 1], where n -> strength (180_440*n is average freq of noise filter center frequency)
chimeSize : 11 values mapped to n in [0, 1], where n -> size of the chime (inversely proportional to frequency)

Key Features:
1 audio variations per parameter setting for 121 Parameters (121 Files)
0 Hrs:14 Min:7.0 Sec of Audio segmented as 7.0 sec chunks
Total Size of Dataset: 25.85 MB @ SR 16Khz

DS_Peepers_2.1
sound_id 67863af29f41ced5629c6fcb

Description: Like peepers (tree frogs), these sounds are made of sequences of chirps ('words'), with chirps varying in center frequency, range, and event period. The default dataset parameters are the number of peepers in the chorus and the number of octaves swept by each chirp. https://syntex.sonicthings.org/peeperSynthExample

Parameters:

numPeepers_exp : 7 values mapped to n in [0, 6], where n -> 2^n number of Peepers to play simultaneously
mean_nocts : 11 values mapped to n in [-1.25, 1.25], where n -> the mean number of octaves around a center frequency of each chirp in a word

Key Features:
1 audio variations per parameter setting for 77 Parameters (77 Files)
0 Hrs:08 Min:59.0 Sec of Audio segmented as 7.0 sec chunks
Total Size of Dataset: 16.45 MB @ SR 16Khz

DS_Bees_3.1
sound_id 67863dcea0b1af21d7b296de

Description: A swarm of bugs (although parameters for frequency excursions take sounds out of bounds of realism). Uses simplex noise for smooth modulations (e.g., distance) affecting frequency and amplitude at longish time scales. See the configuration for other synth parameters to explore.

Parameters:

busybodyFreqFactor : 11 values mapped to n in [0, 0.5], where n -> noisy FM max variation (cf frequency variation in octaves)
cf_exp : 11 values mapped to n in [-2, 0], where n -> 440*2**n, the center frequency of the beating wings

Key Features:
1 audio variations per parameter setting for 121 Parameters (121 Files)
0 Hrs:08 Min:4.0 Sec of Audio segmented as 4.0 sec chunks
Total Size of Dataset: 14.77 MB @ SR 16Khz

DS_Applause_2.1.1
sound_id 678647196111654cdb1acf60

Description: Rough imitation of applause varying in number of clappers and clapping rate. Uses a bit of reverb that can also be manipulated in the config file.

Parameters:

numClappers_exp : 11 values mapped to n in [0, 7], where n -> round(2^n) number of clappers to play simultaneously
rate_exp : 7 values mapped to n in [0, 3], where n -> 2^n claps (on average) per second per clapper

Key Features:
1 audio variations per parameter setting for 77 Parameters (77 Files)
0 Hrs:03 Min:51.0 Sec of Audio segmented as 3.0 sec chunks
Total Size of Dataset: 7.05 MB @ SR 16Khz
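Because numClappers_exp takes 11 values across [0, 7], most grid points are non-integer exponents, which is why the mapping rounds 2^n. The sketch below lists the resulting clapper counts, assuming the 11 values are evenly spaced with endpoints included; the helper name grid is hypothetical and the real spacing is set by the configuration file.

```python
def grid(lo, hi, count):
    """Evenly spaced values from lo to hi, endpoints included (assumed spacing)."""
    step = (hi - lo) / (count - 1)
    return [lo + i * step for i in range(count)]

# numClappers_exp -> round(2^n) clappers playing simultaneously.
clapper_counts = [round(2.0 ** n) for n in grid(0, 7, 11)]

# rate_exp -> 2^n claps (on average) per second per clapper.
clap_rates = [2.0 ** n for n in grid(0, 3, 7)]
```

Under this assumption the counts run from a single clapper up to 128, and the average clap rate per clapper runs from 1 to 8 per second.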

DS_Wind_1.1
sound_id 67877fccfcde3b504a7fa2c8

Description: A wind sound generated with noise passed through a LP and BP filter modulated with simplex noise.

Parameters:

strength : 11 values mapped to n in [0, 1], where n -> strength (180_440*n is average freq of noise filter center frequency)
gustiness : 11 values mapped to n in [0, 1], where n -> frequency parameter for simplex noise creating variation in filter center frequency and gain
howliness : 11 values mapped to n in [0, 1], where n -> .5+40*n, the Q value for the BP noise filter.

Key Features:
1 audio variations per parameter setting for 1331 Parameters (1331 Files)
2 Hrs:35 Min:17.0 Sec of Audio segmented as 7.0 sec chunks
Total Size of Dataset: 284.33 MB @ SR 16Khz


...