Deprecated: The SidTrain
task is deprecated for IDOL Server version 11.0.0. Use the SpkIdTrain
task instead.
This task is still available for existing implementations, but it might be incompatible with new functionality. The task might be deleted in future.
The SidTrain
task uses an audio file to create or update a speaker training file. You can use a set of SidTrain
tasks (with one audio file for each task) and combine these with the SidTrainFinal task to use multiple audio files to train a single speaker model.
Parameter | Description | Required |
---|---|---|
Type | The task name. Set to SidTrain .
|
Yes |
Ast | The base classifier file to use for speech segmentation. | |
Diag | Whether to generate diagnostic information. | |
DiagFile | The file to write the diagnostic information to. | |
EndTime | The end time of a specific section of an audio file to process. | |
File | The audio file that contains sample speech from one person. | Yes |
OrgModel | The base model to update (this can be the base USM, or an existing SPK file). | |
Sfreq | The sample frequency of the audio file to process. | |
SidBase | The sid base pack resource to use to determine the base files to use. | |
Sig | The .sig file to use for speaker identification. | |
Spt | The name of the speaker training file to create. | Yes |
SptAppend | Whether to append to an existing SPT file, or overwrite it. | |
StartTime | The start time of a specific section of an audio file to process. | |
SugdInputChannels | The channel layout of the input media file. | |
SugdInputFrequency | The sampling rate of the input media file. |
http://localhost:13000/action=AddTask&Type=SidTrain&File=C:/Data/BobSpeech.wav&Spt=Bob.spt
This action uses port 13000
to instruct IDOL Speech Server, which is located on the local machine, to create the Bob.spt
speaker training file using the BobSpeech.wav
file.
|