You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Copy file name to clipboardExpand all lines: reference.md
+6-27Lines changed: 6 additions & 27 deletions
Display the source diff
Display the rich diff
Original file line number
Diff line number
Diff line change
@@ -1516,9 +1516,9 @@ client = ElevenLabs(
1516
1516
)
1517
1517
client.speech_to_speech.convert(
1518
1518
voice_id="string",
1519
-
enable_logging="0",
1520
-
optimize_streaming_latency="mp3_22050_32",
1521
-
output_format="string",
1519
+
enable_logging=True,
1520
+
optimize_streaming_latency="0",
1521
+
output_format="mp3_22050_32",
1522
1522
)
1523
1523
1524
1524
```
@@ -1553,36 +1553,23 @@ core.File` — See core.File for more documentation
1553
1553
<dl>
1554
1554
<dd>
1555
1555
1556
-
**enable_logging:**`typing.Optional[OptimizeStreamingLatency]` — You can turn on latency optimizations at some cost of quality. The best possible final latency varies by model.
1556
+
**enable_logging:**`typing.Optional[bool]` — When enable_logging is set to false full privacy mode will be used for the request. This will mean history features are unavailable for this request, including request stitching. Full privacy mode may only be used by enterprise customers.
1557
1557
1558
1558
</dd>
1559
1559
</dl>
1560
1560
1561
1561
<dl>
1562
1562
<dd>
1563
1563
1564
-
**optimize_streaming_latency:**`typing.Optional[OutputFormat]` — The output format of the generated audio.
1564
+
**optimize_streaming_latency:**`typing.Optional[OptimizeStreamingLatency]` — You can turn on latency optimizations at some cost of quality. The best possible final latency varies by model.
1565
1565
1566
1566
</dd>
1567
1567
</dl>
1568
1568
1569
1569
<dl>
1570
1570
<dd>
1571
1571
1572
-
**output_format:**`typing.Optional[str]`
1573
-
1574
-
Output format of the generated audio. Must be one of:
1575
-
mp3_22050_32 - output format, mp3 with 22.05kHz sample rate at 32kbps.
1576
-
mp3_44100_32 - output format, mp3 with 44.1kHz sample rate at 32kbps.
1577
-
mp3_44100_64 - output format, mp3 with 44.1kHz sample rate at 64kbps.
1578
-
mp3_44100_96 - output format, mp3 with 44.1kHz sample rate at 96kbps.
1579
-
mp3_44100_128 - default output format, mp3 with 44.1kHz sample rate at 128kbps.
1580
-
mp3_44100_192 - output format, mp3 with 44.1kHz sample rate at 192kbps. Requires you to be subscribed to Creator tier or above.
1581
-
pcm_16000 - PCM format (S16LE) with 16kHz sample rate.
1582
-
pcm_22050 - PCM format (S16LE) with 22.05kHz sample rate.
1583
-
pcm_24000 - PCM format (S16LE) with 24kHz sample rate.
1584
-
pcm_44100 - PCM format (S16LE) with 44.1kHz sample rate. Requires you to be subscribed to Pro tier or above.
1585
-
ulaw_8000 - μ-law format (sometimes written mu-law, often approximated as u-law) with 8kHz sample rate. Note that this format is commonly used for Twilio audio inputs.
1572
+
**output_format:**`typing.Optional[OutputFormat]` — The output format of the generated audio.
1586
1573
1587
1574
</dd>
1588
1575
</dl>
@@ -3441,14 +3428,6 @@ ultra lossless - ultra quality output format, 705.6kbps with 44.1kHz sample rate
3441
3428
<dl>
3442
3429
<dd>
3443
3430
3444
-
**callback_url:**`typing.Optional[str]` — A url that will be called by our service when the project is converted with a json containing the status of the conversion
A list of pronunciation dictionary locators (pronunciation_dictionary_id, version_id) encoded as a list of JSON strings for pronunciation dictionaries to be applied to the text. A list of json encoded strings is required as adding projects may occur through formData as opposed to jsonBody. To specify multiple dictionaries use multiple --form lines in your curl, such as --form 'pronunciation_dictionary_locators="{\"pronunciation_dictionary_id\":\"Vmd4Zor6fplcA7WrINey\",\"version_id\":\"hRPaxjlTdR7wFMhV4w0b\"}"' --form 'pronunciation_dictionary_locators="{\"pronunciation_dictionary_id\":\"JzWtcGQMJ6bnlWwyMo7e\",\"version_id\":\"lbmwxiLu4q6txYxgdZqn\"}"'. Note that multiple dictionaries are not currently supported by our UI which will only show the first.
134
133
135
-
callback_url : typing.Optional[str]
136
-
A url that will be called by our service when the project is converted with a json containing the status of the conversion
A list of pronunciation dictionary locators (pronunciation_dictionary_id, version_id) encoded as a list of JSON strings for pronunciation dictionaries to be applied to the text. A list of json encoded strings is required as adding projects may occur through formData as opposed to jsonBody. To specify multiple dictionaries use multiple --form lines in your curl, such as --form 'pronunciation_dictionary_locators="{\"pronunciation_dictionary_id\":\"Vmd4Zor6fplcA7WrINey\",\"version_id\":\"hRPaxjlTdR7wFMhV4w0b\"}"' --form 'pronunciation_dictionary_locators="{\"pronunciation_dictionary_id\":\"JzWtcGQMJ6bnlWwyMo7e\",\"version_id\":\"lbmwxiLu4q6txYxgdZqn\"}"'. Note that multiple dictionaries are not currently supported by our UI which will only show the first.
757
751
758
-
callback_url : typing.Optional[str]
759
-
A url that will be called by our service when the project is converted with a json containing the status of the conversion
When enable_logging is set to false full privacy mode will be used for the request. This will mean history features are unavailable for this request, including request stitching. Full privacy mode may only be used by enterprise customers.
Output format of the generated audio. Must be one of:
57
-
mp3_22050_32 - output format, mp3 with 22.05kHz sample rate at 32kbps.
58
-
mp3_44100_32 - output format, mp3 with 44.1kHz sample rate at 32kbps.
59
-
mp3_44100_64 - output format, mp3 with 44.1kHz sample rate at 64kbps.
60
-
mp3_44100_96 - output format, mp3 with 44.1kHz sample rate at 96kbps.
61
-
mp3_44100_128 - default output format, mp3 with 44.1kHz sample rate at 128kbps.
62
-
mp3_44100_192 - output format, mp3 with 44.1kHz sample rate at 192kbps. Requires you to be subscribed to Creator tier or above.
63
-
pcm_16000 - PCM format (S16LE) with 16kHz sample rate.
64
-
pcm_22050 - PCM format (S16LE) with 22.05kHz sample rate.
65
-
pcm_24000 - PCM format (S16LE) with 24kHz sample rate.
66
-
pcm_44100 - PCM format (S16LE) with 44.1kHz sample rate. Requires you to be subscribed to Pro tier or above.
67
-
ulaw_8000 - μ-law format (sometimes written mu-law, often approximated as u-law) with 8kHz sample rate. Note that this format is commonly used for Twilio audio inputs.
68
-
69
58
model_id : typing.Optional[str]
70
59
Identifier of the model that will be used, you can query them using GET /v1/models. The model needs to have support for speech to speech, you can check this using the can_do_voice_conversion property.
When enable_logging is set to false full privacy mode will be used for the request. This will mean history features are unavailable for this request, including request stitching. Full privacy mode may only be used by enterprise customers.
Output format of the generated audio. Must be one of:
268
-
mp3_22050_32 - output format, mp3 with 22.05kHz sample rate at 32kbps.
269
-
mp3_44100_32 - output format, mp3 with 44.1kHz sample rate at 32kbps.
270
-
mp3_44100_64 - output format, mp3 with 44.1kHz sample rate at 64kbps.
271
-
mp3_44100_96 - output format, mp3 with 44.1kHz sample rate at 96kbps.
272
-
mp3_44100_128 - default output format, mp3 with 44.1kHz sample rate at 128kbps.
273
-
mp3_44100_192 - output format, mp3 with 44.1kHz sample rate at 192kbps. Requires you to be subscribed to Creator tier or above.
274
-
pcm_16000 - PCM format (S16LE) with 16kHz sample rate.
275
-
pcm_22050 - PCM format (S16LE) with 22.05kHz sample rate.
276
-
pcm_24000 - PCM format (S16LE) with 24kHz sample rate.
277
-
pcm_44100 - PCM format (S16LE) with 44.1kHz sample rate. Requires you to be subscribed to Pro tier or above.
278
-
ulaw_8000 - μ-law format (sometimes written mu-law, often approximated as u-law) with 8kHz sample rate. Note that this format is commonly used for Twilio audio inputs.
279
-
280
258
model_id : typing.Optional[str]
281
259
Identifier of the model that will be used, you can query them using GET /v1/models. The model needs to have support for speech to speech, you can check this using the can_do_voice_conversion property.
0 commit comments