Speaker Identification for Streaming with Python

dilek

Dilek Karasoy

Posted on August 11, 2023

Speaker Identification for Streaming with Python

Open-source speaker recognition is the only option for developers, unless they work for large enterprises. Recently, at Picovoice we made our internal tool for Speaker Recognition public. So, those who prefer production-ready solutions now have an option!

Let's get started!

1. Install [Picovoice Eagle Speaker Recognition](https://picovoice.ai/platform/eagle/) using pip. We will be using pvrecorder to get cross-platform audio, so install that as well:

pip3 install pveagle pvrecorder
Enter fullscreen mode Exit fullscreen mode

2. Grab your AccessKey
If you haven't create an account on Picovoice Console for free and grab your AccessKey

3. Enroll Speakers
Import pveagle and create an instance of the EagleProfiler class:

import pveagle

access_key = "{YOUR_ACCESS_KEY}";
try:
    eagle_profiler = pveagle.create_profiler(access_key=access_key)
except pveagle.EagleError as e:
    # Handle error
    pass
Enter fullscreen mode Exit fullscreen mode

Don't forget to replace the placeholder with your AccessKey!

Now, import pvrecorder and create an instance of the recorder as well. Use the EagleProfiler's .min_enroll_samples as the frame_length:

from pvrecorder import PvRecorder

DEFAULT_DEVICE_INDEX = -1
recorder = PvRecorder(
    device_index=DEFAULT_DEVICE_INDEX,
    frame_length=eagle_profiler.min_enroll_samples)
Enter fullscreen mode Exit fullscreen mode

The .enroll() function generates a percentage value to know when Enrollment is done and another speaker can be enrolled:

recorder.start()

enroll_percentage = 0.0
while enroll_percentage < 100.0:
    audio_frame = recorder.read()
    enroll_percentage, feedback = eagle_profiler.enroll(audio_frame)

recorder.stop()
Enter fullscreen mode Exit fullscreen mode

4. Export the Speaker Profile

We'll need use in the next step to identify / verify the speaker!

speaker_profile = eagle_profiler.export()
Enter fullscreen mode Exit fullscreen mode

You can reuse the speaker_profile object. Check out the docs for details.

Add more speakers by creating additional profiles by calling the .reset() function on the EagleProfiler, and repeating the .enroll() step.

5. Clean up used resources:
Once you create profiles for all speakers, let's clean up used resources!

recorder.delete()
eagle_profiler.delete()
Enter fullscreen mode Exit fullscreen mode

6. Recognize Speakers:

Enter fullscreen mode Exit fullscreen mode

import pveagle

access_key = "{YOUR_ACCESS_KEY}"
profiles = [speaker_profile_1, speaker_profile_2]
try:
eagle = pveagle.create_recognizer(
access_key=access_key,
speaker_pofiles=profiles)
except pveagle.EagleError as e:
# Handle error
pass

Set up `pvrecorder` to use with Eagle Speaker Recognition:

Enter fullscreen mode Exit fullscreen mode

recorder = PvRecorder(
device_index=DEFAULT_DEVICE_INDEX,
frame_length=eagle.frame_length)

Pass audio frames into the `eagle.process()` function get back speaker scores:

Enter fullscreen mode Exit fullscreen mode

while True:
audio_frame = recorder.read()
scores = eagle.process(audio_frame)

When finished, again clean up used resources:


Enter fullscreen mode Exit fullscreen mode

eagle.delete()




## Connect them All Together

Enter fullscreen mode Exit fullscreen mode

import pveagle
from pvrecorder import PvRecorder

DEFAULT_DEVICE_INDEX = -1
access_key = "{YOUR_ACCESS_KEY}";

Step 1: Enrollment

try:
eagle_profiler = pveagle.create_profiler(access_key=access_key)
except pveagle.EagleError as e:
pass

enroll_recorder = PvRecorder(
device_index=DEFAULT_DEVICE_INDEX,
frame_length=eagle_profiler.min_enroll_samples)

enroll_recorder.start()

enroll_percentage = 0.0
while enroll_percentage < 100.0:
audio_frame = enroll_recorder.read()
enroll_percentage, feedback = eagle_profiler.enroll(audio_frame)

enroll_recorder.stop()

speaker_profile = eagle_profiler.export()

enroll_recorder.delete()
eagle_profiler.delete()

Step 2: Recognition

try:
eagle = pveagle.create_recognizer(
access_key=access_key,
speaker_pofiles=[speaker_profile])
except pveagle.EagleError as e:
pass

recognizer_recorder = PvRecorder(
device_index=DEFAULT_DEVICE_INDEX,
frame_length=eagle.frame_length)

recognizer_recorder.start()

while True:
audio_frame = recorder.read()
scores = eagle.process(audio_frame)
print(scores)

recognizer_recorder.stop()

recognizer_recorder.delete()
eagle.delete()




---
For more information:

- [Check the original article],(https://picovoice.ai/blog/speaker-recognition-in-python/)
- Learn more about Speaker [Recognition](https://picovoice.ai/blog/speaker-recognition/), [Identification](https://picovoice.ai/blog/speaker-identification/) and [Verification](https://picovoice.ai/blog/voice-biometrics/),
- [Check out other Eagle Speaker Recognition SDKs](https://picovoice.ai/docs/eagle/)
- Visit [Eagle Speaker Recognition GitHub repository](https://github.com/Picovoice/eagle) for open-source demos and to create issues. While on GitHub, if you like building with Eagle Speaker Recognition, give it a star and help fellow devs find it easily.
Enter fullscreen mode Exit fullscreen mode
💖 💪 🙅 🚩
dilek
Dilek Karasoy

Posted on August 11, 2023

Join Our Newsletter. No Spam, Only the good stuff.

Sign up to receive the latest update from our blog.

Related

Speaker Identification for Streaming with Python
machinelearning Speaker Identification for Streaming with Python

August 11, 2023