We are nearly complete on an AI solution – it is being built on top of Microsoft Teams and Azure OpenAI APIs. We call it SSW YakShaver.
The product has a blocking issue. We are recording a Teams call (and screen) and we have hit something called a “compliance policy”.
The below is what we plan to do from here. I would have assumed we would have gone with Option 1 (using the Teams bot with a compliance policy) but it seems like we are going to use the Graph API (simpler).
Thoughts?
——————————————————————————————————————-
I got this email from the YakShaver team:
We have been trying to get the compliance policy working in MS Teams (Option1), but have pretty much given up.
We have decided to do Option 2 below followed by Option 2.1.
We are *not* doing Option 1.
Here are all the points we have learnt.
Then the user flow.
Then the pros and cons.
Option 1: Media Stream – Application hosted or compliance recording bot (Not recommended)
Using a Teams bot with a compliance policy to record the audio and video, it will record every participants sound individually, those who are not in the policy will not be recorded.
- Complexity – on the backend
- We need the compliance policy on everyone in the call for them to be recorded
- User adds bot to the call or meeting, it will directly start to record, when the user kick out the bot it will stop.
- The bot will send an api call to the endpoint /api/call
- After this we need to connect with Microsoft Media Stream
- From the Media Stream we can get the video stream.
- We need to add in a way to store the videos so that Media Stream to do this
- Suggestion: Use data blob (YakShaver Teams bot store video in datablob, and media stream pulls it)
- We could send it directly to the api, but according to Uly we need to store the video somewhere so the final PBI can have a link to it.
- When we’re using the compliancy policy it’s not recording as normally and the video will not be stored in Stream as it would normally would be
User flow
- User adds bot to the call or meeting.
- It will directly start to record, and the user can tell the task.
- When the user kicks out the bot, it will stop the recording and send of the task.
✅ Pros:
- No frontend needed, just an instruction page
- Easy on the user end, the user never needs to start recording and stop recording, instead the user can just add and remove the bot.
❌ Cons:
- Complexity of coding on the backend
- Saving Audio and video files is a problem
- Audio might be recorded separately among participants, and if so, it requires coding to re-join them together
- We need extra work to notify the user that they are being recorded
- Potential legal issue because we record it and we are responsible for it, we need some extra safety rules on the recorded audios
- We need to run the whole thing on the windows VM.
Info – Application Hosted Bot
https://learn.microsoft.com/en-us/microsoftteams/platform/bots/calls-and-meetings/calls-meetings-bots-overview
https://learn.microsoft.com/en-us/microsoftteams/platform/bots/calls-and-meetings/requirements-considerations-application-hosted-media-bots
Option 2: Use the Graph API (Recommended)
Using the Graph API to do the work, the user manually start and stop recording, and then the YakShaver will get the transcrips for the user manually.
- Delegated auth flow, everyone can be recorded.
- Access to transcript need a PAT (personal access token)
- We can’t access the start and stop recording from the API so this has to be done manually.
- User start recording, user end recording
- Since the user is starting and stopping the recording – the recording is stored on their account and everyone in the meeting has access to it
- That’s why we need their PAT to access the recording/transscript
- Later: investigate how we can move the recording to another location where everyone have access
User flow (⚠️worse flow, but quicker to develop and cheaper $ Azure costs)
- User start recording
- User click on YakShaver tab button before or after recording
- User stop recording
- The YakShaver will do the task
✅ Pros:
- Less software complexity
- No need to save video stream and no need to merge transcripts.
- We have the UI so the YakShaver can track the issue being created and show it on the UI
- It’s already been proved to work in POC
- We can sell it to a client because it’s all using delegated permissions
❌ Cons:
- We need some front-end work, pop ups, SSOs to manage user credentials.
- More steps on the user end, manual recording is annoying and not natural.
User Flow – Options to skip the extra clicks (7 clicks in total):
Option 2 requires us to manually start and stop the recording in Teams. The following are 2 options to get around that.
Option 2.1: Azure Communication Service
Use communication service to automate the call record process – need a POC after Graph is done.
✅ Pros:
– It’s easy to use
– We can get rid of the compliance policy which is much much simpler!!!!!
– I can see the potential of making YakShaver into a commercial project.
– We can define our own compliance rule rather than rely on Azure to set up the compliance policy
❌ Cons:
– Calls have to be initialized from communication services
– It’s our responsibility to notify recording – very important
– We have to do compliance recording policy ourselves if we make it commercial later – might need a lawyer
How Azure Communication Services endpoints work with the Graph API?
Things we are sure of from the documentation:
- Azure Communication services can initialize a call, if the call is from Azure communication services, we can record it anytime
- Azure Communication services might be able record teams call but it might not.
- The call might not be on Teams.
When the users are calling the other user from YakShaver
- Can call another user from YakShaver and click record on YakShaver
- Users being invited to another call
- Inside that call, everything will be recorded
When 2 users are on a call and all of a sudden they decide to use YakShaver
- YakShaver will call them
- All users will be muted on teams
- All users will be invited to another call
- The newly created group call will be recorded
What happens if it’s a scheduled meeting
- We don’t need Azure communication service, the delegated flow should just be working….
Option 2.2: Click script
- Could use this if Option 2.1 fails
- What user interaction with computer can’t be done with an old-school click script?