top of page

WEEK 4: RESEARCH FINDINGS

  • Lauren Reyes
  • Feb 4
  • 2 min read

This week we were tasked to provide an update as our midterm assignment. This post will cover the main points of my presentation.


MAIN POINT OF CHANGE:

  • Pre-recorded voice & script -> AI Generative Conversations & Voice Recognition

  • Purchased the rotary phone


DO I USE A RASPBERRY PI OR ARDUINO?

  • Raspberry Pi!

    • Accurate audio processing and API integration

    • Can handle speech recognition & text to speech

    • USB audio support

    • I am more proficient in Python

    • Can multitask


UPDATED TIMELINE

  • WINTER

    • WEEK 5: Purchase equipment/hardware, begin coding

      WEEK 6: Continue coding

      • Focus on text-to-speech & API integration

    • WEEK 7: Coding complete, begin hardware assembly

    • WEEK 8: Assembled, begin testing

    • WEEK 9: Final presentation

  • SPRING

    • WEEK 11: PROGRAM API UNDER CERTAIN PERSONALITIES, FIND/CREATE VOICE BANKS

    • WEEK 12-13: BEGIN SCULPTURE 

      • BUY MATERIALS, CREATE SIMPLE MOLD

    • WEEK 14: ASSEMBLE EVERYTHING

    • WEEK 15: MIDTERM

    • WEEK 16-18: USER TESTING/DEBUGGING


HARDWARE

Raspberry Pi 4 Model B (4GB RAM)

Main processing unit

Vintage Rotary Phone Shell

Thrifted or vintage phone body

USB Audio Adapter

Enables microphone and speaker connectivity (e.g., UGREEN USB Audio Adapter)

Mini Speaker (3W-5W, 4Ω or 8Ω)

Fits inside the phone's original speaker location

Electret Microphone Module (MAX9814 or similar)

Captures user speech, placed in the original microphone location

SPDT Micro Switch

Detects handset lift and placement

5V 3A Power Supply with USB-C Connector

Powers the Raspberry Pi (e.g., LitStar 5V 3A USB Type C Adapter)

3.5mm Audio Jack Breakout Board

Facilitates clean audio connections inside the phone shell

SOFTWARE

  • Raspberry Pi OS (Lite or Full)

    • 16GB+ micro SD card

  • Python (3.x)

  • Speech-to-Text API (e.g., Google Cloud Speech-to-Text API)

  • Text-to-Speech API (e.g., ElevenLabs API / Google Text-to-Speech)


CORE INTERACTIONS

  • Handset Detection

  • User Speech Input

  • AI-Generated Response

  • Audio Playback

  • Session Termination


ASSEMBLY STEPS

Step 1: Disassemble the Rotary Phone

Step 2: Install the Raspberry Pi

Step 3: Set Up Handset Detection

Step 4: Position the Microphone and Speaker

Step 5: Connect Audio Components

Step 6: Develop and Deploy Software


CODE LINEUP

  • Configure API key

  • Audio settings for output

  • Detect when phone is picked up

  • Functions

    • AI prompts user with dial options

    • Dial

    • Choose AI depending on dial choice

    • Prompt introduction

    • Record audio

    • Transcribe audio

    • Generate response

    • Text-to-Speech

    • Call interaction

  • Loop until interaction ends



 
 
 

コメント


bottom of page