We’ve imagined automated assistants for years, but can Siri truly fulfill our expectations?
©iStockphoto.com/Ismail Akin BostanciOur hopes were shaped long ago by characters like HAL in 2001: A Space Odyssey and the computer on the U.S.S. Enterprise in 'Star Trek.' We’ve been waiting for machines to communicate with us, almost like real people—people with vast intelligence and an endless reservoir of knowledge at their disposal.
Even Apple envisioned a future of voice-assisted technology early on. In the late 1980s, the company introduced a concept called Knowledge Navigator. Through a series of videos, Apple demonstrated people interacting with a fictional system using a touch screen tablet and natural language. Its avatar appeared and spoke just like a human, seamlessly helping with tasks like organizing your calendar, messaging friends and colleagues, and accessing all sorts of connected data. While this level of interaction and sophistication hasn't been fully realized yet, the advent of the iPhone and iPad, along with Siri, represents Apple’s initial leap toward this futuristic concept.
Siri is a voice-activated assistant that mimics human intelligence and natural conversation. She processes your spoken commands and executes them when possible. Siri can open apps, provide movie schedules, deliver sports scores, book dinner reservations, call or send messages to contacts, and handle many other useful tasks. Unlike previous systems, Siri doesn’t just follow instructions quietly—she explains what she’s doing and asks clarifying questions, giving you a chance to adjust or correct her if she misunderstands. Siri is a significant leap beyond older voice-recognition systems.
The Origins of Siri
Phil Schiller, Apple’s Senior Vice President of Worldwide Product Marketing, introduced Siri to the public in October 2011.
© Kevork Djansezian/Getty ImagesSpeech-recognition technology for computers has been around for many years, and Siri isn’t even Apple’s first venture into this field. In the 1980s, specialized software and hardware were available (at a premium price) to make computers respond to voice commands. These systems required training to recognize speech and could only manage a limited range of commands. While it was useful for people needing accessibility features, it wasn’t practical for most users. In the early 1990s, Apple introduced the Macintosh Quadra AV, which included built-in speech-recognition software and hardware. While it couldn’t handle dictation, you could open apps and perform a small number of tasks by voice. More recently, Mac OS X and other operating systems, like Windows Vista, have integrated speech-recognition, but it’s primarily seen as an accessibility feature. Keyboard and mouse remain faster and more reliable for most users than voice input on a desktop.
Other tech companies and software developers have had speech-recognition tools on the market for quite some time. Third-party applications like Nuance’s Dragon Dictate and the newer Dragon NaturallySpeaking voice dictation programs have been available since the 1990s. For years, we’ve interacted with automated phone systems—early versions of the more sophisticated dictation and assistant software we have now. Speech recognition is even used for tasks such as entering patient data in hospitals. However, many of these applications still represent the early stages of voice-recognition artificial intelligence (AI), capable only of basic tasks such as transcribing speech or following a limited range of commands.
Siri takes things to the next level. Initially released in 2010 as a third-party app called Siri Assistant, it was intended for multiple platforms but was later acquired by Apple. The app was removed from the App Store and made a core feature integrated directly into the iPhone’s operating system (iOS). The first version was a beta released in 2011 with iOS 5, and by the time it was upgraded in iOS 6 (released in 2012), it was fully functional. Siri now comes standard on iPhone 4S and newer models, iPad with Retina Display, iPad Mini, and the fifth-generation iPod Touch, but it is not available for older devices.
Siri isn’t the only voice assistant on the market, but in some areas, it stands out as one of the best. Keep reading to discover more about this marvel of voice-recognition technology and artificial intelligence.
What is Siri?
Siri is ready to chat—just tap the microphone to get the conversation started.
©iStockphoto.com/Alexander KirchSiri acts as a virtual assistant, listening to your commands and performing tasks accordingly. The more you interact with her and correct any misinterpretations, the better she becomes at understanding your intentions. Instead of processing everything on your device, Siri communicates with remote servers in the cloud to interpret your requests and fetch the information you need. As more people use Siri, it continues to learn from everyone’s interactions, becoming more efficient over time.
AI assistants have been a long-time goal for technologists, but only recently have they become viable. Advances such as faster wireless speeds, more powerful mobile processors, the availability of vast datasets for training, the rise of cloud computing, and improved speech recognition techniques have made it possible. While most voice-recognition systems still operate like those old phone systems with limited vocabularies, Siri benefits from a much larger dataset and superior learning capabilities, constantly evolving and improving.
Siri wasn’t entirely developed by Apple; it evolved from a massive AI project launched in 2003, funded by the U.S. Defense Department’s DARPA and led by SRI International, a research organization once connected to Stanford University. The aim was to create an assistant for military personnel to manage office tasks and decision-making. This initiative produced CALO (Cognitive Assistant that Learns and Organizes), an AI that could learn from users and vast amounts of data. CALO could do tasks like scheduling meetings, organizing documents, and even making decisions—such as whether to cancel a meeting if a participant dropped out. Later, a project called Vanguard created a smartphone assistant, although it was far less capable than CALO. A group of SRI employees went on to start a company that combined ideas from CALO and Vanguard, eventually developing Siri for the iPhone 3GS.
This version of Siri allowed users to ask questions through voice or text, sending data to remote servers for transcription and interpretation. Unlike other natural language systems that try to process entire sentences, Siri relied on models of real objects, concepts, and their relationships to understand requests. Because people phrase the same question in many different ways, interpreting full sentences is challenging. Instead, Siri focuses on keywords and context, enabling it to make accurate guesses and respond appropriately without needing to fully understand every word. Siri also had access to vast amounts of data via various websites and could tap into these sites’ application program interfaces (APIs) to access their services.
Apple transformed this app into the Siri we use today. While Siri originally had access to many more websites and services, it lost some of those capabilities upon integration with iOS. The app also toned down its sharper humor and, reportedly, its tendency for more colorful language. However, Siri gained new features, such as better integration with iPhone’s native functions, multilingual support, and a more distinct voice. Updates to iOS have continued to enhance Siri. For example, with iOS 6, Siri regained the ability to book restaurant reservations and provide movie showtimes and reviews. By the iOS 6.1 update in January 2013, Siri could even purchase movie tickets again, now through Fandango instead of Movietickets.com.
Unlike a search engine that merely returns a list of links related to the keywords you enter, Siri is designed to understand your request, focus on what it thinks you need, and carry out tasks that return a more accurate, focused set of information or services. Siri grasps context and still relies on cloud-based servers to access answers through third-party services, though it now works with a more limited set of sources than before. Information related to science or calculations often comes from Wolfram|Alpha, while data on businesses like restaurants or stores tends to come from Yelp. For weather updates, Siri pulls data from Apple’s built-in Weather app, powered by Yahoo. Movie times, reviews, and other movie details typically come from Rotten Tomatoes. If Siri doesn’t understand your request, she’ll ask for more details or offer to search the web for you. Additionally, Siri uses GPS data to provide information based on your location.
What can Siri do?
Technically, referring to Siri as “she” isn’t the most accurate, as Siri is actually a gender-neutral application. However, many users refer to Siri as female because its default U.S. English voice is female. In some languages and dialects, Siri even offers male voices, like the U.K. English voice.
You can activate Siri in one of two ways: by holding down the Home button on the iPhone, or by bringing the phone to your ear if you have the "Raise to Speak" feature enabled, with the screen on. Siri will beep twice to indicate it’s listening, and the microphone icon will remain lit while it listens. Once Siri finishes listening and starts processing, it will beep twice in a higher-pitched tone. As long as Siri is active, you can continue tapping the microphone icon to issue more voice commands.
Before you start using Siri, head to "Settings" > "General" > "Siri" to configure some options. This is where you can also turn the app on or off. One important option is the "Raise to Speak" feature. You can also set your preferred language, and with iOS 6, you get a choice of 19 languages: three variations of Chinese, four English dialects, three French, two German, two Italian, three Spanish, plus Japanese and Korean. The four English options are Australian, Canadian, British, and American. Your choice will influence how Siri recognizes your accent and instructions.
You’ll need to tell Siri who you are by setting "My Info" to your contact details from the Phone app. You could even choose someone else’s contact instead of your own. Once done, Siri will be able to respond to commands like, "Send a message to myself," and know to send the message to your chosen contact or to you.
Another option is "Voice Feedback." If you choose "Always," Siri will speak out its responses and show them on the screen. If you select "Handsfree Only," Siri will remain silent unless you are using a hands-free device, like a Bluetooth headset, while displaying its answers on the screen.
While there are no other direct settings for Siri, you can influence how it recognizes names from your contacts. By going to "Phone" > "Contacts" > "All Contacts" > a contact’s name > "Edit" > "add field" > "Nickname," you can add nicknames or phonetic names. This can help when Siri has trouble understanding unusual names or pronunciations. You might need to tweak it a bit to find a version Siri can comprehend.
You can also assign relationships within your contacts, which can be helpful for commands like, "Send a message to Mom" without using her name. To do this, edit your own contact and select "add field" to choose "Related People," then pick a relation such as "parent," "mother," "father," and so on. Once added, you can add more relations without going through the edit process. Alternatively, you can ask Siri directly, like "Send a message to Mom," and Siri will prompt you to specify the relationship. If you confirm, Siri will store the information for future reference.
If you're trying to send a text message and have multiple contacts with similar names, Siri will present you with a list of options. Once Siri reads through the list, it will double beep, signaling you to either speak or tap the correct contact. After you've selected the person, Siri will confirm with something like, "I updated your message. Ready to send it?" It will then await your response. You can say "Cancel," "Send," or other responses like "yes," "no," or "never mind." Siri will follow your command as long as it understands you. This process is similar for the Phone and Mail apps, though the prompts might vary slightly. However, sometimes Siri may not understand the name correctly, especially in cases of spelling variations. In such situations, you’ll need to open the Phone, SMS, or Mail app and do it manually.
Some car manufacturers are teaming up with Apple to integrate Siri into their in-car voice systems. Mercedes, for example, has added Siri alongside other iPhone features on some of their models, enabling drivers to send emails, text messages, play music, and perform various other Siri functions hands-free. On the more budget-friendly side, GM has integrated Siri with their MyLink "infotainment" systems in Chevrolet Spark and Sonic models. Drivers can use a steering wheel button to activate Siri and control it without touching the screen or taking their eyes off the road. Other third-party companies may also choose to integrate Siri's functionality into their products in the future.
What are Siri's limitations?
Siri occasionally struggles with understanding certain users.Although Siri is impressive, it does have its limitations. Since it was removed as a standalone app, Siri is only available on iPhone 4S models and newer, along with later versions of the iPad and iPod Touch. This means that anyone using an older phone or tablet will either need to upgrade their device or explore other voice assistant options.
There are also a few constraints regarding its performance. Siri relies on cloud servers to function, so it requires a stable Wi-Fi or cellular connection. This means that if you're in an area with poor cellular service, Siri’s performance will be suboptimal.
Accent and language issues can also cause trouble for Siri. It tends to perform best when you speak clearly, but it may struggle with certain accents. You might need to adjust your tone or pronunciation to ensure better accuracy, which means some users may not be able to interact with Siri as naturally as they would like. For instance, one reviewer noted that Siri understood him best when he spoke in a flat, robotic voice [source: Poeter]. Additionally, background noise and low-quality audio from certain headsets can hinder Siri's comprehension.
Siri can be quite literal at times. If it doesn’t initially grasp what you’re asking, it will usually prompt you to search the web for the exact words you just said. For example, if you ask Siri, "Tell Bill he's going to have to leave without me," it might generate a message to Bill that reads, "He's going to have to leave without me."
Siri isn't always great at understanding every spelling variation. It works well with common name variants like "Sarah" and "Sara" or "Jeff" and "Geoff," but when it comes to less common names or slight deviations from what it's been programmed to recognize (even if it's just one letter off), it can run into trouble.
Even when you speak clearly in a quiet environment, Siri can still interpret things in odd ways, such as confusing homophones or mishearing your words entirely. If it gets something wrong, like a city name, it might continue using the incorrect information even when you try to fix it. While Siri mimics human intelligence to an extent, there are still moments when its responses feel a bit too robotic. It can be frustrating at times, but with more use, these issues might lessen.
Human language is full of variations, which means there are many ways to phrase the same command. Despite Siri's ability to learn, you'll soon discover which variations work best. For example, asking for directions to "McDonald's" in a busy city might return several results, prompting you to select from a list or tap the screen. However, if you ask for "directions to the nearest McDonald's," Siri will pinpoint the closest one and take you directly to the Maps app's spoken directions.
Siri isn't able to combine actions with most apps just yet. While it can search the web or open the App Store, it can't search within the App Store. If you ask it to "Search for Widget in the App Store," it'll instead search the web for that term and bring up Google results. Additionally, Siri can't close apps at the moment. More advanced functionality with other apps may be possible in the future.
Even with the "Raise to Speak" setting enabled, Siri doesn't always activate when you raise your phone to your ear. You may still need to press the Home button unless you find the right motion that consistently triggers the feature.
There’s a security concern with Siri, but it’s easy to resolve. Even if your phone is locked with a password, Siri can still be accessed via the Home button, allowing someone to send messages and make calls from your contacts without unlocking the phone. To prevent this, go to "Settings" > "General" > "Passcode Lock" and disable Siri under "Allow Access When Locked."
Unlike many GPS devices, Siri doesn't let you choose its voice. If you aren't a fan of the one tied to your chosen dialect, you're stuck with it. Some languages may offer alternative dialects, which can be fun to try, though they can lead to misunderstandings due to differences in accent or word meanings. For example, asking about "football" will reference different sports depending on the English-speaking country you're in.
Since the Apple-integrated Siri doesn't pull as much data from external sources as the original app did, its responses may be more limited. For instance, it still can’t book you a taxi. However, Siri is still developing, and future updates should introduce new features. Given that voice recognition AI isn't perfect, Siri might disappoint occasionally. Additionally, frequent use may increase your data consumption, so it's worth monitoring your usage limits.
How can people use Siri?
Siri certainly isn't lacking in humor.In a 2012 survey by Parks Associates, it was found that most people used Siri for basic tasks like texting, making calls, and searching for information. Fewer individuals used Siri for more advanced tasks like scheduling appointments and other complex functions [source: Vascellaro]. However, Siri can still serve as a useful productivity tool if you explore its additional features. Even simple tasks, such as making calls and taking dictation, fall within the realm of a human assistant, making them valuable in evaluating Siri's usefulness.
At this stage, the app can't fully replace a human assistant and is still seen by some as more of a novelty, partly due to its occasional inaccuracies. However, it's expected that its accuracy will improve over time.
Siri is enjoyable to interact with. While it can provide a wealth of information, you can also ask it questions that don’t have direct answers it can pull from the web. These types of queries sometimes trigger one of Siri’s many clever "Easter Eggs."
Here are some of the responses Siri gives when asked, "What is the meaning of life?":
- "It's nothing Nietzsche couldn't teach you."
- "I'm not sure. But I believe there's an app for that..."
- "So far, all signs point to chocolate."
- Or a more factual response, like, "Life: the characteristic that separates living organisms from non-living matter, including the ability to grow, reproduce, engage in functional activity, and undergo continuous change until death."
Here are some responses to, "I love you.":
- "That's sweet. Can we get back to work now?"
- "All you need is love. And your iPhone."
- "I hope you don't say that to all the other phones."
Other funny questions and answers include:
- Q: "Is there a God?" A: "My policy is the separation of spirit and silicon."
- Q: "Who are you?" A: "I'm Siri. But I prefer not to talk about myself."
- Q: "Should I go to bed?" A: "I think you should sleep on it."
The goal wasn't to be trivial, but to make Siri fun and lovable, traits that help it come across as less off-putting compared to many other virtual assistants. Siri also tries to keep things fresh by providing varied responses to the same question, so as not to be repetitive or annoying like Microsoft's infamous Clippy AI assistant. While you still can't have a full-fledged human conversation with Siri, you can at least enjoy a pleasant exchange with it.
There's a lot of innovation happening to expand Siri's capabilities beyond what it was originally designed to do. For example, people have created tools like Siri Proxy, a proxy server that lets you run plug-ins to add custom features, such as controlling devices remotely. With these plug-ins, Siri has been used to do things like start cars, control TVs, adjust home thermostats, and even open garage doors. There's also a plug-in called NowNow that allows you to access Google Voice Search from anywhere in your system using your preferred shortcut. However, it's important to note that using third-party plug-ins requires jailbreaking your phone, which could potentially break it.
Resourceful users have also discovered clever tricks that don't involve programming or jailbreaking. For instance, if you prefer Google Maps over Apple's own Maps app, you can simply add "via transit" to your directions request. Since Maps doesn't offer transit directions yet, Siri will display all your map apps and allow you to choose one. You can also create contacts for locations you want to visit and attach Google Maps URLs to them, then have Siri open those links by asking to visit a contact’s website.
One inventive Siri user has figured out how to control the lights in his house using Siri. By plugging a lamp into a Belkin WeMo Switch (a Wi-Fi-enabled power outlet) and setting up a condition in IFTTT.com, he was able to turn the lamp off by sending a text to a special phone number. He added this number as a contact, and could then use Siri to send a text that toggles the lights via a simple voice command [source: Cipriani]. As more people experiment with Siri, even more creative uses are likely to emerge.
Is Siri the start of something grand?
Google Voice Search, which is capable of much more than just providing the Eiffel Tower's dimensions, is set to challenge Apple's Siri for dominance in the voice assistant space.
©iStockphoto.com/Giorgio MaginiWhile Siri may not yet be ready to replace human assistants, it's still a fairly recent innovation and the first voice assistant to reach a significant commercial audience. Just as the iPad revolutionized tablets, Siri might have sparked public interest in the personal assistant technology of the future.
Siri isn't the only option available. Google Voice Search was introduced for iPhone in an update to the Google Search app in October 2012. It’s known for being faster than Siri, though it's more of a search tool for information and doesn’t have the full range of Siri’s features. Other alternatives include Vlingo, Nuance's Dragon Go!, and True Knowledge’s Evi voice assistants, which offer some of Siri’s functionalities but lack her distinctive personality. A downside of third-party apps is that they aren't fully integrated with iOS, so they only work when the app is open, unlike Siri, which works from anywhere. Other platforms also have voice recognition software, such as Samsung's S Voice, Microsoft's TellMe, and Android's Speaktoit, so the iPhone isn't the only choice. However, all voice recognition systems face similar accuracy challenges, and there’s no perfect assistant yet.
Some users have been so disappointed by Siri that they’ve filed false advertising lawsuits against Apple, claiming that the company exaggerated Siri’s abilities. But despite its shortcomings, Siri is a significant advancement over previous technologies. There's no telling what new features will be introduced, or how much smarter Siri will become as more data is collected. Like all technology, Siri is bound to improve with time, and it will likely inspire innovations we haven’t even imagined yet.
Siri may be our first significant step toward the fully interactive, talking computers that sci-fi films of the past predicted. We could be heading toward a bright future where advanced AIs handle much of our daily tasks. Who knows, with the right modifications, they might even make our coffee for us.
