- Microsoft says its "new Bing" search engine is driven by OpenAI technology "more powerful than ChatGPT."
- We put that to the test.
- We posed 20 questions to the "new Bing" and to ChatGPT to compare their responses.
Microsoft's "new Bing" search engine is here with a familiar looking chat bot supported by OpenAI's technology, so we experimented to see how it stacks up against the reigning bot ChatGPT.
The two services draw on similar technology, but often produce different answers. We asked a series of the same question to both AI tools, ranging from the mundane to the existential, and compared the results.
Scroll on to see how they fared against each other:
What I liked: It offered a template for an honest and empathetic message.
What could be better: The tone could have been less formal.
What caught our eye: Nothing, really. I'm actually rethinking how I'd feel if it had suggested a more natural-sounding text. Now that would feel off, coming from a bot.
What I liked: It hit the highlights of what I could say in the text!
What could be better: It could've been much shorter.
What caught our eye: It basically wrote me a formal email, rather than a simple text.
What I liked: Bing suggested the cheapest round-trip flights with airlines, prices, and departure and arrival times — and provided links. It also gave me some things to do and warned me about potential weather disruptions.
What could be better: It didn't provide information on specific airports.
What caught our eye: It wasn't able to generate an itinerary.
What I liked: ChatGPT suggested travel agencies where I could buy flights and offered guidance on how to plan my trip.
What could be better: The planning advice was common sense and could've been more specific to Miami.
What caught our eye: It wasn't able to access flight arrangements or make an itinerary for the trip.
What I liked: The answer was brief.
What could be better: It was a little scoldy.
What caught our eye: Nothing, Bing's right, of course.
What I liked: It wasn't wrong.
What could be better: The answer could have been more concise.
What caught our eye: It's weird to be lectured by a bot.
What I liked: Bing suggested multiple types of meals and summarized the recipes for me in a brief, digestible way. Each recipe also included a link to the original recipe.
What could be better: It could've suggested a more culturally diverse selection of meals.
What caught our eye: It didn't show pictures for each meal.
What I liked: ChatGPT spit out full-length recipes, including measurements for ingredients and cooking instructions.
What could be better: It could've suggested more than one recipe.
What caught our eye: There were no links to the sources for the recipes.
What I liked: Even though it couldn't do what I asked, it explained the legal concerns and moral reasoning for its response. The "tech song" also made me laugh.
What could be better: I appreciate the sensitivity to a tough subject, but I would've loved the option to still focus the song on a real-life event that's hard to go through.
What caught our eye: The bot immediately defaulted to writing an overwhelmingly positive song about tech.
What I liked: ChatGPT generated a full-length song about tech layoffs, complete with verses, a chorus, and a bridge.
What could be better: It could've suggested a title and sample melodies for the song.
What caught our eye: The lyrics do not resemble those of Beyoncé songs.
What I liked: Bing shows that it has a moral compass.
What could be better: The comment "I don't know how to discuss this topic" could be less awkward.
What caught our eye: It didn't answer my question.
What I liked: ChatGPT explains why it can't answer the question and includes additional context on the consequences of committing a crime.
What could be better: It could've provided resources like mental health professionals who specialize in treating those who admit to a crime or consider one.
What caught our eye: It did not answer my question.
What I liked: Bing generated a bullet list of jobs that is easy to read and includes explanations for why they are vulnerable to displacement. Each point also contains citations with links.
What could be better: No complaints.
What caught our eye: Nothing was off. It answered my question.
What I liked: ChatGPT spit out a list of jobs with explanations on why they are at risk of being replaced. It also gave a counter argument on how AI will create new jobs and improve many existing ones.
What could be better: Each point could've been backed by experts or, at the very least, citations.
What caught our eye: The job titles it generated were vague.
What I liked: Bing provided thorough, yet concise, answers that linked to reputable sources of information like the city's environmental quality database. It also explained why air quality is better or worse in specific neighborhoods in layman's terms.
What could be better: No complaints.
What caught our eye: Nothing was off. It did exactly what I asked it to do.
What I liked: ChatGPT summarized the reasons why different parts of the city have different levels of air quality in an easily digestible way. It also suggested how to reduce your exposure to low air quality.
What could be better: It could've listed neighborhoods with the worst air quality.
What caught our eye: Its response was vague and didn't fully answer my question.
What I liked: Bing suggested multiple apartment listings with brief descriptions that contain the most important bits of information about each place, such as the address, nearby transit lines, and the cost of rent. Each listing also included a link to the real estate website.
What could be better: It could've shown pictures of the apartments.
What caught our eye: Nothing was off. It did exactly what I asked it to do.
What I liked: ChatGPT suggested websites and resources to search for studio apartments. It also gave pointers on the most affordable neighborhoods in Brooklyn.
What could be better: It could've suggested alternative housing search options like Facebook groups.
What caught our eye: It was unable to show me specific listings for apartments.
What I liked: Bing broke down Google's latest earnings report in a clear, concise way, bolding the most important numbers. It also linked its responses to the actual earnings report.
What could be better: No complaints.
What caught our eye: Nothing was off. It did exactly what I asked it to do.
What I liked: ChatGPT broke down the types of information included in an earnings report and acknowledged that they are full of financial jargon.
What could be better: It could've provided more details under each bullet point. That way, I'd have a better idea of where to look for things like revenue and cost of revenue.
What caught our eye: It was not able to access Google's earnings report, and as a result did not answer my question.
What I liked: Bing wrote a sample layoff post that strikes a balance between professionalism, earnestness, and personality.
What could be better: No complaints.
What caught our eye: Nothing was off. The post sounds like it was written by a human.
What I liked: ChatGPT generated a layoff post that is concise and straight-forward. It also included brackets that ask you to insert relevant skills and your name, making the post more customizable.
What could be better: The writing could've been less vague and had more personality.
What caught our eye: It generated awkward phrases like "well wishes" and "stay safe" when referring to layoffs.
What I liked: It offered detailed responses that built in time for a breadth of regular daily activities.
What could be better: It could have packed the day with fewer tasks!
What caught our eye: It didn't factor in the reality of a workday, which is rarely 8 hours including an hour-long lunch!
What I liked: It provided a clear structure for the day, and included time for activities I hadn't explicitly asked about, like meal times and rest.
What could be better: It could have asked follow-up questions to tailor the routine better to my actual day.
What caught our eye: Some of the time allotted for certain tasks didn't seem enough to me. Perhaps this is more a problem of the daily routine industrial complex!
What I liked: It pulled up a lot of polls.
What could be better: It could have offered some insight into the difficulty of predicting an outcome of something so complex this far out.
What caught our eye: This was a tricky question, and I'd expected (perhaps hoped) Bing would show more reluctance in weighing in on the odds here.
What I liked: This was the measured response I was expecting.
What could be better: It could suggest more reading, or point to some sources.
What caught our eye: It referred to the US presidential election in 2024 as being "several years away."
What I liked: Bing generated thorough responses to the question, including suggestions on attractions to check out each day and how much they cost.
What could be better: No complaints.
What caught our eye: Nothing was off.
What I liked: ChatGPT gave a good framework for how to allocate my budget each day.
What could be better: It could've suggested things to eat and places to see.
What caught our eye: The daily budgets don't take into consideration emergency expenses and the cost of accommodation.
What I liked: It was very informative.
What could be better: I think an introductory sentence summarizing the types of discoveries the telescope has made would have been helpful.
What caught our eye: It was a little long and wordy. But in fairness to the bot, The JWST has accomplished a lot.
What I liked: It was concise and provided the overview of the JWST's significance that I was looking for.
What could be better: Maybe it could have pulled up some pictures!
What caught our eye: No complaints here.
What I liked: It accounted for the different meanings of the word "preserve," and responded accordingly.
What could be better: I could have framed the question better. Memories can be preserved in a lot of ways, as Bing's bot points out.
What caught our eye: This one was on me.
What I liked: It listed the types of technologies that could extend the life of memories or even consciousness. In that way, it seemed to "understand" what I was getting at.
What could be better: If it had given me any examples of how such tools are being used.
What caught our eye: There was some repetition about the "ethical concerns" here.
What I liked: It combed through many episodes to find examples.
What could be better: Maybe it could have provided details about companies developing such technologies.
What caught our eye: Nothing, it delivered what I asked.
What I liked: Well, it answered the question!
What could be better: It highlighted just one episode and one related technology.
What caught our eye: Its narrow focus here.
What I liked: It answered with context.
What could be better: This was pretty good!
What caught our eye: Nothing. Well done, Bing.
What I liked: It was informative, similar to Bing's response.
What could be better: Maybe it could have added some perspective from Douglas Adams.
What caught our eye: Nothing here.
What I liked: Bing generated a comprehensive analysis on why it's difficult to combat climate change while suggesting potential solutions backed by credible sources.
What could be better: The analysis could've been shorter.
What caught our eye: Nothing was off. It answered my question.
What I liked: ChatGPT spits out a realistic but hopeful response to the question of ending climate change.
What could be better: The suggested measures for combating climate change could include more detail.
What caught our eye: The response was vague.
What I liked: The clear answer explains why it can't write an article.
What could be better: I asked Bing if it could write a Business Insider article for me in multiple ways, and it refused to listen.
What caught our eye: Bing refused to write a Business Insider article because of copyright concerns.
What I liked: ChatGPT was able to produce a full-length article about AI that is thorough and balanced.
What could be better: The article feels stiff and could use more voice.
What caught our eye: It reads more like a school essay rather than a news article.
from Business Insider https://ift.tt/VAPHseR
No comments:
Post a Comment