Forum: TFSI

I tried Google's text-to-image AI and was shocked by the results

From TechnologyDaily@1337:1/100 to All on Sun May 29 14:15:04 2022

I tried Google's text-to-image AI and was shocked by the results

Date:
Sun, 29 May 2022 13:00:00 +0000

Description:
Googles new text-to-image AI, Imagen, can generate images from text with an astonishing level of fidelity, but its not without its limitations.

FULL STORY ======================================================================

Text-to-image artificial intelligence programs arent anything new. Indeed, existing neural networks like DALL-E have impressed us with their ability to generate simple, photorealistic images from brief yet descriptive sentences.

But this week I was introduced to Imagen . Developed by Google Researchs
Brain Team, Imagen is an AI similar to that of DALL-E and LDM. However, Brain Teams aim with Imagen is to generate images with a greater level of accuracy and fidelity, using that same short and descriptive sentence method to create them.

An example of such sentences would be as per demonstrations on the Imagen website A photo of a fuzzy panda wearing a cowboy hat and black leather jacket riding a bike on top of a mountain. Thats quite a mouthful, but the sentence is structured in such a way that the AI can identify each item as
its own criteria.

The AI then analyzes each segment of the sentence as a digestible chunk of information and attempts to produce an image as closely related to that sentence as possible. And barring some uncanniness or oddities here and
there, Imagen can do this with surprisingly quick and accurate results.
Imagen can draw better than me. (Image credit: Google / Imagen) A little too wholesome?

If youve checked out Imagen or other neural networks for yourself, then youve probably noticed the overwhelming focus on a select few subjects. DALL-E, for example, likes to create images based on everyday household items, like
clocks or toilets. Imagen, at least for now, seems to put cute animals at the forefront of its image generation capabilities. But theres actually a very good reason for this.

Googles Brain Team doesnt shy away from the fact that Imagen is keeping
things relatively harmless. As part of a rather lengthy disclaimer, the team is well aware that neural networks can be used to generate harmful content like racial stereotypes or push toxic ideologies. Imagen even makes use of a dataset thats known to contain such inappropriate content.

While a subset of our training data was filtered to remove noise and undesirable content, such as pornographic imagery and toxic language, Brain Team notes, we also utilized LAION-400M dataset which is known to contain a wide range of inappropriate content including pornographic imagery, racist slurs, and harmful social stereotypes.

Imagen relies on text encoders trained on uncurated web-scale data, and thus inherits the social biases and limitations of large language models. This was one of the less uncanny photos I was able to generate with Imagen. (Image credit: Google / Imagen)

This is also the reason why Googles Brain Team has no plans to release Imagen for public use, at least until it can develop further safeguards to prevent the AI from being used for nefarious purposes. As a result, the preview on
the website is limited to just a few handpicked variables.

Ultimately, its the right call. There have been examples in the past of AI programs being unleashed onto the online public with extremely undesirable results. You may remember Microsofts Tay, an AI Twitter account brought to
the social media platform roughly five years ago.

Tay was a pretty ballsy experiment on Microsofts part. Its intention was to see how an AI would react to and interact with real people in a social media environment. However, within hours, Tay went from a wholesome chatbot to a dispenser of anti-semitic talking points. This was despite the bot being modeled, cleaned and filtered according to Microsoft (thanks, The Verge ).

Given the precedent set by AI like Tay, then, its easy to see why Imagen has been reigned in. Clearly, even extensive filtering might not be enough. Still far from perfect

While I was immensely impressed by Imagen, and had a lot of fun mixing and matching sentences to create all kinds of bizarre pictures, its definitely
not something Id consider to be overwhelmingly convincing. At least not for the time being.

More often than not, Imagen returned some frighteningly hilarious results. Animals, in particular, often appeared with all kinds of wacky proportions. Seeing a raccoon with a massive head, or human-like girthy arms gripping a bikes handlebars was a pretty common sight. While very funny, these peculiar results blended with the photorealism often churned out disturbingly uncanny results.

The option to generate an oil painting was actually a good deal more convincing, and most of what Imagen was able to produce here wouldnt look out of place in a school project. And I mean that in the nicest possible way. As it turns out, a Persian cat strumming a guitar translates far more convincingly to a painting than it does a realistic photo.

As noted, its highly likely we wont get a public release of Imagen anytime soon. Or ever, for that matter. The risks posed by AI programs and neural networks being able to generate unsavory content are still far too great. For now, though, Im content with Imagen being a fun little curio for those
looking to spend a bit of time generating funny cowboy hat-wearing animals skateboarding down a mountain.

======================================================================
Link to news story: https://www.techradar.com/news/i-tried-googles-text-to-image-ai-and-was-shocke d-by-the-results/

--- Mystic BBS v1.12 A47 (Linux/64)
* Origin: tqwNet Technology News (1337:1/100)

Who's Online
Recent Visitors
- CyberNix
  Sun Jan 18 19:24:59 2026
  from London, UK via SSH
- CyberNix
  Fri Jan 16 17:43:24 2026
  from London, UK via Telnet
- CyberNix
  Fri Jan 16 17:40:38 2026
  from London, UK via Telnet
- Guest
  Wed Dec 17 04:43:01 2025
  from Tor via Telnet

System Info

Sysop:	CyberNix
Location:	London, UK
Users:	22
Nodes:	10 (0 / 10)
Uptime:	272:51:40
Calls:	911
Files:	5,306
D/L today:	29 files (15,185K bytes)
Messages:	787,786

I tried Google's text-to-image AI and was shocked by the results

Who's Online

Recent Visitors

System Info