[ad_1]

Google is pushing hard onartificial intelligence. In fact, soon there will be several projects of the company that will benefit from this technology, starting from the keyboard Gboard for Android devices. Specifically, the most recent beta version from the application – version 12.7.05.507749191 – contains code strings that refer to a Image Keyboard. This new option – which could be placed in the panel dedicated to keyboard shortcuts, such as GIFs – should allow you to generate images from text input.

Imagen, announced last May, will combine a deep level of text understanding with a “unprecedented degree of photorealism”. In a benchmark comparison from last year and that included VQ-GAN-CLIP, Latent Diffusion Models and DALL-E 2Google claimed that human users preferred “Imagen versus other models in side-by-side comparisons, both in terms of sample quality and image-to-text matching.” In addition, the company stated that Imagen is also more efficient in spatial relationships, long texts, rare words and more challenging suggestions.

In the end, the company added that “Our key finding is that large generic language models (eg T5), pre-trained on text-only corpora, are surprisingly effective at encoding text for image synthesis: increasing the size of the language model in Imagen results in in a dramatic increase in both sample fidelity and image-text alignment, far more than by increasing the size of the image diffusion pattern.”

[ad_2]

Source link

Leave a Reply

Your email address will not be published. Required fields are marked *