Text to Image

Estimated reading: 3 minutes

Text to Image

The Text to Image Wonder Tool allows you to generate high-quality images directly from a text prompt.

By describing an object or character in text, the system interprets your input and generates an image, which is displayed in the Viewer. This provides greater flexibility and enables a wider diversity of visual outputs. Once generated, the image can be previewed, regenerated, edited, and used as the input for 3D model generation.

Flux

How It Works

  • Enter a text prompt describing the object or character you want to generate.
  • The system processes the prompt and creates an image based on your description.
  • The generated image is displayed in the Viewer.
  • Optionally, you can regenerate the image or edit it further by describing the changes you want to apply.
  • The image remains available in the Viewer and can be used to generate a 3D model.

Limitations

  • The quality of the generated image depends on the clarity and precision of the text prompt.
  • Extremely complex, ambiguous, or overly abstract descriptions may result in incomplete or noisy outputs.
  • Dynamic poses, extreme motion, or overlapping elements may reduce the usability of the resulting 3D model generated from the image.

Tips for Best Results

  • Describe one main object or character per prompt.
  • Use clear and literal descriptions rather than cinematic or narrative language.
  • For characters intended for rigging, prefer neutral poses with separated limbs.
  • Specify materials, surface details, and proportions when relevant to improve visual and texturing quality

Nano Banana

Multiple model versions are available, including Nano Banana, Nano Banana Pro, and Nano Banana 2, each offering different strengths in terms of style, detail, and consistency.

While all models deliver high-quality results, there are some differences to consider. Nano Banana provides faster generation and is suitable for quick iterations, whereas Nano Banana Pro and Nano Banana 2 generally produce more visually refined and detailed outputs.

How It Works

  • Enter a text prompt describing the object or character you want to generate.
  • The system processes the prompt and creates an image based on your description.
  • The generated image is displayed in the Viewer.
  • Optionally, you can regenerate the image or edit it further by describing the changes you want to apply.
  • The image remains available in the Viewer and can be used to generate a 3D model.

Limitations

  • The quality of the generated image depends on the clarity and precision of the text prompt.
  • Extremely complex, ambiguous, or overly abstract descriptions may result in incomplete or noisy outputs.
  • Dynamic poses, extreme motion, or overlapping elements may reduce the usability of the resulting 3D model generated from the image.

Tips for Best Results

  • Describe one main object or character per prompt.
  • Use clear and literal descriptions rather than cinematic or narrative language.
  • For characters intended for rigging, prefer neutral poses with separated limbs.
  • Specify materials, surface details, and proportions when relevant to improve visual and texturing quality