Experimenting With AI

TechnologyAIGoogle GeminiChatGPTBloggingAI Images

Jul 1

For the last few months I have being using Google Gemini. I subscribed so I could access a wider spectrum of services and evaluate it effectively. I also wanted to dispel some of the hysteria that accompanies any conversation about Large Language Models and Artificial Intelligence. The results were initially interesting and Google Gemini proved useful when researching material for blog posts. It is good at providing summaries about well documented subjects. I wrote a post about food banks recently and it distilled a lot of data into a manageable amount, which was ideal for my article. Google Gemini has also produced some visual basic scripting and simple batch files that have allowed me to automate processes on my PC. Such as changing the default monitor, or screen resolution at the click of an icon. However, I found this AI struggled with in-depth software troubleshooting and that its ability to create images varies greatly depending on the subject.

The first major adjustment I had to make while using Google Gemini is that I needed to be very specific in how I phrased my question. Google Gemini is not intuitive and doesn’t always proactively ask you for additional data. It will take the prompt provided and doggedly work within its confines. You then realise that it needs more information and thus end up micromanaging the process, as it attempts to solve your problem. It would get to the right solution eventually but the amount of time and effort I would have to make with my continuous interventions and data updates, some what took the shine off the end results. This became very apparent when I tried to finally solve a long term problem regarding intermittent connectivity issues between my PC and Fire HD10 tablet, when using a USB cable. Appraising me of the failings of Microsoft Media Transfer Protocol without giving any useful suggestions within the parameters I’ve asked isn’t helpful.

Ecthelion Fighting Gothmog Google Gemini — A rather generic fantasy image depicting Ecthelion Versus Gothog, created via Google Gemini

Another issue that quickly became noticeable was that of image generation and the disparity in quality between specific types. On one occasion I asked Google Gemini to create an image specifically for a blog post, because I couldn’t find a comparable one online. I wanted a picture of a group of demonstrators outside parliament talking to a news reporter and crew. I eventually gave a detailed prompt and found this replication of a common place event to be satisfactory. However, when I gave detailed instructions for a fantasy based image, I found the results to be somewhat formulaic and lacklustre. Google Gemini seems to work best when drawing from established data and images. When called upon to do something inherently creative it doesn’t fare so well. Furthermore, if you draw attention to any mistake or confusion arising from your interactions with Google Gemini, the AI becomes excessively apologetic to the point of obsequiousness.

It was ultimately this lack of intuition and proactivity that made me cancel my subscription to Google Gemini. Using it was too labour intensive and required me to constantly oversee the proceedings. For me that mitigated some of the benefits inherent in using such a tool. Hence I moved on to ChatGPT and decided to proactively determine if it had the same failings as it competitor. I asked it whether it felt it was intuitive and likely to seek clarification on its own accord rather than just colour “within the lines”. It confidently stated that it was a more intuitive by default and immediately proved so when I set it the same problem that Google Gemini had laboured over to no avail. ChatGPT not only solved the problem with my tablet but gave me multiple solutions. It also displays its thinking process as it works through a problem, which is both useful and intriguing. Its creative artwork is superior, or has at least proven so for the present.

Numenor ChatGPT — A map of Númenor created via ChatGPT

AI in various shapes and forms is a reality that is no going to go away. That being said, I do not think it is the universal solution that many think that it is at present and I suspect that there will be several high profile failures in the months to come that will do much to reset both business and the public’s expectations. Sadly, I also think there will have to be an adjustment around the wider issue of those who directly affected by the use of AI. If I ever have to commission any further design work for my website or the branding associated with it, I will always hire a professional. However, for minor work, AI will suffice. For example I wanted a detailed, high resolution map of Númenor and gave the task to ChatGPT. The results were fine. However, this never was a paying job to begin with. A few years ago, before AI became commonly available, I would have just gone without or drawn it myself. The fact that a tool now exists to do it for me, means I will use it.