Despite its name, the infrastructure used by the “cloud” accounts for more global greenhouse gas emissions than commercial flights. In 2018, for instance, the 5bn YouTube hits for the viral song Despacito used the same amount of energy it would take to heat 40,000 US homes annually.
Large language models such as ChatGPT are some of the most energy-guzzling technologies of all. Research suggests, for instance, that about 700,000 litres of water could have been used to cool the machines that trained GPT-3 at Microsoft’s data facilities.
Additionally, as these companies aim to reduce their reliance on fossil fuels, they may opt to base their datacentres in regions with cheaper electricity, such as the southern US, potentially exacerbating water consumption issues in drier parts of the world.
Furthermore, while minerals such as lithium and cobalt are most commonly associated with batteries in the motor sector, they are also crucial for the batteries used in datacentres. The extraction process often involves significant water usage and can lead to pollution, undermining water security. The extraction of these minerals is also often linked to human rights violations and poor labour standards. Trying to achieve one climate goal of limiting our dependence on fossil fuels can compromise another goal, of ensuring everyone has a safe and accessible water supply.
Moreover, allocating significant energy resources to tech-related endeavours can lead to shortages for essential needs such as residential power supply. Recent data from the UK shows that the country’s outdated electricity network is holding back affordable housing projects.
In other words, policy needs to be designed not to pick sectors or technologies as “winners”, but to pick the willing by providing support that is conditional on companies moving in the right direction. Making disclosure of environmental practices and impacts a condition for government support could ensure greater transparency and accountability.
But they don’t. Right now the GPUs powering every console, gaming PC, developer workstation, graphic artist’s rig, Twitch stream, YouTube recap, etc. consume far, far more power than LLM training.
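A rough back-of-envelope sketch of that comparison; every figure here (fleet size, wattage, hours of use, training-cluster size) is an illustrative assumption, not measured data:

```python
# Back-of-envelope comparison: aggregate consumer GPU use vs one LLM training run.
# All numbers below are assumptions chosen for illustration only.

# Consumer side: assume ~200 million active gaming PCs/consoles worldwide,
# each drawing ~150 W at the GPU for ~2 hours a day on average.
consumer_devices = 200e6
avg_gpu_watts = 150
hours_per_day = 2
consumer_kwh_per_day = consumer_devices * avg_gpu_watts * hours_per_day / 1000

# Training side: assume a large run uses ~10,000 accelerators at ~400 W each, 24h/day.
training_gpus = 10_000
training_watts = 400
training_kwh_per_day = training_gpus * training_watts * 24 / 1000

print(f"consumer GPUs: ~{consumer_kwh_per_day / 1e6:.0f} GWh/day")
print(f"one training run: ~{training_kwh_per_day / 1e6:.3f} GWh/day")
print(f"ratio: ~{consumer_kwh_per_day / training_kwh_per_day:.0f}x")
```

With those assumed inputs the consumer fleet comes out roughly two to three orders of magnitude larger per day than a single training run; change the assumptions and the ratio moves, but the shape of the comparison is the point.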
And LLM training is still largely being done on GPUs, which aren’t designed for it, as opposed to NPUs that could do it more efficiently at the chip level.
I understand the idea that AI training will always inherently consume power, because you can always train a model on more (or bigger) data, or with more parameters, but most uses of AI are not training; they’re just users using an existing trained model. Google’s base search infrastructure also took a lot more carbon to build initially than is accounted for when they calculate the carbon cost of an individual search.
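To illustrate the training-versus-inference point, here is a toy amortisation sketch; the training energy, query volume and per-query cost are all assumptions picked for illustration, not measurements:

```python
# Toy sketch: amortising a one-off training cost over inference queries.
# All figures are assumptions for the sake of the argument.

training_energy_kwh = 1_300_000   # assumed one-off cost of a GPT-3-scale training run
queries_per_day = 10_000_000      # assumed daily inference volume once deployed
energy_per_query_kwh = 0.003      # assumed cost of serving a single query
days = 365

inference_energy_kwh = queries_per_day * energy_per_query_kwh * days
amortised_training_wh_per_query = training_energy_kwh / (queries_per_day * days) * 1000

print(f"inference over a year: {inference_energy_kwh:,.0f} kWh")
print(f"training amortised per query: {amortised_training_wh_per_query:.3f} Wh")
```

Under these assumptions, a year of serving queries uses several times the energy of the training run itself, and the amortised training share of any single query is a fraction of a watt-hour, which is the same accounting issue as the Google-search example above.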
Fair enough - I just hope the advancements in AI do not outpace our capabilities in producing better hardware for the job, and that the hardware left behind finds good use in other tasks.
Because otherwise it will grow more and more into a huge ecological problem.