A short history of AI


The Dartmouth meeting did not mark the beginning of scientific inquiry into machines which could think like people. Alan Turing, for whom the Turing prize is named, wondered about it; so did John von Neumann, an inspiration to McCarthy. By 1956 there were already a number of approaches to the issue; historians think one of the reasons McCarthy coined the term artificial intelligence, later AI, for his project was that it was broad enough to encompass them all, keeping open the question of which might be best. Some researchers favoured systems based on combining facts about the world with axioms like those of geometry and symbolic logic so as to infer appropriate responses; others preferred building systems in which the probability of one thing depended on the constantly updated probabilities of many others.

 



The following decades saw much intellectual ferment and argument on the topic, but by the 1980s there was wide agreement on the way forward: “expert systems” which used symbolic logic to capture and apply the best of human know-how. The Japanese government, in particular, threw its weight behind the idea of such systems and the hardware they might need. But for the most part such systems proved too inflexible to cope with the messiness of the real world. By the late 1980s AI had fallen into disrepute, a byword for overpromising and underdelivering. Those researchers still in the field started to shun the term.

It was from one of those pockets of perseverance that today’s boom was born. As the rudiments of the way in which brain cells known as neurons work were pieced together in the 1940s, computer scientists began to wonder if machines could be wired up the same way. In a biological brain there are connections between neurons which allow activity in one to trigger or suppress activity in another; what one neuron does depends on what the other neurons connected to it are doing. A first attempt to model this in the lab (by Marvin Minsky, a Dartmouth attendee) used hardware to model networks of neurons. Since then, layers of interconnected neurons have been simulated in software.

These artificial neural networks are not programmed using explicit rules; instead, they “learn” by being exposed to lots of examples. During this training the strengths of the connections between the neurons (known as “weights”) are repeatedly adjusted so that, eventually, a given input produces an appropriate output. Minsky himself abandoned the idea, but others took it forward. By the early 1990s neural networks had been trained to do things like help sort the post by recognising handwritten numbers. Researchers thought adding more layers of neurons might allow more sophisticated achievements. But it also made the systems run much more slowly.
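The idea of learning from examples by adjusting weights can be sketched with a single artificial neuron. The snippet below is a toy under stated assumptions, not how modern deep networks are trained: it uses the classic perceptron update rule, and the function names, learning rate and choice of the logical AND task are all illustrative.

```python
# Toy illustration: one artificial neuron learns the logical AND function
# from four examples. No rule for AND is ever written down; only the
# weights change. All names and settings here are illustrative assumptions.

def predict(weights, bias, inputs):
    """Fire (1) if the weighted sum of the inputs exceeds zero, else stay quiet (0)."""
    total = bias + sum(w * x for w, x in zip(weights, inputs))
    return 1 if total > 0 else 0

def train(examples, epochs=20, learning_rate=0.5):
    """Nudge the weights a little each time the neuron gets an example wrong."""
    weights = [0.0, 0.0]
    bias = 0.0
    for _ in range(epochs):
        for inputs, target in examples:
            error = target - predict(weights, bias, inputs)  # -1, 0 or +1
            weights = [w + learning_rate * error * x for w, x in zip(weights, inputs)]
            bias += learning_rate * error
    return weights, bias

# Each example pairs an input with the output the neuron should produce.
AND_EXAMPLES = [((0, 0), 0), ((0, 1), 0), ((1, 0), 0), ((1, 1), 1)]

weights, bias = train(AND_EXAMPLES)
outputs = [predict(weights, bias, inputs) for inputs, _ in AND_EXAMPLES]
print(outputs)
```

After a handful of passes over the examples the neuron answers all four cases correctly. Networks with many layers rely on a more elaborate procedure for deciding how to adjust each weight, but the principle of repeated small corrections is the same.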

A new sort of computer hardware provided a way around the problem. Its potential was dramatically demonstrated in 2009, when researchers at Stanford University increased the speed at which a neural net could run 70-fold, using a gaming PC in their dorm room. This was possible because, as well as the “central processing unit” (CPU) found in all PCs, this one also had a “graphics processing unit” (GPU) to create game worlds on screen. And the GPU was designed in a way suited to running the neural-network code.

Coupling that hardware speed-up with more efficient training algorithms meant that networks with millions of connections could be trained in a reasonable time; neural networks could handle bigger inputs and, crucially, be given more layers. These “deeper” networks turned out to be far more capable.

The power of this new approach, which had come to be known as “deep learning”, became apparent in the ImageNet Challenge of 2012. Image-recognition systems competing in the challenge were provided with a database of more than a million labelled image files. For any given word, such as “dog” or “cat”, the database contained several hundred photos. Image-recognition systems would be trained, using these examples, to “map” input, in the form of images, onto output in the form of one-word descriptions. The systems were then challenged to produce such descriptions when fed previously unseen test images. In 2012 a team led by Geoff Hinton, then at the University of Toronto, used deep learning to achieve an accuracy of 85%. It was instantly recognised as a breakthrough.

By 2015 almost everyone in the image-recognition field was using deep learning, and the winning accuracy at the ImageNet Challenge had reached 96%—better than the average human score. Deep learning was also being applied to a host of other “problems…reserved for humans” which could be reduced to the mapping of one type of thing onto another: speech recognition (mapping sound to text), face-recognition (mapping faces to names) and translation.

In all these applications the huge amounts of data that could be accessed through the internet were vital to success; what was more, the number of people using the internet spoke to the possibility of large markets. And the bigger (ie, deeper) the networks were made, and the more training data they were given, the more their performance improved.

Deep learning was soon being deployed in all kinds of new products and services. Voice-driven devices such as Amazon’s Alexa appeared. Online transcription services became useful. Web browsers offered automatic translations. Saying such things were enabled by AI started to sound cool, rather than embarrassing, though it was also a bit redundant; nearly every technology referred to as AI then and now actually relies on deep learning under the bonnet.

In 2017 a qualitative change was added to the quantitative benefits being provided by more computing power and more data: a new way of arranging connections between neurons called the transformer. Transformers enable neural networks to keep track of patterns in their input, even if the elements of the pattern are far apart, in a way that allows them to bestow “attention” on particular features in the data.
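What “attention” means in practice can be sketched in a few lines. The snippet below is a minimal, hand-rolled version of the scaled dot-product calculation at the heart of the transformer. The vectors are made-up numbers chosen for illustration; in a real model the queries, keys and values are learned during training.

```python
import math

def softmax(scores):
    """Turn raw scores into positive weights that add up to 1."""
    peak = max(scores)  # subtracting the peak keeps exp() numerically stable
    exps = [math.exp(s - peak) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]

def attend(query, keys, values):
    """Score every position against the query, then blend the values accordingly."""
    scale = math.sqrt(len(query))
    scores = [sum(q * k for q, k in zip(query, key)) / scale for key in keys]
    weights = softmax(scores)
    dims = len(values[0])
    blended = [sum(w * v[d] for w, v in zip(weights, values)) for d in range(dims)]
    return weights, blended

# Hypothetical two-number vectors for an input of four elements.
keys = [[1.0, 0.0], [0.0, 1.0], [0.0, 1.0], [1.0, 0.0]]
values = [[10.0, 0.0], [0.0, 10.0], [0.0, 20.0], [30.0, 0.0]]
query = [4.0, 0.0]  # resembles the first and last keys, which sit far apart

weights, blended = attend(query, keys, values)
print([round(w, 3) for w in weights])
```

The first and last positions receive nearly all the weight even though they are at opposite ends of the input. Distance does not enter the calculation at all, which is why a pattern's elements can be far apart and still be picked up.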

Transformers gave networks a better grasp of context, which suited them to a technique called “self-supervised learning”. In essence, some words are randomly blanked out during training, and the model teaches itself to fill in the most likely candidate. Because the training data do not have to be labelled in advance, such models can be trained using billions of words of raw text taken from the internet.
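The reason no labelling is needed can be shown with a small sketch: the raw text supplies both the question and the answer. The code below only builds a training example by blanking out words, as described above; it does not train a model. The placeholder token, the 15% masking fraction and the function names are assumptions made for illustration.

```python
import random

MASK = "[MASK]"  # placeholder for a hidden word; the token name is an assumption

def make_training_example(sentence, mask_fraction=0.15, rng=None):
    """Hide a random subset of words and keep the originals as the answers."""
    rng = rng or random.Random()
    words = sentence.split()
    how_many = max(1, round(len(words) * mask_fraction))
    hidden_positions = sorted(rng.sample(range(len(words)), how_many))
    masked = list(words)
    answers = {}
    for position in hidden_positions:
        answers[position] = words[position]  # what the model must learn to predict
        masked[position] = MASK
    return masked, answers

def restore(masked, answers):
    """Put the hidden words back, to check that nothing was lost."""
    return [answers.get(i, word) for i, word in enumerate(masked)]

sentence = "the strength of each connection is adjusted during training"
masked, answers = make_training_example(sentence, rng=random.Random(0))
print(" ".join(masked))
print(answers)
```

Every sentence of raw text can be turned into such an exercise automatically, so the supply of training material is limited only by how much text can be collected.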

Mind your language model

Transformer-based large language models (LLMs) began attracting wider attention in 2019, when a model called GPT-2 was released by OpenAI, a startup (GPT stands for generative pre-trained transformer). Such LLMs turned out to be capable of “emergent” behaviour for which they had not been explicitly trained. Soaking up huge amounts of language did not just make them surprisingly adept at linguistic tasks like summarisation or translation, but also at things—like simple arithmetic and the writing of software—which were implicit in the training data. Less happily it also meant they reproduced biases in the data fed to them, which meant many of the prevailing prejudices of human society emerged in their output.

In November 2022 a larger OpenAI model, GPT-3.5, was presented to the public in the form of a chatbot. Anyone with a web browser could enter a prompt and get a response. No consumer product has ever taken off quicker. Within weeks ChatGPT was generating everything from college essays to computer code. AI had made another great leap forward.

Where the first cohort of AI-powered products was based on recognition, this second one is based on generation. Deep-learning models such as Stable Diffusion and DALL-E, which also made their debuts around that time, used a technique called diffusion to turn text prompts into images. Other models can produce surprisingly realistic video, speech or music.

The leap is not just technological. Making things makes a difference. ChatGPT and rivals such as Gemini (from Google) and Claude (from Anthropic, founded by researchers previously at OpenAI) produce outputs from calculations just as other deep-learning systems do. But the fact that they respond to requests with novelties makes them feel very unlike software which recognises faces, takes dictation or translates menus. They really do seem to “use language” and “form abstractions”, just as McCarthy had hoped.

This series of briefs will look at how these models work, how much further their powers can grow, what new uses they will be put to, as well as what they will not, or should not, be used for.

© 2024, The Economist Newspaper Limited. All rights reserved. From The Economist, published under licence. The original content can be found on www.economist.com


