We look at a method of AI development built on the idea of positive and negative feedback


One of the most fascinating subdivisions of artificial intelligence (AI) is reinforcement learning. Itself a subset of machine learning (ML), reinforcement learning technology is widely tested on games, such as Go, but its development might have wider implications for industries and businesses.

This branch of AI aspires to reflect human-like capabilities and has even exceeded these ambitions when applied in gaming contexts. For instance, it’s gone toe-to-toe with several world champions in their specialities.

Ke Jie, for example, is a Go world champion who has been humbled by a reinforcement learning system. The Chinese competitor had dominated the game from 2014, but he was beaten three times in 2017 by a system developed by Google’s DeepMind division.

The previous year, DeepMind’s AlphaGo system lost to the 18-time Go champion Lee Sedol in the fourth of a five-game series, although it won the other four games. Lee then retired in 2019, citing the dominance of AI and suggesting it “cannot be defeated”.

Although reinforcement learning has proven itself in the realm of gaming, this technology can also be used in robotics and automation. Further breakthroughs, therefore, can have significant implications for businesses and the wider economy.

What is reinforcement learning?

Reinforcement learning (RL) is a method of training ML systems to find their own way of solving complex problems, rather than making decisions based on preconfigured possibilities that a programmer has set.

Both positive and negative reinforcement are used: correct decisions lead to rewards, whereas incorrect decisions are penalised. Although humans normally consider rewards to be a treat of some description, for machines the reward is a positive evaluation of an action.

RL also doesn't rely on human involvement during the training process. In classic ML, using what's known as supervised learning, a machine learning algorithm learns from examples that humans have already labelled with the correct answer. Using the game of Go as an example, someone training the algorithm could give it a list of moves to make in a given scenario, which the program could then choose from.

The problem with this model is that the algorithm then becomes only as good as the human programming it, which means the machine cannot learn by itself.

The goal of reinforcement learning is to train the algorithm to make sequential decisions to reach an end goal; over time, the algorithm will learn how to make decisions that reach the goal in the most efficient way using reinforcement. When trained using reinforcement learning, artificial intelligence systems can draw experiences from many more decision trees than humans, which makes them better at solving complex tasks – at least in gamified environments.
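One common way to express "reaching the goal in the most efficient way" is to discount rewards that arrive later, so that a quicker route scores higher than a slower one. The sketch below is only an illustration of that idea: the function name, the discount factor of 0.9 and the two example episodes are assumptions, not something taken from a particular system.

```python
def discounted_return(rewards, gamma=0.9):
    """Sum of rewards, each weighted by gamma ** (number of steps before it)."""
    total = 0.0
    # Work backwards so that each step computes G_t = r_t + gamma * G_{t+1}.
    for reward in reversed(rewards):
        total = reward + gamma * total
    return total


# Two hypothetical episodes that both end with a reward of 1 for reaching the goal.
short_route = [0, 0, 1]        # goal reached on the third step
long_route = [0, 0, 0, 0, 1]   # goal reached on the fifth step

print(discounted_return(short_route))
print(discounted_return(long_route))
```

Both routes reach the same goal, but the shorter one earns the larger discounted return, which is what pushes an agent towards efficient behaviour.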

Learning to win

Reinforcement learning shares many similarities with supervised learning in a classroom. A framework establishing the ground rules is still required, but the software agent is never told what instructions it should follow, nor is it given a database from which to draw. This type of approach allows a system to create its own dataset from its actions, built using trial and error, to establish the most efficient route to a reward.

This is all done sequentially – a software agent will take one action at a time until it encounters a state for which it is penalised. For example, a virtual car leaving a road or track will produce an error state, and the problem is reset to its starting position. For many processes, we don't actually need the system to learn to make new decisions as it develops, but rather just to refine its data processing capabilities, as is the case with facial recognition technology. However, for some, reinforcement learning is by far the most beneficial form of development.
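The one-action-at-a-time loop, including the reset after an error state, can be sketched in a few lines of Python. This is a minimal illustration rather than a real simulator: the `TrackEnv` class, its five-position track, the reward values and the random policy are all assumptions made up for the example.

```python
import random


class TrackEnv:
    """Hypothetical 1-D track: positions 0..4 are road, anything else is off-track."""

    def __init__(self, length=5, goal=4):
        self.length = length
        self.goal = goal
        self.position = 0

    def reset(self):
        # Put the car back at its starting position.
        self.position = 0
        return self.position

    def step(self, action):
        # action is -1 (move left) or +1 (move right).
        self.position += action
        if self.position < 0 or self.position >= self.length:
            return self.position, -1.0, True   # left the track: penalty, episode ends
        if self.position == self.goal:
            return self.position, 1.0, True    # reached the goal: reward, episode ends
        return self.position, 0.0, False       # still on the track: keep going


def run_episode(env, policy, max_steps=100):
    """Take one action at a time until the episode ends; return the total reward."""
    state = env.reset()
    total_reward = 0.0
    for _ in range(max_steps):
        action = policy(state)
        state, reward, done = env.step(action)
        total_reward += reward
        if done:
            break
    return total_reward


def random_policy(state):
    # Pure trial and error: ignore the state and pick a direction at random.
    return random.choice([-1, 1])


if __name__ == "__main__":
    random.seed(0)
    env = TrackEnv()
    print([run_episode(env, random_policy) for _ in range(10)])
```

A learning agent would replace `random_policy` with something that improves from the rewards it collects; the surrounding loop stays the same.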

One of the most famous examples is the case of Google's DeepMind, which used a Deep Q-Learning algorithm to master Atari games including Breakout, the classic 70s arcade game in which players smash through eight rows of blocks with a ball and paddle. During its development, the software agent was only provided with the information that appeared on screen and was tasked with simply maximising its score.

As you might expect, the agent struggled to get to grips with the game early on. Researchers found it was unable to grasp the controls and consistently missed the ball with the paddle. After a great deal of trial and error, the agent eventually figured out that if it angled the ball so that it became stuck between the highest layer and the top wall, it could break down the majority of the wall with only a small number of paddle hits. Not only that, it was able to understand that each time the ball travelled back to the paddle, the efficiency of the run dropped, and the length of the game increased.

The agent was basing its decisions on a policy network. Every action taken by the agent was recorded by the network, which also noted the result and what could be done differently to change that result. The result, also known as a state, could therefore be predicted by the agent.
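The idea of recording each action and its result, then using that record to predict outcomes, is easiest to see in tabular Q-learning, the simpler relative of Deep Q-Learning. The sketch below is not DeepMind's implementation, which replaced the table with a neural network reading screen pixels. The five-position corridor, the reward values and the hyperparameters (`alpha`, `gamma`, `epsilon`) are all assumptions chosen for illustration.

```python
import random

N_STATES = 5          # positions 0..4 on a hypothetical corridor
GOAL = 4              # reaching this position ends the episode with reward +1
ACTIONS = [-1, 1]     # step left or step right


def step(state, action):
    """Return (next_state, reward, done) for the toy corridor."""
    next_state = state + action
    if next_state < 0:
        return state, -1.0, True      # fell off the left edge: penalty
    if next_state == GOAL:
        return next_state, 1.0, True  # reached the goal: reward
    return next_state, 0.0, False


def train(episodes=500, alpha=0.1, gamma=0.9, epsilon=0.1, seed=0):
    rng = random.Random(seed)
    # q[state][action_index] is the current estimate of long-term reward.
    q = [[0.0, 0.0] for _ in range(N_STATES)]
    for _ in range(episodes):
        state, done = 0, False
        while not done:
            if rng.random() < epsilon:
                # Explore: try a random action.
                a = rng.randrange(len(ACTIONS))
            else:
                # Exploit: pick the best-known action, breaking ties at random.
                best = max(q[state])
                a = rng.choice([i for i, v in enumerate(q[state]) if v == best])
            next_state, reward, done = step(state, ACTIONS[a])
            # Q-learning update: move the estimate towards
            # reward + gamma * (best estimated value of the next state).
            target = reward if done else reward + gamma * max(q[next_state])
            q[state][a] += alpha * (target - q[state][a])
            state = next_state
    return q


if __name__ == "__main__":
    for position, (left, right) in enumerate(train()):
        print(position, round(left, 3), round(right, 3))
```

After training, the "right" column should hold the larger value in every non-goal position, meaning the agent has learned from trial and error alone that heading for the goal pays off.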

Problems with reinforcement learning

The example above is useful for understanding the fundamental principles of reinforcement learning, but gaming environments, no matter how large, only offer limited scope for learning and rarely offer anything meaningful beyond simple testing.

Success is not always easily translated into real-world use cases, particularly as it relies on a system of reward and failure states that are often ambiguous in reality. Tasking an agent with solving a particular challenge within tight parameters is one thing, but creating a realistic simulation that's applicable for everyday use is far harder.

If we take the example of an autonomous vehicle system, creating a simulation for it to learn from can be incredibly challenging. Not only does the simulation need to accurately represent a real-world road, and convey the various laws and restrictions that govern car use, but it also needs to take into account constant changes in traffic volume, the sudden actions of other human drivers (who may not be obeying the highway code themselves), and random obstacles.

There is also a variety of technical challenges that limit the potential of this type of learning. There are examples of systems 'forgetting' older actions, results and predictions when new knowledge is acquired. There have also been problems with agents successfully achieving a desired positive state, but doing so in an inefficient or undesired way. For example, in 2018 Deepsense.ai sought to teach an algorithm to run, but found that the agent developed a tendency to jump instead, as this brought it to its future positive state far more quickly.
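The running-versus-jumping problem comes down to what the reward actually measures. The toy sketch below assumes a reward that counts only forward progress; the two gaits and their per-step distances are invented numbers for illustration and are not taken from the Deepsense.ai experiment.

```python
def progress_reward(start_x, end_x):
    """Reward only forward displacement; it says nothing about how it was achieved."""
    return end_x - start_x


def total_reward(displacements):
    """Add up the progress reward over a sequence of per-step displacements."""
    x = 0.0
    total = 0.0
    for dx in displacements:
        total += progress_reward(x, x + dx)
        x += dx
    return total


# Hypothetical per-step displacements (in metres) for two invented gaits.
gaits = {
    "run": [0.5, 0.5, 0.5, 0.5],
    "jump": [0.0, 1.5, 0.0, 1.5],
}

scores = {name: total_reward(steps) for name, steps in gaits.items()}
best = max(scores, key=scores.get)
print(scores, best)
```

Because nothing in the reward describes the style of movement, an agent maximising it has no reason to prefer running; fixing that means changing the reward, not the agent.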

The future of machine learning?

These gaming environments, however interesting, are really only for testing purposes. Real-world applications require agents to learn far more complicated environments, and depending on how abstract or unknown the challenge is, RL might not be the easiest approach.

RL is best applied to specific, quantifiable goals – for example, in teaching self-driving cars how to park, change lanes, overtake other cars, and more.

The tech is also being used in factories, where robots can not only perform tasks more efficiently than humans, but without risk of injury. Google has used RL to control the cooling of its data centres without human intervention, which has resulted in the tech giant reducing the energy it uses for cooling by up to 40%.

In trading and finance, an RL agent can be trained to decide whether to hold, buy, or sell stocks using market benchmark standards, removing the need for analysts to make every decision.

Other applications in the future include diagnosing medical conditions, smart prosthetic limbs, and fully automated factories. It’s not an easy technology to implement, but with time, it could be the driving force of future technology.
