无码少妇一区二区三区免费,妓院一钑片免看黄大片,国语自产视频在线,亚洲AV成人无码国产一区二区,激情久久综合精品久久人妻,日韩免费毛片,综合成人亚洲网友偷自拍,国内自拍视频在线观看,欧美熟妇性xxxx交潮喷,国产成人精品一区二免费网站

China Focus: Data-labeling: the human power behind Artificial Intelligence

Source: Xinhua| 2019-01-17 20:42:21|Editor: ZX
Video PlayerClose

BEIJING, Jan. 17 (Xinhua) -- In a five-story building on the outskirts of Beijing, 22-year-old Zhang Yusen stares at a computer screen, carefully drawing boxes around cars in street photos.

As artificial voices replace human customer services in call centers and robots replace workers on production lines, Zhang, a vocational school graduate, has found a steady job: data-labeling, a new industry laying the groundwork for the development of AI technologies.

SUPERVISED LEARNING

As the "artificial" part of AI, data labeling receives much less media attention than the "intelligence" part of computer algorithms.

Facial recognition, self-driving, diagnosis of tumors by computer systems and the defeat of best human Go player by Alpha Go are ways AI technologies have amazed in recent years.

However, for researchers, the current AI technologies are still quite limited and at an early stage.

Professor Chen Xiaoping, director of Robotics Lab at the University of Science and Technology of China, said all AI technologies so far have come from "supervised" learning in which an AI system is trained with specific forms of data.

Take training a machine to recognize dogs for instance: the system must be fed vast numbers of pictures labeled by humans to tell the system which pictures have dogs and which don't.

Chen noted the human brain is excellent at processing unknown information with reasoning, but it is still impossible for AI. A kindergartener can make the guess of soccer ball from clues like "a black and white round object you can kick," but it's not a easy task for AI. An AI system might be able to tell all different kinds of dogs, but it cannot tell a stuffed animal is not real if such images are not sent to the system.

Yann LeCun, AI scientist at Facebook and widely considered one of the "godfathers" of machine-learning, said recently, "Our best AI systems have less common sense than a house cat."

Behind powerful AI algorithms are vast complicated dataset built and labeled by humans.

ImageNet is one of the world's largest visual databases designed to train AI systems to see. According to its inventors, it took nearly 50,000 people in 167 countries and regions to clean, sort and label nearly a billion images over more than three years.

QUALITY CHECKING

For top researchers like Chen Xiaoping, the next AI breakthrough is expected in self-supervised or unsupervised learning in which AI systems learn without human labeling. But no one knows when it will happen.

"I think in the next five to 10, maybe 15 years, AI systems will still rely on labeled data." said Du Lin, CEO and founder of data-labeling firm BasicFinder.

Du published his first paper about computer vision when he was in high school. After graduating from college, his first windfall came from selling a startup data-digging firm for 4 million U.S. dollars.

In 2014, Du and his partners noticed the rise of AI deep-learning and founded BasicFinder. The company is now a leading data-labeling company, with clients including Stanford University, the Chinese Academy of Sciences, China Mobile and Chinese AI startup SenseTime.

At BasicFinder, a typical work flow starts with taggers like Zhang Yusen. After training two to three months, they draw boxes around cars and pedestrians in street photos, tag ancient German letters, or transcribe snatches of speech.

The labeled images are submitted to quality inspectors who check 2,000 pictures a day. If one image is found inaccurately tagged in every 500 images in random checks, the company is not paid the original price. If the error rate exceeds 1 percent, clients can ask to change data-taggers.

Du said the company has been optimizing work flow to ensure greater accuracy as well as to protect intellectual property and privacy.

HUMAN IN LOOP

A model that requires human interaction is called "human in the loop" and humans remain in the loop much longer than many have expected, said Du.

Data-taggers now work on outsourcing platforms as far afield as Mexico, Kenya, India and Venezuela. Anyone can create an account to become a freelance data-tagger.

But Du strongly disagrees that data-labeling companies, depicted in some media reports as "the dirty little secret" of AI, resemble Foxconn's infamous iPhone factories.

He noted that due to the nature of AI deep-learning, it is the greater accuracy of labeled data that keeps a company alive and thriving, rather than low prices and cheap labor.

China's Caijing magazine reported in October last year that about half of data-labeling companies in China's Henan Province went bust in 2018 as orders dried up.

Du said that in the past two years, many found data-labeling a tough market. The first spurt of growth has ended and a lot of workshop-like companies have been knocked out.

A full-time data-tagger at BasicFinder can earn 6,000 to 7,000 yuan a month, along with accommodation and social benefits. In the first three quarters of 2018, the disposable income per capita in Beijing was 46,426 yuan, around 5,158 yuan a month, according to local government statistics.

Zhang Yusen and his girlfriend, who also works at BasicFinder as a quality inspector are so far enjoying their work.

TOP STORIES
EDITOR’S CHOICE
MOST VIEWED
EXPLORE XINHUANET
010020070750000000000000011100001377521541
偷窥盗摄国产在线视频| 亚洲在av极品无码天堂手机版| 在线天堂免费观看.www| 国产日韩av二区三区| 亚洲自拍精品视频在线| 乱码午夜-极品国产内射| 一区二区丝袜美腿视频| 999久久久免费精品国产| 精品久久人人做爽综合| 国产成人情侣激情视频| 亚洲国产精品久久婷婷老年| 国产精品日韩精品日韩| 亚洲Av综合日韩精品久久久| 最新精品国偷自产在线| 日韩亚洲精品中文字幕| 99国产在线视频| 日本二区三区四区在线观看| 久久久成人毛片无码| 午夜亚洲国产理论片4080 | 大地资源中文在线观看免费版高清| 2021国内精品久久久久精免费| 天天躁日日躁狠狠躁欧美老妇| 少妇一晚三次一区二区三区| 精品国产亚欧无码久久久| 亚洲粉嫩av一区二区黑人| 色婷婷日日躁夜夜躁| 国产亚洲精品俞拍视频| 精品国产免费Av无码久久久| 丰满少妇被猛男猛烈进入久久 | 日本在线a一区视频高清视频| 亚洲综合色区在线播放2019| 久久国产精品一国产精品| 好姑娘6电影在线观看| 欧美z0z0人禽交| 国产精品亚洲αv天堂无码| 99精品人妻少妇一区| 啊灬啊别停灬用力啊无码视频| 久久人人97超碰精品| 国产免费观看av大片的网站| 亚洲精品麻豆一区二区| 花式道具play高h文调教|