AI Enabling Technologies: A Survey
Vijay Gadepally, Justin Goodwin, Jeremy Kepner, Albert Reuther, Hayley Reynolds,
Siddharth Samsi, Jonathan Su, David Martinez
MIT Lincoln Laboratory
244 Wood Street
Lexington, MA, 02421
ABSTRACT
Artificial intelligence (AI) has the opportunity to revolutionize the way the United States Department
of Defense (DoD) and Intelligence Community (IC) address the challenges of evolving threats, data deluge,
and rapid courses of action. Developing an end-to-end AI system involves parallel development of different
pieces that must work together in order to provide capabilities that can be used by decision makers, warfighters,
and analysts. These pieces include data collection, data conditioning, algorithms, computing, robust AI, and
human–machine teaming. Although much of today's popular press coverage centers on advances in algorithms
and computing, most modern AI systems leverage advances across numerous different fields. Further, while
certain components may not be as visible to end-users as others, our experience has shown that each of these
interrelated components plays a major role in the success or failure of an AI system. This article is meant to
highlight many of these technologies that are involved in an end-to-end AI system. The goal of this article is
to provide readers with an overview of terminology, technical details, and recent highlights from academia,
industry, and government. Where possible, we indicate relevant resources that can be used for further reading
and understanding.
DISTRIBUTION STATEMENT A. Approved for public release. Distribution is unlimited.
This material is based upon work supported by the United States Air Force under Air Force Contract No. FA8702-
15-D-0001. Any opinions, findings, conclusions or recommendations expressed in this material are those of the
author(s) and do not necessarily reflect the views of the United States Air Force.
© 2019 Massachusetts Institute of Technology.
Delivered to the U.S. Government with Unlimited Rights, as defined in DFARS Part 252.227-7013 or 7014 (Feb
2014). Notwithstanding any copyright notice, U.S. Government rights in this work are defined by DFARS 252.227-
7013 or DFARS 252.227-7014 as detailed above. Use of this work other than as specifically authorized by the U.S.
Government may violate any copyrights that exist in this work.
1 INTRODUCTION
Figure 1.1. Canonical AI architecture consists of sensors, data conditioning, algorithms, modern computing, robust
AI, human–machine teaming, and users (missions). Each step is critical in developing end-to-end AI applications and
systems.
AI has the opportunity to revolutionize the way the DoD and IC address the challenges of evolving
threats, data deluge, and rapid courses of action. AI solutions involve a number of different pieces that must
work together in order to provide capabilities that can be used by decision makers, warfighters, and analysts.
Consider the canonical architecture of an AI system in Figure 1.1. This figure outlines many of the important
components needed when developing an end-to-end AI solution. While much of the popular press coverage
centers on advances in algorithms and computing, most modern AI systems leverage advances across numerous
different fields. Further, while certain components may not be as visible to end users as others, our experience
has shown that each of these interrelated components plays a major role in the success or failure of an AI system.
On the left side of Figure 1.1, we have data coming in from a variety of structured and unstructured
sources. Together, these sources often provide different views of the same entities and/or phenomenology.
Data from sensors are typically categorized as structured because the raw digital data are accompanied by
metadata. In contrast, we use "unstructured data" to refer to data with no predefined structure or accompanying
metadata.
These raw data are often fed into a data conditioning step in which they are fused, aggregated, structured,
accumulated, and converted to information. The main objective of this subcomponent is to transform data
into information. An example of information is a newly labeled sensor image that can be used to classify
whether an object of interest (such as a particular vehicle) is present in the image. Typical functions performed
in this subcomponent include standardizing data formats to comply with a data ontology, labeling data, and
highlighting missing or incomplete data and errors or biases in the data.
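As a toy illustration of these conditioning functions, the sketch below maps raw records with inconsistent field names onto a small canonical schema, attaches labels, and flags gaps. The ontology, field names, and records are hypothetical, invented for illustration; they are not drawn from any dataset discussed here.

```python
# Hypothetical sketch of a data conditioning step: raw records arrive with
# inconsistent field names; we standardize them against a small ontology,
# attach labels, and flag missing or incomplete entries.

# Hypothetical ontology: canonical field name -> accepted aliases.
ONTOLOGY = {
    "timestamp": ("timestamp", "time", "ts"),
    "sensor_id": ("sensor_id", "sensor"),
    "image": ("image", "frame", "img"),
}

def condition(raw_records, labels=None):
    """Standardize field names, attach labels, and report missing fields."""
    labels = labels or {}
    conditioned, issues = [], []
    for i, record in enumerate(raw_records):
        out = {}
        for canonical, aliases in ONTOLOGY.items():
            # Take the first alias that appears in the raw record, if any.
            value = next((record[a] for a in aliases if a in record), None)
            if value is None:
                issues.append((i, "missing " + canonical))
            out[canonical] = value
        out["label"] = labels.get(i)  # data labeling (None if unlabeled)
        conditioned.append(out)
    return conditioned, issues
```

For example, conditioning one record keyed by `ts`/`sensor`/`img` and a second record that lacks any image field would standardize both and flag the second as missing its image.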
The information generated by the data conditioning step feeds into a host of supervised and
unsupervised algorithms such as neural networks. These algorithms are used to extract patterns, predict new
events, fill in missing data, or look for similarities across datasets. These algorithms essentially convert the
input information to actionable knowledge. In our definition, we use the term “knowledge” to describe
information that has been converted into a higher-level representation that is ready for human consumption.
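As a toy illustration of a supervised algorithm turning information into knowledge, the sketch below uses nothing more elaborate than a one-nearest-neighbor classifier over labeled feature vectors. The feature values and class names are fabricated for illustration and are not the methods or data of this paper.

```python
import math

# Fabricated "information": feature vectors with human-supplied labels.
training_data = [
    ((0.0, 0.0), "background"),
    ((0.1, 0.2), "background"),
    ((5.0, 5.1), "vehicle"),
    ((4.9, 5.0), "vehicle"),
]

def predict(features):
    """1-nearest-neighbor: return the label of the closest training point."""
    closest = min(training_data, key=lambda item: math.dist(item[0], features))
    return closest[1]  # the "knowledge": a predicted class for the new input
```

A new observation near the "vehicle" cluster is assigned that class; the same extract-a-pattern-and-apply-it structure underlies far richer models such as neural networks.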
With the knowledge extracted in the algorithms phase, it is important to include the human being in the
decision-making process. This is done in the human–machine teaming phase. Although there are a few
applications that may be amenable to autonomous decision making (e.g., email spam filtering), recent AI
advances of relevance to the DoD have largely been in fields where a human is either in- or on-the-loop. The
phase of human–machine teaming is critical in connecting the data and algorithms to the end user and in
providing the mission users with useful and relevant insight. Human–machine teaming is the phase in which
knowledge can be turned into actionable intelligence or insight by effectively utilizing human and machine
resources as appropriate.
Underpinning all of these phases is the bedrock of modern computing systems made up of a number of
heterogeneous computing elements. For example, sensor processing may occur on low-power embedded
computers, whereas algorithms may be computed in very large data centers. With the end of Moore's law [1],
we have seen a Cambrian explosion of computing technologies and architectures. Understanding the relative
benefits of these technologies is of particular importance when applying AI to domains under significant
constraints such as size, weight, and power.

Figure 1.2. Example categories and video screen shots from the Moments in Time Dataset (e.g., Pulling,
Applauding, Asking, Drawing, Adult+Male+Speaking, Skating).
Another foundational technology underpinning AI development is robust or trusted AI. In this area,
researchers are looking at ways to explain AI outcomes (for example, why a system is recommending a
particular course of action); metrics to measure the effectiveness of an AI algorithm (going beyond the
traditional accuracy and precision metrics for complex applications or decisions); verification and validation
(ensuring that results are provably correct under adversarial conditions); security (dealing with malicious or
counter-AI technology); and policy decisions that govern the safe, responsible, and ethical use of AI
technology. Although traditional academic and commercial players are looking at these issues, some nonprofit
initiatives such as OpenAI and the Allen Institute are taking a leading role in this area.
In the following sections, we highlight some of the salient technical concepts, research challenges, and
opportunities for each of these core components of an AI system. In order to elucidate these components, we
also use a running example based on research applying high-performance computing (HPC) to video
classification. We would also like to note that each of the components of the AI architecture is a vast academic
area with a rich history and numerous published results. In order to provide readers with an overall view
of all the components within this section, we concentrate on high-level concepts and include vignettes of
select research highlights and application examples.
1.1 VIDEO CLASSIFICATION EXAMPLE OVERVIEW
Over the course of this section, in order to provide concrete examples of the components of the AI
architecture being discussed, we use a running example based on our research applying high-performance
computing to video classification. Specifically, we concentrate on the Moments in Time Dataset [2], recently
developed at the Massachusetts Institute of Technology (MIT) Computer Science and Artificial Intelligence
Laboratory (CSAIL). This dataset consists of one million videos, each given a label corresponding to an
action being performed in the video. Each video is approximately three seconds in length
and is labeled according to what a human observer believes is happening in the video. For example, a video
of a dog barking is classified as “barking” and a video of people clapping would be labeled as “clapping.”
Figure 1.2 shows a few screenshots of videos from the dataset and their associated labels. Of course, there are
many cases where a particular label may be ambiguous. For example, videos with the action label "cramming"
could show a person studying before an exam or someone stuffing something into a box. At present, each
video in the Moments in Time Dataset is labeled with one of approximately 380 possible labels. Some of the
video clips also contain audio, but audio is not present in all videos.
The Moments in Time Dataset is an example of a well-curated dataset that can be used to conduct
research on video classification. To this end, the creators of the dataset held a competition in 2018 to
encourage dataset usage and to share results that highlight the state of the field. Information about this
competition can be found at https://moments.csail.mit.edu/challenge2018/.
As a metric for assessing the quality of a particular algorithm, the competition called for reporting a
top-k accuracy score. This metric is defined as follows: an algorithm labels each video with k candidate
labels, ranked by decreasing predicted probability, and a video counts as correctly identified if the correct
label appears among those top k labels. For example, a video may be classified (in decreasing probability) as
(barking, yelling, running, …). If the correct label (as judged by a human observer) is "yelling," the top-five
accuracy for this video would be 1, while the top-one accuracy would be 0. As of June 2018, competition
winners had top-one accuracies of approximately 0.3 and top-five accuracies of approximately 0.6 [3–5].
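The top-k metric described above is straightforward to compute; a minimal sketch follows (the function name and argument layout are our own, not from the competition):

```python
def top_k_accuracy(ranked_predictions, true_labels, k):
    """Fraction of videos whose correct label appears in the top k predictions.

    ranked_predictions: one list of labels per video, sorted by decreasing
    predicted probability; true_labels: the correct label for each video.
    """
    hits = sum(1 for predictions, truth in zip(ranked_predictions, true_labels)
               if truth in predictions[:k])
    return hits / len(true_labels)
```

With the example above, a video ranked (barking, yelling, running, …) whose correct label is "yelling" scores 1 under top-five accuracy and 0 under top-one accuracy.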