기계학습 교본: Azure ML스튜디오에서 첫 번째 실험 만들기 (3)

Machine learning tutorial: Create your first experiment in Azure Machine Learning Studio
기계학습 교본: Azure ML스튜디오에서 첫 번째 실험 만들기

By Gary EricsonLast updated: 11/03/2015

In this first machine learning tutorial, we'll create a linear regression model that predicts the price of an automobile based on different variables such as make and technical specifications. To do this, we'll use Azure Machine Learning Studio to develop and iterate on a simple predictive analytics experiment.
이번 첫 번째 기계학습(ML) 교본에서는, 제작회사 및 기술 규격 등과 같은 서로 다른 변수들(variables)에 근거하여 자동차 가격을 예측하는, 선형회귀(linear regression) 모델을 만들겠습니다. 이를 위하여 Azure ML 스튜디오를 사용하여 간단한 예측분석 실험을 개발하여 반복시킵니다.

A Machine Learning Studio experiment consists of dragging components to a canvas, and connecting them in order to create a model, train the model, and score and test the model. The experiment uses predictive modeling techniques in the form of Machine Learning Studio modules that ingest data, train a model against it, and apply the model to new data. You can also add modules to preprocess data and select features, split data into training and test sets, and evaluate or cross-validate the quality of your model.
ML스튜디오 실험은, 구성요소들을 캔버스로 끌어오기, 그리고 모델 만들기, 모델훈련, 모델채점 및 모델테스트를 위한 구성요소간의 연결로 구성됩니다. 이 실험은, 데이터를 가져오고, 이 데이터로 모델을 훈련시키고, 새 데이터에 모델을 적용하는 ML스튜디오 모듈 형태의 예측 모델링 기법을 사용합니다. 또한 데이터 사전(pre) 처리 모듈을 추가할 수 있고, 특성 (features)을 선택할 수 있고, 훈련용 및 테스트용 세트로 데이터를 분리할 수 있고, 모델 품질을 평가하거나 교차 검증할 수 있습니다.

Enter Machine Learning Studio: https://studio.azureml.net, and click the Get started button. You can choose either the Guest Access or sign in with your Microsoft account.

For more general information about Machine Learning Studio, see What is Machine Learning Studio?.

ML스튜디오: https://studio.azureml.net를 엔터하고 Get started버튼을 클릭하십시오. Guest Access 또는 Microsoft account 등록 중에서 선택할 수 있습니다.
ML스튜디오에 관한 보다 자세한 내용은 What is Machine Learning Studio?를 참조하세요.

Five steps to create an experiment 실험만들기 5단계

In this machine learning tutorial, you'll follow five basic steps to build an experiment in Machine Learning Studio in order to create, train, and score your model:
이 ML 교본은, ML스튜디오에서 모델을 만들고, 훈련하고, 채점할 수 있는 실험을 구축하기 위하여 5가지 기본 단계를 따릅니다.

Create a model 모델 만들기

Train the model 모델 훈련시키기

Step 4: Choose and apply a learning algorithm 4단계: 학습 알고리즘 선택 및 적용

Score and test the model 모델 채점하고 테스트하기

Step 5: Predict new automobile prices 5단계: 새 자동차가격 예측

Step 1: Get data 1단계: 데이터 가져오기

There are a number of sample datasets included with Machine Learning Studio that you can choose from, and you can import data from many sources. For this example, we will use the included sample dataset, Automobile price data (Raw). This dataset includes entries for a number of individual automobiles, including information such as make, model, technical specifications, and price.
많은 데이터소스에서 데이터를 선택할 수 있고, 데이터를 가져올 수 있는, 많은 샘플 데이터집합이 ML스튜디오에 포함되어 있습니다. 이 예제에서는 자동차가격 데이터를 나타내는 포함된 샘플 데이터집합 Automobile price data (Raw)를 사용합니다. 이 데이터집합에는 제조사, 모델, 기술규격 및 가격과 같은 정보를 포함하여, 많은 개별 자동차들에 대한 기재사항이 포함되어 있습니다.

1. Start a new experiment by clicking +NEW at the bottom of the Machine Learning Studio window, select EXPERIMENT, and then select Blank Experiment. Select the default experiment name at the top of the canvas and rename it to something meaningful, for example, Automobile price prediction.
먼저 ML창의 아래쪽에서 +NEW(새로 만들기)를 클릭하여 새 실험을 시작한 다음 EXPERIMENT(실험), " Blank Experiment(빈 실험)"을 차례로 선택합니다. 캔버스 위쪽에서 기본실험 이름을 선택하고 이를 의미 있는 이름(예: Automobile price prediction)으로 바꿉니다.

2. To the left of the experiment canvas is a palette of datasets and modules. Type automobile in the Search box at the top of this palette to find the dataset labeled Automobile price data (Raw).
실험 캔버스 왼쪽에는 데이터집합과 모듈의 팔레트가 있습니다. 이 팔레트 맨 위의 검색상자에 automobile을 입력하여 레이블이 Automobile price data (Raw)인 데이터집합을 찾습니다.

Palette search

3. Drag the dataset to the experiment canvas. 실험 캔버스로 데이터집합을 끌어옵니다. 발견된 데이터집합을 더블 클릭하면 캔버스로 자동 이동됩니다.

Dataset

To see what this data looks like, click the output port at the bottom of the automobile dataset, and then select Visualize. The variables in the dataset appear as columns, and each instance of an automobile appears as a row. The far-right column (column 26 and titled "price") is the target variable we're going to try to predict.
이 데이터의 모양을 확인하려면 자동차 데이터집합 아래의 출력포트 우측을 두 번 클릭하고 Visualize(시각화)를 선택 합니다. 데이터집합에서 변수는 열(column)로, 각 자동차 인스턴스는 행(row)으로 나타납니다. 맨 오른쪽 열("price" 26열)이 예측하고자 하는 목적 변수(target variable) 입니다.

Dataset visualization

Close the visualization window by clicking the "x" in the upper-right corner.
오른쪽 위 모서리에서 "x"를 클릭하여 시각화 창을 닫습니다.

Step 2: Preprocess data 2단계: 데이터 전 처리

A dataset usually requires some preprocessing before it can be analyzed. You might have noticed the missing values present in the columns of various rows. These missing values need to be cleaned so the model can analyze the data correctly. In our case, we'll remove any rows that have missing values. Also, the normalized-losses column has a large proportion of missing values, so we'll exclude that column from the model altogether.
데이터집합은 일반적으로 사전 처리를 거쳐야 분석할 수 있습니다. 여러 행의 열에 누락된 값이 있는 것을 볼 수 있습니다. 데이터를 정확하게 분석하려면 이러한 누락된 값을 정리해야 합니다. 지금 누락된 값이 있는 행을 모두 제거합니다. 또한 normalized-losses 열에 값이 누락된 비율이 크므로 여기서는 해당 열을 모델에서 완전히 제외합니다.

TIP: Cleaning the missing values from input data is a prerequisite for using most of the modules.
팁: 입력데이터에서 누락 값을 정리하는 것은 대부분의 모듈 사용의 필수조건입니다.

First we'll remove the normalized-losses column, and then we'll remove any row that has missing data.
먼저 normalized-losses 열을 제거한 다음 누락된 데이터가 있는 행을 제거합니다.

1. Type project columns in the Search box at the top of the module palette to find the Project Columns module, then drag it to the experiment canvas and connect it to the output port of the Automobile price data (Raw) dataset. This module allows us to select which columns of data we want to include or exclude in the model.
모듈 팔레트의 맨 위에 있는 검색상자에 project columns(프로젝트 열)을 입력하여 Project Columns(프로젝트 열) 모듈을 찾은 다음, 이를 실험 캔버스로 끌어와 Automobile price data (Raw) 데이터집합의 출력포트에 연결합니다. 이 모듈을 사용하면 모델에서 포함하거나 제외할 데이터 열을 선택할 수 있습니다.

2. Select the Project Columns module and click Launch column selector in the Properties pane.
Project Columns(프로젝트 열) 모듈을 선택하고, 오른쪽 Properties(속성) 창에서 Launch column selector(열 선택기 시작)을 클릭합니다.

· Make sure All columns is selected in the filter drop-down list, Begin With. This directs Project Columns to pass through all the columns (except those we're about to exclude).
필터 드롭-다운 목록, Begin With에서 All columns(모든 열)을 선택합니다. Project Columns(프로젝트 열)이 모른 열을 통하여 전달되도록 이것이 디렉션합니다(제외할 열은 예외임).

· In the next row, select Exclude and column names, and then click inside the text box. A list of columns is displayed. Select normalized-losses, and it will be added to the text box.
다음 행에서 Exclude(제외) 및 column names(열 이름)을 선택한 후 텍스트 상자 내부를 클릭합니다. 열 목록이 표시됩니다. normalized-losses를 선택하여 텍스트 상자에 추가 합니다.

· Click the check mark (OK) button to close the column selector.
확인 표시(OK) 버튼을 클릭하여 column selector(열 선택기)를 닫습니다.

Select columns

The properties pane for Project Columns indicates that it will pass through all columns from the dataset except normalized-losses.
Project Columns(프로젝트 열)의 properties(속성) 창에 normalized-losses를 제외한 데이터집합의 모든 열이 전달된다는 것이 나타납니다.

Project Columns properties

TIP: You can add a comment to a module by double-clicking the module and entering text. This can help you see at a glance what the module is doing in your experiment. In this case, double-click the Project Columns module and type the comment "Exclude normalized-losses."
팁: 모듈을 2번 클릭하고 텍스트를 입력하여 모듈에 주석을 추가할 수 있습니다. 그러면 모듈이 실험에서 수행하는 내용을 한 눈에 볼 수 있습니다. 이 경우 프로젝트 열 모듈을 두 번 클릭하고 주석 "Exclude normalized-losses"를 입력합니다.

3. Drag the Clean Missing Data module to the experiment canvas and connect it to the Project Columns module. In the Properties pane, select Remove entire row under Cleaning mode to clean the data by removing rows that have missing values. Double-click the module and type the comment "Remove missing value rows."
Clean Missing Data(누락 값 정리) 모듈을 실험 캔버스로 끌어와서 Project Columns(프로젝트 열) 모듈과 연결 합니다. Properties(속성) 창에서, Cleaning mode아래의 Remove entire row(전체 행 제거)를 선택하여 누락 값이 있는 행을 제거하는 방법으로 데이터를 세탁합니다. 모듈을 두 번 클릭하고 주석 "Remove missing value rows"라는 코멘트를 입력합니다.

Clean Missing Data properties

4. Run the experiment by clicking RUN under the experiment canvas.
실험 캔버스 아래에서 RUN(실행)을 클릭하여 실험을 실행합니다.

When the experiment is finished, all the modules have a green check mark to indicate that they finished successfully. Notice also the Finished running status in the upper-right corner.
실험이 완료되면 모든 모듈에 성공적으로 완료되었음을 나타내는 녹색 확인 표시가 표시됩니다. 오른쪽 위 모서리에서 Finished running(실행 완료) 상태도 확인됩니다.

First experiment run

All we have done in the experiment to this point is clean the data. If you want to view the cleaned dataset, click the left output port of the Clean Missing Data module ("Cleaned dataset") and select Visualize. Notice that the normalized-losses column is no longer included, and there are no missing values.
이 시점까지 실험에서 수행된 것은 데이터 정리입니다. 정리된 데이터집합을 보려면, Clean Missing Data(누락 데이터 정리) 모듈(“정리된 데이터집합”)의 출력포트를 클릭하고 Visualize(시각화)를 선택합니다. 데이터가 정리되었으며, 이제 예측모델에서 사용할 특성을 지정할 수 있습니다.

Now that the data is clean, we're ready to specify what features we're going to use in the predictive model.
이제 데이터는 정리되었고 예측 모델에서 어떤 특징을 규정할 지 준비가 되었습니다.

Step 3: Define features 3단계: 특성 정의

In machine learning, features are individual measurable properties of something you’re interested in. In our dataset, each row represents one automobile, and each column is a feature of that automobile. Finding a good set of features for creating a predictive model requires experimentation and knowledge about the problem you want to solve. Some features are better for predicting the target than others. Also, some features have a strong correlation with other features (for example, city-mpg versus highway-mpg), so they will not add much new information to the model, and they can be removed.
ML에서 특성(features)은 관심 있는 부분에 대한 측정 가능한(measurable) 개별 속성(properties)입니다. 데이터집합의 각 행이 1대의 자동차를 나타내고 각 열은 이 자동차의 특성입니다. 예측모델을 만드는데 적절한 특성세트(set of features) 를 찾으려면 해결하려는 문제에 대한 실험과 지식이 있어야 합니다. 어떤 특성은 다른 특성보다 목표 예측에 더 유용합니다. 또한 어떤 특성은 다른 특성과 강력한 상관관계에 있는데(예: city-mpg와 highway-mpg), 이런 경우 해당 특성은 모델에 새 정보를 추가로 많이 주지 못하기 때문에 제거해도 됩니다.

Let's build a model that uses a subset of the features in our dataset. You can come back and select different features, run the experiment again, and see if you get better results. As a first guess, we'll select the following features (columns) with the Project Columns module. Note that for training the model, we need to include the price value that we're going to predict.
데이터집합에서 특성의 서브셋을 사용하는 모델을 구축해보겠습니다. 다시 돌아와서 다른 특성을 선택하고, 실험을 다시 실행하여, 더 나은 결과가 나오는지 확인할 수 있습니다. 첫 번째 추측으로서, Project Columns(프로젝트 열) 모듈을 사용하여 다음의 특성(열)을 선택합니다. 모델훈련을 위해서는 예측하고자 하는 price 값을 포함해야 합니다.

make, body-style, wheel-base, engine-size, horsepower, peak-rpm, highway-mpg, price

1. Drag another Project Columns module to the experiment canvas and connect it to the left output port of the Clean Missing Data module. Double-click the module and type "Select features for prediction."
다른 Project Columns 모듈을 실험 캔버스로 끌어와서 Clean Missing Data 모듈에 연결합니다. 모듈을 두 번 클릭 하고 "Select features for prediction"을 입력합니다.

2. Click Launch column selector in the Properties pane.
Properties 창에서 Launch column selector를 클릭합니다.

3. In the column selector, select No columns for Begin With, and then select Include and column names in the filter row. Enter our list of column names. This directs the module to pass through only columns that we specify.
column selector(열 선택기)에서 Begin With에 대한 No columns을 선택한 후, 필터 행에서 Include 및 column names을 선택합니다. 열 이름 목록을 입력합니다. 그러면 지정한 열만 전달하도록 모듈에게 지시 합니다.

TIP: Because we've run the experiment, the column definitions for our data have passed from the original dataset through the Clean Missing Data module. When you connect Project Columns to Clean Missing Data, the Project Columns module becomes aware of the column definitions in our data. When you click the column names box, a list of columns is displayed, and you can select the columns that you want to add to the list.

TIP: 실험을 이미 실행했기 때문에, 데이터에 대한 열 정의가 Clean Missing Data 모듈을 통해, 원본 데이터집합으로부터 전달 되었습니다. Project Columns을 Clean Missing Data에 연결하면 Project Columns 모듈이 데이터의 열 정의를 인식하게 됩니다. column names(열 이름) 상자를 클릭하면 열 목록이 표시 되고, 목록에 추가하려는 열을 선택할 수 있습니다.

4. Click the check mark (OK) button. 확인 표시(확인) 단추를 클릭합니다.

Select columns

This produces the dataset that will be used in the learning algorithm in the next steps. Later, you can return and try again with a different selection of features.
그러면 다음 단계의 학습 알고리즘에서 사용될 데이터집합이 생성됩니다. 나중에 돌아와서 다른 선택 기능을 사용하여 다시 시도할 수 있습니다.

Step 4: Choose and apply a learning algorithm 4단계: 학습 알고리즘 선택 및 적용

Now that the data is ready, constructing a predictive model consists of training and testing. We'll use our data to train the model and then test the model to see how close it's able to predict prices.
이제 데이터가 준비되었으면, 훈련과 테스팅으로 구성된 예측모델을 구성합니다. 데이터를 사용해서 모델을 훈련시킨 다음 모델을 테스트하여 그것이 얼마나 근접하게 가격을 예측할 수 있는지 봅니다.

Classification and regression are two types of supervised machine learning techniques. Classification is used to make a prediction from a defined set of values, such as a color (red, blue, or green). Regression is used to make a prediction from a continuous set of values, such as a person's age.
분류(Classification)와 회귀(regression)는 지도 기계학습 기술의 2가지 타입입니다. 분류는 색(빨강, 파랑 또는 녹색)과 같이 정의된 값 집합으로부터 예측하는 데 사용됩니다. 회귀는 사람의 나이와 같이 연속된 값 집합으로부터 예측하는 데 사용 됩니다.

We want to predict the price of an automobile, which can be any value, so we'll use a regression model. For this example, we'll train a simple linear regression model, and in the next step, we'll test it.
자동차가격은 임의의 값일 수 있기 때문에 자동차가격을 예측하기 위해 회귀모델을 사용하겠습니다. 이 예제에서는 간단한 선형회귀 (linear regression) 모델을 훈련시키고, 다음 단계에서 이를 테스트합니다.

1. We can use our data for both training and testing by splitting it into separate training and testing sets. Select and drag the Split Data module to the experiment canvas and connect it to the output of the last Project Columns module. Set Fraction of rows in the first output dataset to 0.75. This way, we'll use 75 percent of the data to train the model, and hold back 25 percent for testing.
데이터를 훈련용 집합과 테스팅용 집합으로 분할해서 훈련용과 테스트용 두 가지 목적으로 데이터를 사용합니다. Split Data(데이터분할) 모듈을 선택하여 실험 캔버스로 끌어다 놓고 마지막 Project Columns 모듈의 출력과 연결 합니다. 첫 번째 출력 데이터집합의 행 분수를 0.75로 설정합니다. 그러면 데이터의 75%를 모델 훈련에 사용하고 25%는 테스팅용으로 보유합니다.

TIP: By changing the Random seed parameter, you can produce different random samples for training and testing. This parameter controls the seeding of the pseudo-random number generator.
TIP: 임의의 씨앗 매개변수(Random seed parameter)를 변경하면, 훈련용과 테스팅용으로 여러 무작위 샘플을 생성할 수 있습니다. 이 매개변수는 유사-난수 발생기의 seeding(씨뿌리기)을 제어합니다.

2. Run the experiment. This allows the Project Columns and Split Data modules to pass column definitions to the modules we'll be adding next.
실험을 실행합니다. 그러면 Project Columns 모듈 및 Split Data 모듈이 다음에 추가할 모듈에 열 정의를 전달할 수 있습니다.

3. To select the learning algorithm, expand the Machine Learning category in the module palette to the left of the canvas, and then expand Initialize Model. This displays several categories of modules that can be used to initialize machine learning algorithms.
학습 알고리즘을 선택하려면 캔버스 왼쪽의 모듈 팔레트에서 Machine Learning 범주를 확장한 후 Initialize Model(모델 초기화)를 확장 합니다. 학습 알고리즘을 초기화하는 데 사용할 수 있는 몇 가지 범주의 모듈이 표시됩니다.

For this experiment, select the Linear Regression module under the Regression category (you can also find the module by typing "linear regression" in the palette Search box), and drag it to the experiment canvas.
이 실험에서 회귀 범주 아래에 있는 Linear Regression (선형회귀) 모듈을 선택하여(팔레트 검색 상자에 " linear regression"를 입력하여 모듈을 찾을 수도 있음) 실험 캔버스로 끌어 놓습니다.

4. Find and drag the Train Model module to the experiment canvas. Connect the left input port to the output of the Linear Regression module. Connect the right input port to the training data output (left port) of the Split Data module.
Train Model 모듈을 찾아 실험으로 끌어 놓습니다. 왼쪽 입력포트를 Linear Regression 모듈의 출력에 연결합니다. 오른쪽 입력 포트를 Split Data 모듈의 훈련 데이터 출력(왼쪽 포트)에 연결합니다.

5. Select the Train Model module, click Launch column selector in the Properties pane, and then select the price column. This is the value that our model is going to predict.
Train Model 모듈을 선택하고 Properties 창에서 Launch column selector을 클릭한 후 price 열을 선택합니다. 이것이 모델이 예측하고자 하는 값입니다.

Select "price" column

6. Run the experiment. 실험을 실행합니다.

The result is a trained regression model that can be used to score new samples to make predictions.
결과는 새 샘플의 점수를 매기고 예측하는 데 사용할 수 있는 훈련된 회귀모델입니다.

Applying the machine learning algorithm

Step 5: Predict new automobile prices 5단계: 새 데이터 예측

Now that we've trained the model using 75% of our data, we can use it to score the other 25% of the data to see how well our model functions.
모델을 훈련시켰으며, 이제 모델을 사용하여 나머지 25% 데이터의 점수를 매겨 모델 기능이 얼마나 좋은지 확인할 수 있습니다.

1. Find and drag the Score Model module to the experiment canvas and connect the left input port to the output of the Train Model module. Connect the right input port to the test data output (right port) of the Split Data module.

Score Model(모델채점) 모듈을 찾아서 실험 캔버스로 끌어와서 왼쪽 입력 포트를 Train Model (모델훈련) 모듈의 출력에 연결합니다. 오른쪽 입력 포트를 Split Data(데이터 분할) 모듈의 테스트 데이터 출력(오른쪽 포트)에 연결합니다.

Score Model module

2. To run the experiment and view the output from the Score Model module, click the output port, and then select Visualize. The output shows the predicted values for price and the known values from the test data.
실험을 실행해서 Score Model(모델채점) 모듈의 출력을 보려면 출력포트를 클릭하고 Visualize(시각화)를 선택합니다. 출력에 테스트 데이터에서 가져온 알려진 값과 함께 가격 예측 값이 표시됩니다.

3. Finally, to test the quality of the results, select and drag the Evaluate Model module to the experiment canvas, and connect the left input port to the output of the Score Model module. (There are two input ports because the Evaluate Model module can be used to compare two models.)
마지막으로 결과의 품질을 테스트하기 위해 Evaluate Model(모델평가) 모듈을 선택하고 실험 캔버스로 끌어온 후 왼쪽 입력포트를 Score Model(모델채점) 모듈의 출력에 연결합니다.(Evaluate Model(모델평가) 모듈은 2개의 모델을 비교하는 데 사용될 수 있으므로 2개의 입력 포트가 있습니다.)

4. Run the experiment. 실험을 실행합니다.

To view the output from the Evaluate Model module, click the output port and select Visualize. The following statistics are shown for our model:
실험을 실행하고 모델평가 모듈의 출력을 보려면 출력 포트를 클릭하고 시각화를 선택합니다. 모델에 대한 다음 통계 가 표시됩니다.

o Mean Absolute Error (MAE): The average of absolute errors (an error is the difference between the predicted value and the actual value).
MAE(절대평균오차): 절대평균오차입니다(error는 예측 값과 실제 값 사이의 차이).

o Root Mean Squared Error (RMSE): The square root of the average of squared errors of predictions made on the test dataset.
RMSE(제곱평균오차제곱근): 테스트 데이터집합에 대해 예측한 평균제곱오차의 제곱근입니다.

o Relative Absolute Error: The average of absolute errors relative to the absolute difference between actual values and the average of all actual values.
RAE(상대절대오차): 실제 값과 모든 실제 값 평균 사이의 절대값 차에 관련된 상대적 절대 오차의 평균입니다.

o Relative Squared Error: The average of squared errors relative to the squared difference between the actual values and the average of all actual values.
RSE(상대제곱오차): 실제 값과 모든 실제 값 평균 사이의 제곱 차에 대해 상대적인 평균 제곱 오차입니다.

o Coefficient of Determination: Also known as the R squared value, this is a statistical metric indicating how well a model fits the data.
결정 계수: R 제곱 값이라고도 하며, 모델이 데이터에 얼마나 적합한지 나타내는 통계 메트릭입니다.

For each of the error statistics, smaller is better. 각 오차 통계는 작을수록 좋습니다.
A smaller value indicates that the predictions more closely match the actual values.
값이 작을수록 예측이 실제 값과 더 근접하게 일치함을 나타냅니다.
For Coefficient of Determination, the closer its value is to one (1.0), the better the predictions.
결정 계수의 경우 값이 1(1.0)에 가까울수록 더 잘 예측한 것입니다.

Evaluation results

The final experiment should look like this: 최종 실험은 다음과 같습니다.

Machine learning tutorial: Complete linear regression experiment that uses predictive modeling techniques.

What's next? 다음 작업

Now that you've completed a first machine learning tutorial and have your experiment set up, you can iterate to try to improve the model. For instance, you can change the features you use in your prediction. Or you can modify the properties of the Linear Regression algorithm or try a different algorithm altogether. You can even add multiple machine learning algorithms to your experiment at one time and compare two by using the Evaluate Model module.
이제 첫 번째 ML 교본을 완료했고, 실험을 설정했으며, 이제 반복을 통해 모델을 향상시킬 수 있습니다. 예를 들어 예측에 사용하는 특성을 변경할 수 있습니다. 또는 선형 회귀 알고리즘의 속성을 수정하거나 다른 알고리즘을 시도해 볼 수 있습니다. 한 번에 여러 알고리즘을 실험에 추가하고 모델 평가 모듈을 사용하여 2개를 비교할 수도 있습니다.

TIP: Use the SAVE AS button under the experiment canvas to copy any iteration of your experiment. You can see all the iterations of your experiment by clicking VIEW RUN HISTORY under the canvas. See Manage experiment iterations in Azure Machine Learning Studiofor more details.
TIP: 실험 반복의 복사본을 만들려면 실험 캔버스 아래에 있는 다른 이름으로 저장 단추를 사용하세요. 캔버스 아래에서 실행기록 보기를 클릭하여 실험의 모든 반복을 볼 수 있습니다. 자세한 내용은 Azure ML스튜디오에서 실험 반복 관리를 참조하세요.

When you're satisfied with your model, you can deploy it as a web service to be used to predict automobile prices by using new data. See Deploy an Azure Machine Learning web service for more details.
For a more extensive and detailed walkthrough of predictive modeling techniques for creating, training, scoring, and deploying a model, see Develop a predictive solution by using Azure Machine Learning.
모델이 마음에 들면 모델을 새 데이터를 사용한 자동차가격 예측에 사용할 웹서비스로 게시할 수 있습니다. 자세한 내용은 Azure ML 웹서비스 게시를 참조하세요.
예측 모델 만들기, 훈련, 채점 및 게시에 대한 보다 광범위하고 자세한 연습은 Azure ML을 사용하여 예측 솔루션 개발을 참조하세요.

저작자표시

'AZ ml' 카테고리의 다른 글

음식점 추천 기계학습 모델 결과 해석하기 (6) (0)	2016.01.13
기계학습의 모델결과 해석방법 (5) (0)	2016.01.13
신용위험 예측 기계학습 모델 만들기 (4) (0)	2016.01.13
What is Azure Machine Learning Studio? Azure기계학습 스튜디오란 무엇인가요? (2) (0)	2016.01.06
Introduction to machine learning on Microsoft Azure MS 애저의 기계학습 소개 (1) (0)	2016.01.06

머신러닝 GPT

기계학습 교본: Azure ML스튜디오에서 첫 번째 실험 만들기 (3)