特徴量 - yuuk1's Digital Garden

# 特徴量 Created: October 9, 2020 12:09 PM Tags: terminology > データをテーブルであると考えるとよいだろう。それぞれのデータポイント(個々のメール、個々の顧客、個々のトランザクション)が行になり、データポイントを記述する個々の特性(例えば、顧客の年齢や、トランザクションの量や、トランザクションが行われた場所など)が列になる。 > 個々のエンティティもしくは行を、機械学習では**サンプル**(もしくはデータポイント)と呼び、エンティティの持つ特性を表現する列を**特徴量**と呼ぶ。 [[📖Pythonではじめる機械学習]] > A feature is an input variable—the x variable in simple linear regression. A simple machine learning project might use a single feature, while a more sophisticated machine learning project could use millions of features, specified as: $x_{1}, x_{2}, ... x_{N}$ > In the spam detector example, the features could include the following: - words in the email text - sender's address - time of day the email was sent - email contains the phrase "one weird trick." [Framing: Key ML Terminology | Machine Learning Crash Course](https://developers.google.com/machine-learning/crash-course/framing/ml-terminology) --- 特徴量が入力変数である一方で、[[ラベル]] が予測する結果の出力である。