考慮使用 CIFAR-10 資料集進行資料處理,資料包括 32×32 像素的多筆彩色照片。下列程式碼的資料處理,請選出正確的選項。 ```python from tensorflow.keras import datasets, utils import pandas as pd (x_train, y_train), (x_test, y_test) = datasets.cifar10.load_data() # type(x_train) -> numpy.ndarray # x_train.shape, y_train.shape, x_test.shape, y_test.shape # -> (50000, 32, 32, 3) (50000, 1) (10000, 32, 32, 3) (10000, 1) # x_train.min() -> 0 # x_train.max() -> 255 ```
iPAS 考題解析
考慮使用 CIFAR-10 資料集進行資料處理,資料包括 32×32 像素的多筆彩色照片。下列程式碼的資料處理,請選出正確的選項。 ```python from tensorflow.keras import datasets, utils import pandas as pd (x_train, y_train), (x_test, y_test) = datasets.cifar10.load_data() # type(x_train) -> numpy.ndarray # x_train.shape, y_train.shape, x_test.shape, y_test.shape # -> (50000, 32, 32, 3) (50000, 1) (10000, 32, 32, 3) (10000, 1) # x_train.min() -> 0 # x_train.max() -> 255 ```
- A. 訓練集(x_train)資料集個數為 100000 筆
- B. 測試集(x_test)資料集個數為 10000 筆 ✓ 正確答案
- C. 訓練集(x_train)是 Pandas 資料框(DataFrame)物件
- D. 如果希望將訓練集(x_train)像素值轉換為[0, 1]的範圍,則可以輸入:x_train = x_train.astype('int32') / 255.0
詳細解析
由程式碼輸出可知:x_train.shape = (50000, 32, 32, 3),訓練集有 50000 筆;x_test.shape = (10000, 32, 32, 3),測試集有 10000 筆(B 正確)。type(x_train) = numpy.ndarray,不是 Pandas DataFrame(C 錯誤)。像素值歸一化應用 float32 而非 int32:x_train = x_train.astype('float32') / 255.0(D 錯誤)。