Prepare Data for DL Training

Azure Databricks recommends the TFRecord format as the data source for deep learning with TensorFlow. TFRecord is a simple record-oriented binary format that many TensorFlow applications use for training data.
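
As a rough illustration of what "record-oriented" means here, the sketch below frames byte payloads using the layout given in the TensorFlow documentation for TFRecord files: a little-endian 64-bit length, a masked CRC-32C of that length, the payload bytes, and a masked CRC-32C of the payload. The helper names (crc32c, masked_crc, frame_record, read_records) are hypothetical and exist only for this example. Real pipelines should write files with TensorFlow's own writer rather than hand-rolled code like this.

```python
import struct


def crc32c(data: bytes) -> int:
    """Bitwise CRC-32C (Castagnoli). Slow, but dependency-free."""
    crc = 0xFFFFFFFF
    for byte in data:
        crc ^= byte
        for _ in range(8):
            crc = (crc >> 1) ^ (0x82F63B78 if crc & 1 else 0)
    return crc ^ 0xFFFFFFFF


def masked_crc(data: bytes) -> int:
    """Apply the rotate-and-add mask that TFRecord uses for stored CRCs."""
    crc = crc32c(data)
    rotated = ((crc >> 15) | (crc << 17)) & 0xFFFFFFFF
    return (rotated + 0xA282EAD8) & 0xFFFFFFFF


def frame_record(payload: bytes) -> bytes:
    """Wrap one payload as: length, CRC of length, payload, CRC of payload."""
    header = struct.pack("<Q", len(payload))
    return (
        header
        + struct.pack("<I", masked_crc(header))
        + payload
        + struct.pack("<I", masked_crc(payload))
    )


def read_records(buf: bytes):
    """Yield payloads from a buffer of framed records, verifying both CRCs."""
    offset = 0
    while offset < len(buf):
        header = buf[offset:offset + 8]
        (length,) = struct.unpack("<Q", header)
        (length_crc,) = struct.unpack("<I", buf[offset + 8:offset + 12])
        if length_crc != masked_crc(header):
            raise ValueError("corrupted length header")
        start = offset + 12
        payload = buf[start:start + length]
        (data_crc,) = struct.unpack("<I", buf[start + length:start + length + 4])
        if data_crc != masked_crc(payload):
            raise ValueError("corrupted record payload")
        yield payload
        offset = start + length + 4


# Round-trip a few illustrative payloads through the framing.
records = [b"first", b"", b"third record"]
blob = b"".join(frame_record(r) for r in records)
decoded = list(read_records(blob))
```

Each record carries its own length and checksums, so a reader can stream through a file one record at a time without an index. In training data the payload is typically a serialized tf.train.Example message rather than raw bytes.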

tf.data.TFRecordDataset is the TensorFlow dataset class that reads records from one or more TFRecord files. For more details about how to consume TFRecord data, see the TensorFlow guide Consuming TFRecord data.
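
A minimal sketch of this workflow follows, assuming TensorFlow 2.x is installed. It writes a few tf.train.Example records to a temporary file and reads them back through tf.data.TFRecordDataset. The file path and the feature names ("label", "value") are made up for illustration and do not come from any Databricks example.

```python
import os
import tempfile

import tensorflow as tf

# Hypothetical output location; on Databricks this would usually be a path
# in your own storage instead of a temporary directory.
path = os.path.join(tempfile.mkdtemp(), "example.tfrecord")

# Write three records, each a serialized tf.train.Example.
with tf.io.TFRecordWriter(path) as writer:
    for i in range(3):
        example = tf.train.Example(
            features=tf.train.Features(
                feature={
                    "label": tf.train.Feature(
                        int64_list=tf.train.Int64List(value=[i])
                    ),
                    "value": tf.train.Feature(
                        float_list=tf.train.FloatList(value=[i * 0.5])
                    ),
                }
            )
        )
        writer.write(example.SerializeToString())

# Describe the features so each serialized record can be parsed into tensors.
feature_spec = {
    "label": tf.io.FixedLenFeature([], tf.int64),
    "value": tf.io.FixedLenFeature([], tf.float32),
}


def parse(serialized):
    """Decode one serialized tf.train.Example into a dict of tensors."""
    return tf.io.parse_single_example(serialized, feature_spec)


# Read the file back as a dataset of parsed, batched examples.
dataset = tf.data.TFRecordDataset(path).map(parse).batch(2)

labels = [int(x) for batch in dataset for x in batch["label"].numpy()]
```

The dataset yields raw serialized strings until the map step parses them, so the feature specification has to match the features that were written to the file.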

The following topics describe and illustrate the recommended ways to save your data to TFRecord files:

The following topic describes and illustrates the recommended way to load TFRecord files: