{ "cells": [ { "cell_type": "markdown", "metadata": {}, "source": [ "This notebooks demonstrate how to execute parallel machine learning training using [`dask-ml`](https://ml.dask.org/) and motrainer.\n", "\n", "The example dataset `./example1_data.zarr/` can be generated using the following Jupyter Notebook:\n", "- [Covert a nested DataFrame to a Dataset](https://vegewaterdynamics.github.io/motrainer/notebooks/example_read_from_one_df/)" ] }, { "cell_type": "code", "execution_count": 1, "metadata": {}, "outputs": [], "source": [ "import motrainer\n", "import numpy as np\n", "import xarray as xr" ] }, { "cell_type": "markdown", "metadata": {}, "source": [ "## Load data" ] }, { "cell_type": "code", "execution_count": 2, "metadata": {}, "outputs": [ { "data": { "text/html": [ "
<xarray.Dataset>\n", "Dimensions: (space: 5, time: 8506)\n", "Coordinates:\n", " latitude (space) float64 dask.array<chunksize=(5,), meta=np.ndarray>\n", " longitude (space) float64 dask.array<chunksize=(5,), meta=np.ndarray>\n", " * time (time) datetime64[ns] 2007-01-02 ... 2020-01-01T01:00:00\n", "Dimensions without coordinates: space\n", "Data variables:\n", " BIOMA1 (space, time) float64 dask.array<chunksize=(3, 8506), meta=np.ndarray>\n", " BIOMA2 (space, time) float64 dask.array<chunksize=(3, 8506), meta=np.ndarray>\n", " TG1 (space, time) float64 dask.array<chunksize=(3, 8506), meta=np.ndarray>\n", " TG2 (space, time) float64 dask.array<chunksize=(3, 8506), meta=np.ndarray>\n", " TG3 (space, time) float64 dask.array<chunksize=(3, 8506), meta=np.ndarray>\n", " WG1 (space, time) float64 dask.array<chunksize=(3, 8506), meta=np.ndarray>\n", " WG2 (space, time) float64 dask.array<chunksize=(3, 8506), meta=np.ndarray>\n", " WG3 (space, time) float64 dask.array<chunksize=(3, 8506), meta=np.ndarray>\n", " curv (space, time) float64 dask.array<chunksize=(3, 8506), meta=np.ndarray>\n", " sig (space, time) float64 dask.array<chunksize=(3, 8506), meta=np.ndarray>\n", " slop (space, time) float64 dask.array<chunksize=(3, 8506), meta=np.ndarray>\n", "Attributes:\n", " license: data license\n", " source: data source