Cloudera Machine Learning (CML)
This article explains the steps to deploy the CML service on the Kubernetes platform after successful configuration of the CDP PvC Data Services Management Console.
CML Deployment
In CM, navigate to
Data Services
. ClickOpen CDP Private Cloud Data Services
.The system will redirect the browser to the following page. Click
Machine Learning
.Navigate to
ML Workspaces
. ClickProvision Workspace
.Fill in the fields as shown in the following example.
The outcome of the workspace creation should be successful as shown below. Click
workspace1
.The system will redirect the browser to the CML dashboard.
AMP Project Creation
Navigate to
AMP
. You may select one of the Applied ML Prototypes. In this demo, clickDeep Learning for Image Analysis
.Click
Configure Project
.Select the parameters. Click
Launch Project
.Click
Open
to browse the application as shown in the following animated gif. The application user may click on any image at the lower section and then the application will analyze the library and subsequently display a list of similar image’s type that was clicked by the user.
CML Artifacts inside ECS Platform
# kubectl get pv | head -1 ; kubectl get pv | grep cdp-env-1
NAME CAPACITY ACCESS MODES RECLAIM POLICY STATUS CLAIM STORAGECLASS REASON AGE
pvc-7022f231-463b-4622-8a21-15f7a729b7d1 2Gi RWO Delete Bound cdp-env-1-d668dab9-monitoring-platform/storage-volume-monitoring-prometheus-alertmanager-0 longhorn 66s
pvc-946e99e2-0ef1-4ba5-8504-246f02c3142f 60Gi RWO Delete Bound cdp-env-1-d668dab9-monitoring-platform/monitoring-prometheus-server longhorn 68s
pvc-c96965bb-b564-4cb5-b373-0fbcc7ead110 2Gi RWO Delete Bound cdp-env-1-d668dab9-monitoring-platform/storage-volume-monitoring-prometheus-alertmanager-1 longhorn 11s
# kubectl get ns
NAME STATUS AGE
cdp Active 8h
cdp-env-1-d668dab9-monitoring-platform Active 2m16s
default Active 8h
default-43e25dc6-monitoring-platform Active 7h55m
ecs-webhooks Active 8h
infra-prometheus Active 8h
kube-node-lease Active 8h
kube-public Active 8h
kube-system Active 8h
kubernetes-dashboard Active 8h
local-path-storage Active 8h
longhorn-system Active 8h
shared-services Active 3h50m
vault-system Active 8h
yunikorn Active 8h
# kubectl -n cdp-env-1-d668dab9-monitoring-platform get pods
NAME READY STATUS RESTARTS AGE
monitoring-cm-health-exporter-59dc48c4d-twhnj 3/3 Running 0 2m56s
monitoring-logger-alert-receiver-c89955fb7-zgxn5 2/2 Running 0 2m56s
monitoring-metrics-server-exporter-94c45955b-pwsb6 2/2 Running 0 2m56s
monitoring-platform-proxy-c5f8f5578-zglkb 2/2 Running 0 2m56s
monitoring-prometheus-alertmanager-0 3/3 Running 0 2m56s
monitoring-prometheus-alertmanager-1 3/3 Running 0 119s
monitoring-prometheus-kube-state-metrics-66f7f6f9b4-tzrfs 2/2 Running 0 52s
monitoring-prometheus-pushgateway-64b5765874-9n8c4 2/2 Running 0 2m56s
monitoring-prometheus-server-c4b8b4456-8lcz9 3/3 Running 0 2m56s
snmp-notifier-86c7564b69-bkdwm 2/2 Running 0 52s
# kubectl -n cdp-env-1-d668dab9-monitoring-platform get pvc
NAME STATUS VOLUME CAPACITY ACCESS MODES STORAGECLASS AGE
monitoring-prometheus-server Bound pvc-946e99e2-0ef1-4ba5-8504-246f02c3142f 60Gi RWO longhorn 3m13s
storage-volume-monitoring-prometheus-alertmanager-0 Bound pvc-7022f231-463b-4622-8a21-15f7a729b7d1 2Gi RWO longhorn 3m13s
storage-volume-monitoring-prometheus-alertmanager-1 Bound pvc-c96965bb-b564-4cb5-b373-0fbcc7ead110 2Gi RWO longhorn 2m16s
# kubectl -n workspace1 get pvc
NAME STATUS VOLUME CAPACITY ACCESS MODES STORAGECLASS AGE
livelog-data-livelog-0 Bound pvc-fd0b773a-e785-4b5c-afc1-fcd98afcc56b 1Ti RWO longhorn 16s
model-metrics-data-model-metrics-db-0 Bound pvc-05b25f2a-29cd-4341-8c69-a83c7e70efef 100Gi RWO longhorn 16s
persist-dir-secret-generator-0 Bound pvc-a16af508-e380-4243-944c-1473d2a6b711 10Mi RWO longhorn 16s
postgres-data-versioned-db-0 Bound pvc-39d799cf-6773-46f6-8f16-16778e008824 1Ti RWO longhorn 16s
projects-pvc Bound projects-share-workspace1 1Ti RWX 16s
s2i-git-server-repos-s2i-git-server-0 Bound pvc-cb5c9ac7-0201-44be-aeb5-dec057db5870 1Ti RWO longhorn 16s
s2i-queue-pvc Bound pvc-2394a6d2-eecf-452e-b17f-4ab477bb1310 2Gi RWO longhorn 16s
# kubectl get pv | head -1 ; kubectl get pv | grep workspace1
NAME CAPACITY ACCESS MODES RECLAIM POLICY STATUS CLAIM STORAGECLASS REASON AGE
projects-share-workspace1 1Ti RWX Retain Bound workspace1/projects-pvc 49s
pvc-05b25f2a-29cd-4341-8c69-a83c7e70efef 100Gi RWO Delete Bound workspace1/model-metrics-data-model-metrics-db-0 longhorn 36s
pvc-2394a6d2-eecf-452e-b17f-4ab477bb1310 2Gi RWO Delete Bound workspace1/s2i-queue-pvc longhorn 43s
pvc-39d799cf-6773-46f6-8f16-16778e008824 1Ti RWO Delete Bound workspace1/postgres-data-versioned-db-0 longhorn 39s
pvc-a16af508-e380-4243-944c-1473d2a6b711 10Mi RWO Delete Bound workspace1/persist-dir-secret-generator-0 longhorn 39s
pvc-cb5c9ac7-0201-44be-aeb5-dec057db5870 1Ti RWO Delete Bound workspace1/s2i-git-server-repos-s2i-git-server-0 longhorn 37s
pvc-fd0b773a-e785-4b5c-afc1-fcd98afcc56b 1Ti RWO Delete Bound workspace1/livelog-data-livelog-0 longhorn 39s
# kubectl -n workspace1 get pods
NAME READY STATUS RESTARTS AGE
api-6945bd58bc-m64kv 1/1 Running 3 21m
cron-657dbd8b65-csfjv 2/2 Running 0 21m
db-0 2/2 Running 0 21m
db-migrate-2.0.28-b66-5j2pk 0/1 Completed 0 21m
ds-cdh-client-575bcc8799-54s8h 3/3 Running 0 21m
ds-operator-94fb98c6b-rkqjq 2/2 Running 1 21m
ds-reconciler-f7486fbd7-nll5t 2/2 Running 6 21m
ds-vfs-7478cffd6-mctqk 2/2 Running 0 21m
feature-flags-5857fc68d-nk8lg 2/2 Running 0 21m
fluentd-forwarder-5848c4b87d-zq9ln 1/1 Running 0 21m
hadoop-cli-7.2.10-hf1-t94d-qgjxr 0/1 Completed 0 8m48s
hadoop-cli-7.2.11-hf4-vfc03-mxpxr 0/1 Completed 0 8m48s
hadoop-cli-7.2.8-hf1-5auu1d-4fngq 0/1 Completed 0 8m48s
livelog-0 2/2 Running 0 21m
livelog-publisher-ccq7k 2/2 Running 0 21m
livelog-publisher-r66pm 2/2 Running 0 21m
mlx-workspace1-pod-evaluator-765985c848-mvn54 1/1 Running 0 21m
model-metrics-6f74997bf7-bzkkn 1/1 Running 4 21m
model-metrics-db-0 1/1 Running 0 21m
model-proxy-7fb9bfcb6d-x7z5p 2/2 Running 0 21m
prometheus-postgres-exporter-5c7b57bf66-8nd7v 1/1 Running 0 21m
runtime-addon-trigger-2.0.28-b66-nhqcf 0/1 Completed 6 21m
runtime-initial-repo-inserter-2.0.28-b66-w9jgn 0/1 Completed 0 21m
runtime-manager-c5844d5f9-rcspl 2/2 Running 0 21m
s2i-builder-5fd4c7bc59-25q5p 2/2 Running 0 21m
s2i-builder-5fd4c7bc59-74l99 2/2 Running 0 21m
s2i-builder-5fd4c7bc59-94lhz 2/2 Running 0 21m
s2i-client-57c7d8564-nhstd 2/2 Running 0 21m
s2i-git-server-0 2/2 Running 0 21m
s2i-queue-0 2/2 Running 0 21m
s2i-server-d79fdb6f8-m8t2s 2/2 Running 0 21m
secret-generator-0 2/2 Running 0 21m
spark247-13-hf2-5uhfhd-t5v7k 0/1 Completed 0 8m48s
spark311-13-hf2-mx7m2j-57xbh 0/1 Completed 0 8m48s
tcp-ingress-controller-59849bb8d5-zslzd 2/2 Running 8 21m
usage-reporter-7449585bf5-lgc2c 2/2 Running 0 21m
web-5cb768ddc4-9qs4k 2/2 Running 1 21m
web-5cb768ddc4-dzbbv 2/2 Running 1 21m
web-5cb768ddc4-qdhg2 2/2 Running 1 21m
# kubectl get ns
NAME STATUS AGE
cdp Active 10h
cdp-env-1-85b71ddc-monitoring-platform Active 3h17m
default Active 11h
default-470e0dec-monitoring-platform Active 10h
ecs-webhooks Active 11h
infra-prometheus Active 10h
kube-node-lease Active 11h
kube-public Active 11h
kube-system Active 11h
kubernetes-dashboard Active 10h
local-path-storage Active 10h
longhorn-system Active 10h
shared-services Active 3h16m
vault-system Active 10h
workspace1 Active 24m
workspace1-user-1 Active 2m15s
yunikorn Active 10h
# kubectl -n workspace1-user-1 get pods
NAME READY STATUS RESTARTS AGE
nv8ou6d2xskx8boy 4/4 Running 0 2m38s
# kubectl -n workspace1-user-1 get pvc
NAME STATUS VOLUME CAPACITY ACCESS MODES STORAGECLASS AGE
28f3a063 Bound 3eda7502 20Gi RWX cdsw-storageclass-whiteout 2m50s
5bdf02aa Bound b07df1d0 20Gi RWX cdsw-storageclass-whiteout 2m50s
8ed58984 Bound bdc25cd8 20Gi RWX cdsw-storageclass-whiteout 2m50s
abe060d7 Bound 338b9a4c 20Gi RWX cdsw-storageclass-whiteout 2m50s
# ll /var/nfsshare/
total 0
drwxr-xr-x 8 8536 8536 204 May 22 10:54 addons
drwxr-xr-x 5 8536 8536 78 May 22 11:07 cdn
drwxr-xr-x 2 8536 8536 6 May 22 10:43 output
drwxr-xr-x 3 8536 8536 15 May 22 11:01 projects
drwxr-xr-x 2 8536 8536 6 May 22 10:43 scratch