Step {{ out.getCurrentStep(snippetData) }}/{{ out.getStepCount(snippetData) }} - Partition {{ out.getPartitionsSnippetStateSize(snippetData, 'DONE', 'FAILED', 'ABORTED') }}/{{ out.getTotalAmountOfPartitions(snippetData) }} - {{snippetData.trainInfo.resumed ? "resumed" : "started"}} {{snippetData.trainInfo.startTime | friendlyTimeDeltaHHMMSS }} ago - training epoch {{snippetData.modelTrainingInfo.currentEpoch + 1}}{{snippetData.modelTrainingInfo.nbEpochs ? '/'+snippetData.modelTrainingInfo.nbEpochs : ''}}
Done {{snippetData.trainInfo.endTime||snippetData.trainInfo.startTime+snippetData.trainInfo.trainingTime | friendlyTimeDeltaShort}} ({{snippetData.trainInfo.endTime|date:'yyyy-MM-dd HH:mm:ss'}}) - {{out.getPartitionsSnippetStateSize(snippetData, 'ABORTED')}} aborted
Overrides ({{ModelDataUtils.countOverrides(snippetData)}})
Trained {{snippetData.timeCreated | friendlyTimeDeltaShort}} ({{snippetData.timeCreated|date:'yyyy-MM-dd HH:mm:ss'}})
Imported {{snippetData.importedOn | friendlyTimeDeltaShort}} ({{snippetData.importedOn|date:'yyyy-MM-dd HH:mm:ss'}})
Created {{snippetData.importedOn | friendlyTimeDeltaShort}} ({{snippetData.importedOn|date:'yyyy-MM-dd HH:mm:ss'}})
Created {{snippetData.creationTag.lastModifiedOn | friendlyTimeDeltaShort}} ({{snippetData.creationTag.lastModifiedOn | date:'yyyy-MM-dd HH:mm:ss'}})
Will start soon
Failed
Aborted
Suspending optimization…
View logs
ACTIONS
Active version
Partitions {{ out.getTotalAmountOfPartitions(snippetData) }}
Running {{ out.getPartitionsSnippetStateSize(snippetData, 'RUNNING') }}
Trained {{ out.getPartitionsSnippetStateSize(snippetData, 'DONE') }}
Pending {{ out.getPartitionsSnippetStateSize(snippetData, 'PENDING') }}
Failed {{ out.getPartitionsSnippetStateSize(snippetData, 'FAILED') }}
Aborted {{ out.getPartitionsSnippetStateSize(snippetData, 'ABORTED') }}
Partitions {{ out.getTotalAmountOfPartitions(snippetData) }}
Trained {{ out.getPartitionsSnippetStateSize(snippetData, 'DONE') }}
Failed {{ out.getPartitionsSnippetStateSize(snippetData, 'FAILED') }}
Re-used {{ out.getPartitionsSnippetStateSize(snippetData, 'REUSED_DONE', 'REUSED_FAILED', 'REUSED_ABORTED') }}

Training will start soon…

Partition
Metric
Status
{{ s.name }}
{{ s.summary.snippet.mainMetric | mlMetricFormat :currentMetric :(currentMetric.substr(0, 3) === 'NB_' ? 0 : 3) :snippetData.mainMetricStd }}

Training failed

{{snippetData.trainInfo.failure.detailedMessage}}
Read the overall model logs or the logs for partition: {{summaries[0].name}} - More info might be available in the backend log (ask your admin)

Training aborted.

{{f.key}} {{f.value}}
{{key | prettyConfigOption}} {{value}}
Number of epochs {{ snippetData.llmSMInfo.nbEpochs }}
Learning rate {{ snippetData.llmSMInfo.learningRateMultiplier }}
Batch size {{ snippetData.llmSMInfo.batchSize }}
LoRA rank {{ snippetData.llmSMInfo.loraRank }}
LoRA alpha {{ snippetData.llmSMInfo.loraAlpha }}
LoRA dropout {{ snippetData.llmSMInfo.loraDropout }}
Number of epochs {{ snippetData.llmSMInfo.nbEpochs }}
Batch size {{ snippetData.llmSMInfo.batchSize }}

No model-specific details available.

Most important features
Most important variables
Top coefficients
Cluster sizes

Training will start soon…

Training aborted.

Training failed

Read the logs - More info might be available in the backend log (ask your admin)
{{snippetData.trainInfo.failure.detailedMessage}}