Logo

Getting Started

  • Installation
    • Docker
    • Verification
  • Using Composer
    • Functional API
    • MosaicML Trainer
  • Welcome Tour
    • Our First Method!
    • A Simple Instrumented Trainer
    • Introducing… Events, Engines, and State
    • Next: The MosaicML Trainer
  • Custom Models and Datasets
    • Models
    • Datasets
    • Trainer init
    • Trainer with YAHP
  • Distributed Training (DDP)
    • Single-Node Example
    • Detailed Usage

core

  • composer.Algorithm
  • composer.Callback
    • Callback
    • RankZeroCallback
  • composer.Engine
    • Trace
  • composer.Event
    • Events List
    • API Reference
  • composer.Logger
    • BaseLoggerBackendHparams
    • BaseLoggerBackend
    • RankZeroLoggerBackend
  • composer.State
  • composer.core.surgery
  • composer.types
    • Tensor Types
    • Batch Types
    • Dataset and Data Loader Types
    • Trainer Types
    • Miscellaneous Types

composer

  • composer.algorithms
    • Alibi
    • Augmix
    • BlurPool
    • Channels Last
    • ColOut
    • CutOut
    • Ghost Batch Normalization
    • Label Smoothing
    • Layer Freezing
    • MixUp
    • Progressive Resizing
    • RandAugment
    • Sequence Length Warmup
    • Sharpness-Aware Minimization
    • Scaling the Learning Rate Schedule
    • Selective Backpropagation
    • Squeeze-and-Excitation
    • Stochastic Depth
    • Stochastic Weight Averaging
  • composer.callbacks
    • Callbacks
    • Callback Hyperparameters
  • composer.datasets
    • Base Classes and Hyperparameters
    • Datasets
  • composer.functional
    • composer.algorithms.functional.augment_and_mix
    • composer.algorithms.functional.apply_blurpool
    • composer.algorithms.functional.apply_alibi
    • composer.algorithms.functional.colout
    • composer.algorithms.functional.cutout
    • composer.algorithms.functional.smooth_labels
    • composer.algorithms.functional.freeze_layers
    • composer.algorithms.functional.apply_ghost_batchnorm
    • composer.algorithms.functional.mixup_batch
    • composer.algorithms.functional.resize_inputs
    • composer.algorithms.functional.randaugment
    • composer.algorithms.functional.scale_scheduler
    • composer.algorithms.functional.selective_backprop
    • composer.algorithms.functional.apply_se
    • composer.algorithms.functional.apply_seq_length_warmup
    • composer.algorithms.functional.apply_stochastic_depth
  • composer.loggers
    • Backends
    • Backend Hyperparameters
  • composer.models
    • Base Models
    • Image Models
    • Language Models
    • Metrics and Loss Functions
  • composer.optim
    • DecoupledSGDW
    • DecoupledAdamW
  • composer.optim.scheduler
    • WarmUpLR
    • ConstantLR
    • ComposedScheduler
  • composer.trainer
    • Examples
    • Trainer Hparams
    • API Reference
  • composer.trainer.devices
    • Devices
    • Device Dataloaders

Methods Library

  • ALiBi
    • TL;DR
    • Attribution
    • Applicable Settings
    • Hyperparameters
    • Example Effects
    • Implementation Details
    • Suggested Hyperparameters
    • Considerations
    • Composability
    • Code
  • AugMix
    • TL;DR
    • Attribution
    • Applicable Settings
    • Hyperparameters
    • Example Effects
    • Implementation Details
    • Suggested Hyperparameters
    • Considerations
    • Composability
    • Code
  • BlurPool
    • TL;DR
    • Attribution
    • Code and Hyperparameters
    • Applicable Settings
    • Example Effects
    • Implementation Details
    • Suggested Hyperparameters
    • Considerations
    • Composability
    • Acknowledgments
    • Code
  • Channels Last
    • TL;DR
    • Attribution
    • Applicable Settings
    • Hyperparameters
    • Example Effects
    • Implementation Details
    • Considerations
    • Composability
    • Code
  • ColOut
    • TL;DR
    • Attribution
    • Applicable Settings
    • Hyperparameters
    • Example Effects
    • Implementation Details
    • Suggested Hyperparameters
    • Considerations
    • Composability
    • Code
  • Cutout
    • TL;DR
    • Attribution
    • Applicable Settings
    • Hyperparameters
    • Example Effects
    • Implementation Details
    • Suggested Hyperparameters
    • Considerations
    • Composability
    • Code
  • Decoupled Weight Decay
    • TL;DR
    • Attribution
    • Code and Hyperparameters
    • Applicable Settings
    • Implementation Details
    • Considerations
    • Composability
  • Ghost BatchNorm
    • TL;DR
    • Attribution
    • Hyperparameters
    • Applicable Settings
    • Example Effects
    • Implementation Details
    • Suggested Hyperparameters
    • Considerations
    • Composability
    • Code
  • Label Smoothing
    • TL;DR
    • Attribution
    • Applicable Settings
    • Hyperparameters
    • Example Effects
    • Implementation Details
    • Suggested Hyperparameters
    • Considerations
    • Composability
    • Code
  • Layer Freezing
    • TL;DR
    • Attribution
    • Hyperparameters
    • Applicable Settings
    • Example Effects
    • Implementation Details
    • Suggested Hyperparameters
    • Considerations
    • Composability
    • Code
  • MixUp
    • TL;DR
    • Attribution
    • Hyperparameters
    • Example Effects
    • Implementation Details
    • Suggested Hyperparameters
    • Considerations
    • Composability
    • Code
  • Progressive Image Resizing
    • TL;DR
    • Attribution
    • Applicable Settings
    • Hyperparameters
    • Example Effects
    • Implementation Details
    • Suggested Hyperparameters
    • Considerations
    • Composability
    • Detailed Results
    • Code
  • RandAugment
    • TL;DR
    • Attribution
    • Applicable Settings
    • Hyperparameters
    • Example Effects
    • Implementation Details
    • Suggested Hyperparameters
    • Considerations
    • Composability
    • Code
  • Scale Schedule
    • TL;DR
    • Attribution
    • Hyperparameters
    • Example Effects
    • Implementation Details
    • Suggested Hyperparameters
    • Composability
    • Code
  • Scaling Laws
    • TL;DR
    • Attribution
    • Applicable Settings
    • Hyperparameters
    • Detailed Results
    • Considerations
    • Effects & Implications
  • Selective Backprop
    • TL;DR
    • Attribution
    • Applicable Settings
    • Hyperparameters
    • Example Effects
    • Implementation Details
    • Suggested Hyperparameters
    • Considerations
    • Composability
    • Detailed Results
    • Code
  • Sequence Length Warmup
    • TL;DR
    • Hyperparameters
    • Applicable Settings
    • Effects
    • Implementation Details
    • Suggested Hyperparameters
    • Considerations
    • Composition
    • Attribution
    • Code
  • Sharpness Aware Minimization
    • TL;DR
    • Attribution
    • Code and Hyperparameters
    • Applicable Settings
    • Example Effects
    • Implementation Details
    • Suggested Hyperparameters
    • Considerations
    • Composability
    • Code
  • Squeeze-and-Excitation
    • TL;DR
    • Attribution
    • Code and Hyperparameters
    • Applicable Settings
    • Example Effects
    • Implementation Details
    • Suggested Hyperparameters
    • Considerations
    • Composability
    • Code
  • Stochastic Depth (Block-Wise)
    • TL;DR
    • Attribution
    • Applicable Settings
    • Hyperparameters
    • Example Effects
    • Implementation Details
    • Suggested Hyperparameters
    • Considerations
    • Composability
    • Code
  • Stochastic Depth (Sample-Wise)
    • TL;DR
    • Attribution
    • Hyperparameters
    • Applicable Settings
    • Example Effects
    • Implementation Details
    • Suggested Hyperparameters
    • Considerations
    • Composability
    • Code
  • Stochastic Weight Averaging
    • TL;DR
    • Attribution
    • Applicable Settings
    • Hyperparameters
    • Example Effects
    • Implementation Details
    • Considerations
    • Composability

Model Library

  • CIFAR ResNet
    • Overview
    • Attribution
    • Architecture
    • Family members
    • Default Training Hyperparameters
  • EfficientNet
    • Overview
    • Attribution
    • Architecture
    • Family members
    • Default Training Hyperparameters
  • GPT-2
    • Overview
    • Attribution
    • Architecture
    • Family Members
    • Implementation Details
    • Exploring Tradeoffs Between Quality and Training Speed / Cost
  • ImageNet ResNet
    • Overview
    • Attribution
    • Architecture
    • Family Members
    • Implementation details
    • Default Training Hyperparameters
  • UNet
    • Overview
    • Attribution
    • Architecture
    • Implementation Details
    • Exploring Tradeoffs Between Quality and Training Speed/Cost
MosaicML
  • »
  • Search


© Copyright 2021, MosaicML, Inc.. Revision d17e69f2.

Built with Sphinx using a theme provided by Read the Docs.