ML in Health Science: AI Snake

Explore ML as a Game

This model is designed to target specific objectives during training.

This model does not focus on specific objectives and allows for more exploration.

This model encourages exploration by incorporating curiosity-driven learning.

Adjust the proportion between generated data and experience data used during training.
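
As a rough sketch of what this proportion controls, the Python below assembles one training batch from the two data sources according to a chosen fraction. The function name, variable names, and values are illustrative assumptions, not the app's own code.

```python
# Illustrative sketch: mix generated data and experience data in one batch.
import random

def build_training_batch(generated, experience, generated_fraction, batch_size):
    """Sample a batch that mixes generated samples and experience samples."""
    n_generated = int(round(batch_size * generated_fraction))
    n_experience = batch_size - n_generated
    batch = (random.sample(generated, min(n_generated, len(generated))) +
             random.sample(experience, min(n_experience, len(experience))))
    random.shuffle(batch)
    return batch

# Example: 30% generated data, 70% experience data in a batch of 10.
generated_data = [("gen", i) for i in range(100)]
experience_data = [("exp", i) for i in range(100)]
print(build_training_batch(generated_data, experience_data, 0.3, 10))
```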

Adjust the speed at which the snake moves. Lower values mean higher speed.

Fine-tune the model using existing training data to improve its performance incrementally.

Train the model from scratch using a new set of data for a completely fresh start.
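
The difference between the two options can be pictured as follows. This is a hedged Python/Keras sketch, not the tool's actual browser code; the layer sizes, input and action counts, and the weights file are hypothetical.

```python
# Illustrative sketch: fine-tuning versus training from scratch.
import tensorflow as tf

def build_model(n_inputs, n_actions):
    return tf.keras.Sequential([
        tf.keras.layers.Dense(64, activation="relu", input_shape=(n_inputs,)),
        tf.keras.layers.Dense(n_actions),
    ])

# "Train from scratch": fresh random weights, so all prior learning is discarded.
scratch_model = build_model(n_inputs=12, n_actions=4)

# "Fine-tune": start from previously saved weights and continue training,
# so performance improves incrementally. The file name is hypothetical.
finetune_model = build_model(n_inputs=12, n_actions=4)
# finetune_model.load_weights("snake_weights.h5")  # uncomment if such a file exists
```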

Number of times the model will cycle through the training data.

Number of training examples used in one iteration.

How much to change the model in response to the estimated error each time the model weights are updated.

Rate at which the learning rate decreases after each epoch.

Penalty term added to the loss function to encourage smaller weights and reduce overfitting.

Alpha parameter for the Leaky ReLU activation function, controlling the slope of the activation for negative inputs.

Fraction of input units to drop to prevent overfitting during training.

Number of epochs with no improvement after which training will be stopped.

Minimum change in the monitored quantity to qualify as an improvement.

Threshold to clip gradient norms to prevent exploding gradients during training.
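
The training settings above map onto a standard feedforward-network setup roughly as in the sketch below. This is an illustrative Python/Keras analog of what the browser-based tool does; every constant, layer size, and name is an assumption used only to show each parameter's role.

```python
# Illustrative sketch: where each training setting typically appears.
import numpy as np
import tensorflow as tf

LEARNING_RATE = 1e-3      # step size for each weight update
LR_DECAY = 0.95           # multiply the learning rate by this after each epoch
L2_PENALTY = 1e-4         # weight penalty added to the loss (regularization)
LEAKY_RELU_ALPHA = 0.01   # slope for negative inputs in Leaky ReLU
DROPOUT_RATE = 0.2        # fraction of input units dropped during training
EPOCHS = 50               # passes over the training data
BATCH_SIZE = 32           # examples per gradient update
PATIENCE = 5              # epochs without improvement before stopping early
MIN_DELTA = 1e-4          # smallest change that counts as an improvement
CLIP_NORM = 1.0           # clip gradient norms to avoid exploding gradients

model = tf.keras.Sequential([
    tf.keras.layers.Dense(64, input_shape=(12,),
                          kernel_regularizer=tf.keras.regularizers.l2(L2_PENALTY)),
    # newer Keras versions call this argument negative_slope instead of alpha
    tf.keras.layers.LeakyReLU(alpha=LEAKY_RELU_ALPHA),
    tf.keras.layers.Dropout(DROPOUT_RATE),
    tf.keras.layers.Dense(4),  # one output per possible snake action
])

model.compile(
    optimizer=tf.keras.optimizers.Adam(learning_rate=LEARNING_RATE, clipnorm=CLIP_NORM),
    loss="mse",
)

callbacks = [
    tf.keras.callbacks.EarlyStopping(monitor="loss", patience=PATIENCE, min_delta=MIN_DELTA),
    tf.keras.callbacks.LearningRateScheduler(lambda epoch, lr: lr * LR_DECAY),
]

# Dummy data just so the example runs end to end.
x = np.random.rand(256, 12).astype("float32")
y = np.random.rand(256, 4).astype("float32")
model.fit(x, y, epochs=EPOCHS, batch_size=BATCH_SIZE, callbacks=callbacks, verbose=0)
```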

Coefficient for the entropy term in the loss function to encourage exploration.

Level of noise added to the state inputs to improve robustness.
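
Both ideas above can be sketched in a few lines: an entropy bonus subtracted from the loss so the model prefers less-certain, more exploratory action distributions, and Gaussian noise added to the state inputs. The values and names are illustrative, not the app's.

```python
# Illustrative sketch: entropy bonus in the loss and noise on the state inputs.
import numpy as np

ENTROPY_COEF = 0.01   # weight of the entropy bonus in the loss
STATE_NOISE = 0.05    # standard deviation of the input noise

def entropy(probs):
    """Shannon entropy of an action-probability vector."""
    probs = np.clip(probs, 1e-8, 1.0)
    return -np.sum(probs * np.log(probs))

action_probs = np.array([0.7, 0.1, 0.1, 0.1])
base_loss = 0.42  # stand-in for whatever the main loss is
loss_with_entropy_bonus = base_loss - ENTROPY_COEF * entropy(action_probs)

state = np.array([0.0, 1.0, 0.5, 0.25])
noisy_state = state + np.random.normal(0.0, STATE_NOISE, size=state.shape)
print(loss_with_entropy_bonus, noisy_state)
```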

Reward value for the snake when it eats food.

Penalty value for the snake when it collides with itself.

Penalty value for the snake for each move it makes (encourages faster completion).

Penalty value for the snake if it fails to make progress towards the food.

Reward value for the snake based on its proximity to the food.
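
Taken together, the five reward settings above define a shaping function along these lines. The constants and helper arguments below are assumed for readability and are not the tool's actual values.

```python
# Illustrative sketch: combining the reward and penalty settings into one step reward.
FOOD_REWARD = 10.0
SELF_COLLISION_PENALTY = -10.0
MOVE_PENALTY = -0.1
NO_PROGRESS_PENALTY = -0.5
PROXIMITY_REWARD = 0.2

def step_reward(ate_food, hit_self, old_distance, new_distance):
    """Reward for one move, based on what happened and on progress toward the food."""
    reward = MOVE_PENALTY                      # small cost for every move
    if ate_food:
        reward += FOOD_REWARD                  # big reward for eating the food
    if hit_self:
        reward += SELF_COLLISION_PENALTY       # big penalty for a self-collision
    if new_distance < old_distance:
        reward += PROXIMITY_REWARD             # got closer to the food
    else:
        reward += NO_PROGRESS_PENALTY          # failed to make progress
    return reward

print(step_reward(ate_food=False, hit_self=False, old_distance=5, new_distance=4))
```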

Size of the replay buffer that stores experiences for training.

Exponent for prioritizing experiences in the replay buffer.

Small value added to priorities to ensure all experiences have a non-zero probability of being selected.
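
A minimal sketch, assuming a simplified priority scheme, of how the three replay-buffer settings above work together; this is not the app's implementation, and the names and values are illustrative.

```python
# Illustrative sketch: a prioritized replay buffer with size, exponent, and epsilon.
import random
from collections import deque

BUFFER_SIZE = 10_000       # maximum number of stored experiences
PRIORITY_EXPONENT = 0.6    # how strongly priorities skew the sampling
PRIORITY_EPSILON = 1e-3    # keeps every priority above zero

buffer = deque(maxlen=BUFFER_SIZE)   # oldest experiences fall out when full

def add_experience(experience, td_error):
    """Store an experience with a priority derived from its error."""
    priority = (abs(td_error) + PRIORITY_EPSILON) ** PRIORITY_EXPONENT
    buffer.append((priority, experience))

def sample_batch(batch_size):
    """Sample experiences with probability proportional to their priority."""
    priorities = [p for p, _ in buffer]
    chosen = random.choices(list(buffer), weights=priorities, k=batch_size)
    return [experience for _, experience in chosen]

for i in range(100):
    add_experience(experience=("state", "action", "reward", i), td_error=random.random())
print(sample_batch(4))
```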

Probability of selecting a random action instead of the best action during training.

Rate at which the exploration epsilon decreases after each episode.

Minimum value for the exploration epsilon to ensure some exploration is always present.
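
The three exploration settings above fit together as in this small sketch (assumed names and values): epsilon starts high, decays after each episode, and never falls below its minimum.

```python
# Illustrative sketch: epsilon-greedy action selection with per-episode decay.
import random

EPSILON_START = 1.0
EPSILON_DECAY = 0.99
EPSILON_MIN = 0.05
ACTIONS = ["up", "down", "left", "right"]

def choose_action(epsilon, best_action):
    """With probability epsilon pick a random action, otherwise the best one."""
    if random.random() < epsilon:
        return random.choice(ACTIONS)
    return best_action

epsilon = EPSILON_START
for episode in range(200):
    action = choose_action(epsilon, best_action="up")    # "up" stands in for the model's choice
    epsilon = max(EPSILON_MIN, epsilon * EPSILON_DECAY)  # decay after each episode
print(round(epsilon, 3))
```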

This tool is designed to help you understand the principles of building machine learning models through gameplay. The Snake is guided by a reinforcement learning process that uses feedforward neural network predictions, with training data generated from automated gameplay. Experiment with the parameters and observe how performance changes. Ask the Transformer about the principles of model building and how the settings influence performance.

If you would like to see how the ML works: right-click, select "Inspect," open your browser's Console, and enjoy the heartbeat of the machine.

If you want to deploy your own ML app, visit ML in Health Science: Playground.