What is Sigmoid Function?

Deep Learning

Sigmoid Function

The sigmoid function is an activation function that maps any input to a value between 0 and 1, making it useful for binary classification outputs. It has been largely replaced by ReLU in hidden layers but remains standard for output layers.

Understanding Sigmoid Function

The sigmoid function is a mathematical activation function that maps any real number to a value between 0 and 1, producing an S-shaped curve. Historically, it was the default activation in neural networks, valued for its smooth gradient and probabilistic interpretation. The sigmoid remains essential in binary classification output layers, where it converts raw logits into probability estimates, and in gating mechanisms within LSTM and GRU recurrent networks. However, for hidden layers in deep networks, sigmoid has largely been replaced by ReLU and its variants because sigmoid suffers from the vanishing gradient problem, where gradients become extremely small during backpropagation, slowing training. The closely related softmax function generalizes sigmoid to multi-class classification scenarios.

Is AI recommending your brand?

Find out if ChatGPT, Perplexity, and Gemini mention you when people search your industry.

Check your brand — $9

Sim-to-Real Transfer

Back to full glossary

Sigmoid Function

Understanding Sigmoid Function

Is AI recommending your brand?

Related Deep Learning Terms

Activation Function

Adam Optimizer

Adapter Layers

Attention Mechanism

Autoencoder

Backpropagation

Batch Normalization

Batch Size