[SGD] Add documentation

Currently, the SGD implementation is lacking an easy-to-follow getting started documentation in https://ray.readthedocs.io (give an example how to do training, including how to feed the data, explain how the learning rate is being chosen, basically the full steps on how a serial model is transformed into a parallel one with the distributed SGD implementation).