The first applications appear really simple because they illustrate cognitive abilities that imitate planning/model-based RL related to neuroscience/psychology.
Both the Keras one and the one from spinning up can be called "vanilla policy gradients". The one in Keras is closer to REINFORCE, and the one from spinning up use actor-critic and a multilayer perceptron.