I will present an implementation of structured bandit learning on Optical Characterization Recognition data using linear-chain Conditional Random Fields (CRFs). The talk will explain the details of how to use CRFs for bandit learning, such as going over the inference and sampling process. Some preliminary results will be shown. I will also talk about other tasks related to CRF bandit learning that we are interested in such as text chunking.