Notes from the Art and Science of Animal Training Conference (ORCA): More on The Premack Principle

horse and ball
In the last article, I wrote about the Premack Principle which states that:

  • • Behaviors are reinforcers, not stimuli
  • • More probable behaviors reinforce less probable behaviors.
  • • Less probable behaviors punish more probable behaviors.

What can I do with information? First, the idea of behaviors as reinforcers opens up many new possibilities for ways to reinforce behavior. And second, it means I need to pay more attention to the effect that each behavior is having on other behaviors that occur either before or after it.

In many cases, especially if I train with food, this information may not change much about what I do. But there are some types of training where knowing about the Premack Principle can make it easier to come up with a training plan that will be successful and will help you make decisions along the way. At the Art and Science of Animal Training Conference, there were two presentations that explained how to deliberately use the Premack Principle in your training.

They were:

  • Emily Larlham – Reversibility of Reward: Harnessing the Power of your Animal’s worst distractions
  • Alexandra Kurland – Do it again: The use of patterns in training

I am going to share some notes from each one separately and then end with a summary about what we have learned at the conference about the Premack Principle.

Emily Larlham – Reversibility of Reward: Harnessing the Power of your Animal’s worst distractions

Emily started us off with a video showing one of her dogs who loves to chase frogs. For this dog, frog chasing is clearly a behavior that has a high probability of occurring in certain environments. Then she shared a video of her dog walking calmly among some chickens, showing that in that scenario, chasing was actually a lower probability behavior.

Her question was “Would you like to get from this to this?”

Of course, most of us would say “yes”, but it can sometimes seem impossible to imagine how we can replace one behavior with another, especially if the dog has a previous history of doing the less desirable behavior, or if it’s hard to control the environment so we can train a new behavior in small steps.

But it is very possible, and she provided some simple guidelines for how to do it. She did make a point of saying that this topic could not adequately be covered in a 20-minute presentation, so she was just providing a basic outline and some examples in order to illustrate the process. If you want to learn more about how to do this, she has additional resources on her website (www.dogmantics.com).

Before getting into details, she brought up the question of whether or not we are brainwashing dogs. I thought this was a really interesting point because when we try to change behavior, we do always need to think about the function of that behavior and what effect it has on the well-being of the animals. We also need to think about if the new behavior can serve the same function as the original behavior, or if we need to change something else so the needs of the dog are still being met.

Every case is going to be different, but she has found that there can be positive changes in health and well-being when a dog learns a new, calmer way to respond to stimuli that would previously have caused a great deal of excitement and stress.

The steps:

  1. Train desirable behavior first, and maybe an interrupter
  2. Train controlled version of undesirable behavior (I’m not sure exactly what she means)
  3. Set up environment in which the dog chooses to the desirable behavior after being released to do the undesirable behavior. If using food, the dog must be calm enough to take treats.
  4. Add criteria/generalize for the final scenario

She had some great examples and video showing the steps. Here are some of them.

Example 1: Teaching a dog to come away from an item of interest.

  • She had a video of her dog learning to recall away from a snuffle mat. She also showed teaching a dog to recall away from a Frisbee.
  • In both cases, she started by making sure the reinforcer she provided for the recall was of higher value than the item of interest.
  • She also built the behavior in small steps, starting with a recall from a short distance or a recall before the dog got too close to the Frisbee.

Example 2: Teaching a dog to orient to the handler, even in the presence of a rabbit.

This was with an Irish Wolfhound and they started by practicing the desirable behavior, which was having the dog orient to the handler. The rabbit was introduced in stages. First, they had a person stand or walk around at a distance. Then they had the person carry the rabbit, and so on. The complete progression was:

  • Practice orienting to handler
  • Practice with a person (no rabbit) at a distance
  • Practice with  a person holding the rabbit, still at a distance
  • Practice with a person and rabbit placed on the ground, still at a distance
  • You can vary the distance as needed, maybe adding distance with each new step, then repeating it with less distance before moving on to the next step.
  • You may want to use interrupter so dog doesn’t practice getting too excited
  • You can decrease distance as part of each step or as a gradual process over the whole sequence, or some combination.

Example 3: She also showed some video of a dog that wanted to chase deer and would get very excited, jumping up and down and straining at the leash. She was able to teach the dog to remain calm, even when deer were present, and the dog can now be walked without trying to chase deer.

A few other points:

  • The distraction can become a cue for the new behavior. In the second example, seeing the rabbit triggers the desire to chase it. The trainer’s goal is to change the response of the dog so that the sight of the rabbit becomes a cue to do the behavior she wants. It’s important to build this slowly by starting with a “picture” that doesn’t elicit the undesired response and then slowly change it so the rabbit cues the new response.

  • The distraction can be used as a reinforcer, if appropriate.  There are some behaviors that can used as reinforcers for the new behavior you have trained. She often uses “permission to sniff” as a reinforcer for waiting.

She ended with a review of these important concepts:

  • Maintain correct arousal level
  • Ability to control training set-ups
  • Building strength of desired behaviors first
  • Break down training into small steps
  • Reinforce dog’s choice
  • Manage/prevent unwanted behavior in between
  • Generalization/brush up

Emily’s presentation was on using the Premack Principle to decrease unwanted behavior by strengthening  new behaviors and make them more probable.  The next section is on how to use the Premack Principle to teach new behaviors by using patterns.

Alexandra Kurland- Do it again: The use of patterns in training

In Alexandra’s presentation, she showed how patterns can be used in training. Patterns are another way to use behaviors as reinforcers, especially if you build your patterns carefully so they include higher probability behaviors that can be used to reinforce lower probability behaviors.

Alex started with a video showing a simple pattern in which a horse went from mat to mat, passing a cone along the way. The set-up of the environment provided clear information to the horse about what to do next. This kind of exercise is a great way to teach some basic skills or behaviors in a systematic and thoughtful way.

Alexandra’s basic premise is that teachers and learners thrive on patterns.

Patterns create:

  • Predictability
  • Opportunities for do-overs (each repeat of the pattern provides another opportunity to practice or improve each behavior)
  • Time to plan ahead (you and your horse know what’s coming next so you can be prepared)
  • Consistent high rates of reinforcement (you can reinforce as much as needed. A pattern can have reinforcement at multiple places, not just at the end)
  • Emotional balance (if well designed)
  • An adjustable degree of difficulty

Patterns add a level of complexity that draws attention to areas where you need to do more work. A behavior might be easy for the horse when you practice it on its own, but be more difficult when inserted in a pattern. Figuring out why can identify areas that need more work. Combining several behaviors together in sequence may also make it easier to see a general trend such as a weakness in one particular skill or a missing component.

What do you need to know to teach patterns?

Patterns are connected to Loopy Training. Loopy training is Alexandra Kurland’s term for training where a behavior is built in loops. A loop can contain one or more ABC cycles (antecedent -> behavior -> consequence).

The basic guidelines for Loopy Training are:

  • Start with a tight loop with clean behavior (one ABC loop)
  • When the loop is clean, you get to move on. Not only do you get to move on, but you should move on. (Alex calls this phrase “The Loopy Training Mantra”)
  • Moving on means expanding the loop to include more behaviors, with additional cues and reinforcers if needed.
  • A loop is clean when it is fluid and there are no unwanted behaviors
  • Both sides of the click need to be clean

Some additional details about loops:

  • For every behavior you teach, there is an opposite behavior you need to teach and you need to balance behaviors within a loop. For example, a well-balanced loop would contain movement and stationary behaviors.
  • This is not recipe driven training. Every loop can be customized for every horse.
  • People get stuck in patterns because they stay too long. The Loopy Training mantra tells you to move on when the loop is clean.

Example of a Pattern created with Loopy Training: Walking with the trainer to a mat

Basic Principle: There’s always more than one way to shape a behavior. What is the future of the behavior? What skills do you want to learn?

Alex often uses mats in her training. They can be used to ask the horse to go forward (go to the mat) and to stop (stand on the mat) so they help to create well-balanced loops. But before you can use mats in your patterns, you need to have basic mat manners which include:

  • The horse walks on a slack lead beside me, or at liberty
  • He stops on the mat on his own
  • He stays on the mat
  • He leaves the mat when asked

To meet the criteria listed above, the horse and trainer need to learn how to go forward together, how to stop together and how to step back (if the horse overshoots the mat). She teaches these first.

Step 1: Teach the individual behaviors first:

Forward:

  • Can the horse come forward when asked?
  • Alex showed how you can use targeting to ask a horse to come forward
  • She can continue to use the target or add a cue to come forward
  • Stopping can also be taught out of targeting as the horse learns to come forward and stop at the target

Step back:

  • Can the horse step back when asked?
  • Alex showed how to use food delivery to teach backing as part of a basic targeting exercise.
  • Once the horse has learned to back through food delivery, she can add a cue
    Once the horse can come forward, stop and step back, she can start to train matwork.

Step 2: Teach Going to the mat

Alex teaches matwork using what she calls a “runway.” The runway is made of two sets of cones arranged to form an open V (wide). A mat is placed at the base of the V. You can click and reinforce at any point in this pattern. When Alex first teaches it, she will reinforce for most correct responses. Once the horse knows the pattern, she can thin out the reinforcement if desired.

Runway lesson:

  • Start at the open end of the V
  • Ask the horse to come forward one step and then step back one step (needlepoint) at the open part of the V – this allows the trainer to practice asking for one step at a time. The trainer can move on when she chooses, usually when she sees some improvement.
  • Walk with the horse to the mat
  • Stop with the horse on the mat
  • If the horse doesn’t step on the mat, you can ask the horse to come slightly forward (using your needlepoint skills). When the horse does put a foot on the mat, you can use a high rate of reinforcement to reinforce that behavior.
  • Walk off the mat and turn so you are coming around the side (outside the cones) to return to the top of the V. You can switch directions so you work from both sides.
  • The pattern builds the skills that are needed for performance work (turns, forward and back).

The matwork pattern works well because it shows a nice balance of behaviors. It also allows for a lot of adjustability. The trainer can spend more time on the behaviors that need improvement, but is less likely to get stuck on one for too long, as the pattern itself encourages the trainer to move on after any small improvement.

You can make lots of different patterns to teach different behaviors and skills. I use a lot of patterns in my own training and have found that they provide a nice framework for teaching more advanced behavior combinations and can be adjusted to keep the training interesting.

Here are some of the main points from the three presentations on the Premack Principle:

• Think of reinforcement as being about behaviors. What does your animal want to do? Can you use it as a reinforcer?

• Don’t forget that Premack is also about punishers. If you follow a behavior you want with one that the horse is less likely to choose on his own, there will be a decrease in the behavior you want.

• The value of reinforcers can change. Premack showed that the same behaviors can function as reinforcers and punishers, so a behavior can be reinforcing in one situation and punishing in the next.

• You can shift a behavior from more probable to less probable by building reinforcement history for that behavior in an environment where the animal can be successful, and then slowly raising the criteria until he can do the new behavior under the original conditions.

• When choosing a behavior to use as reinforcer, consider behaviors that the animal enjoys or that have a strong reinforcement history.

• As with any type of reinforcer, you do want to choose one that provides the appropriate level of motivation and doesn’t tip the animal over into a state where he is too excited to learn. Barbara Heidenreich gave a presentation on non-food reinforcers that included some tips on choosing and using non-food reinforcers.

• Chains and sequences of behaviors will be stronger if the behaviors are arranged so that more probable behaviors follow less probable behaviors.

Leave a Reply

Fill in your details below or click an icon to log in:

WordPress.com Logo

You are commenting using your WordPress.com account. Log Out / Change )

Twitter picture

You are commenting using your Twitter account. Log Out / Change )

Facebook photo

You are commenting using your Facebook account. Log Out / Change )

Google+ photo

You are commenting using your Google+ account. Log Out / Change )

Connecting to %s