2 February 2012

Unlearning bad habits


One of the major obstacles in teaching at an advanced level is that students often arrive with preconceived ideas. (The term "students" should be interpreted in the broader sense. It may include ourselves, for example.) Here is an example I have come across too many times in the teaching of probability.

I ask the students to prove that if we take the points of a homogeneous Poisson process on the real line and translate each of them by an independent random variable (the translations being identically distributed and independent of the process), then we again get a Poisson process with the same rate.

For example, we may perform the translations by independent standard normal random variables.
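Before attempting a proof, the claim can be checked numerically. The following is a minimal sketch, not part of the original exercise: it simulates a Poisson process on a window, displaces every point by a standard normal, and counts how many displaced points land in $[0,1)$. The rate `2.0`, the padding `8.0` (which stands in for the whole real line) and the function names are assumptions chosen for illustration. If the claim holds, the sample mean and sample variance of the counts should both be close to the rate times the interval length.

```python
import random


def poisson_points(rate, lo, hi, rng):
    """Points of a homogeneous Poisson process with the given rate on [lo, hi)."""
    points = []
    t = lo + rng.expovariate(rate)  # exponential gaps between consecutive points
    while t < hi:
        points.append(t)
        t += rng.expovariate(rate)
    return points


def displaced_count(rate, a, b, pad, rng):
    """Number of displaced points that land in [a, b).

    Assumption: points farther than `pad` from [a, b) are ignored, since a
    standard normal displacement that large is practically impossible.
    """
    originals = poisson_points(rate, a - pad, b + pad, rng)
    return sum(1 for s in originals if a <= s + rng.gauss(0.0, 1.0) < b)


def mean_and_variance(samples):
    """Sample mean and unbiased sample variance."""
    n = len(samples)
    mean = sum(samples) / n
    var = sum((x - mean) ** 2 for x in samples) / (n - 1)
    return mean, var


rng = random.Random(2012)  # fixed seed so the run is reproducible
counts = [displaced_count(2.0, 0.0, 1.0, 8.0, rng) for _ in range(20000)]
mean, var = mean_and_variance(counts)
print(f"mean = {mean:.3f}, variance = {var:.3f}")
```

Agreement of mean and variance is only a sanity check, of course. It says nothing about independence over disjoint sets, which is the other half of what has to be proved.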

The student often has a preconceived idea that a Poisson process is a random function of time $(N_t, t \ge 0)$ such that

  1. It starts from zero: $N_0=0$
  2. It is right-continuous, non-decreasing with values in $\{0,1,2,\ldots\}$
  3. It has stationary and independent increments.

One problem that the student faces (if he or she does not remember the definition I gave in class) is that the question above says to consider a Poisson process on the real line. So, after some thought, the student realizes that the following definition works:
Let $N'$ be an independent copy of $N$. Extend $N$ to $t \in (-\infty, 0)$ by letting $N_t := -N'_{(-t)-}$, where $N'_{s-}$ denotes the left limit of $N'$ at $s$. It can then be checked that the process $(N_t, -\infty < t < \infty)$ satisfies

  1. $N_0=0$
  2. It is right-continuous, non-decreasing with integer values
  3. It has stationary and independent increments.

Rightly then, one can say that $(N_t, -\infty < t < \infty)$ is a Poisson process on the real line. Next, the student attempts to solve the problem as follows. They label the times of discontinuity of $N$ as $\cdots < S_{-1} < S_0 < S_1 < S_2 < \cdots$, agreeing, for instance, that $S_0 \le 0 < S_1$ (after all, the origin of time must be placed somewhere, and this agreement is up to us). They then define $T_n := S_n + X_n$ for each integer $n$, where the $X_n$ are independent identically distributed random variables, and try to show that the process with discontinuities at the times $T_n$ has exactly the same law as $N$.

This is an almost impossible task. The student immediately realizes that the new points are not even ordered in the same way as their indices (something that was true for the old points). In fact, the ordering of the new points is random! There is certainly a first new point to the right of the origin of time, but it is not necessarily the point $T_1$. If the $X_n$ are standard normal, the first point to the right of $0$ could be $T_{517}$ with some probability, or $T_{-29}$, and so on.
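The scrambling of indices is easy to see in a small experiment. The sketch below is an illustration under assumptions that are not in the original post: rate `1.0`, only 50 labelled points on each side of the origin (a truncation of the two-sided process), and a hypothetical helper name. It labels the points so that $S_0 \le 0 < S_1$, displaces each by a standard normal, and records which index ends up first to the right of $0$.

```python
import random
from collections import Counter


def first_index_after_origin(rate, n_each_side, rng):
    """Index n of the smallest displaced point T_n = S_n + X_n with T_n > 0."""
    labelled = {}
    s = 0.0
    for n in range(1, n_each_side + 1):    # S_1 < S_2 < ... to the right of 0
        s += rng.expovariate(rate)
        labelled[n] = s
    s = 0.0
    for n in range(0, -n_each_side, -1):   # S_0 > S_{-1} > ... to the left of 0
        s -= rng.expovariate(rate)
        labelled[n] = s
    # Displace every labelled point by an independent standard normal.
    displaced = {n: s_n + rng.gauss(0.0, 1.0) for n, s_n in labelled.items()}
    positive = {n: t for n, t in displaced.items() if t > 0}
    # default=None covers the (practically impossible) case of no positive point.
    return min(positive, key=positive.get, default=None)


rng = random.Random(517)  # fixed seed so the run is reproducible
tally = Counter(first_index_after_origin(1.0, 50, rng) for _ in range(5000))
for index, freq in sorted(tally.items(), key=lambda kv: -kv[1])[:8]:
    print(index, freq)
```

The index $1$ is typically the most frequent winner, but many other indices, including non-positive ones, win as well. Any argument that tracks the $T_n$ by their labels has to cope with this.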

The student may then ask me for a hint. I try to get across to them the idea that the above definition may be OK for some purposes, but that it has fundamental drawbacks. It is much better to first define a Poisson process as a random discrete subset of the real line such that the numbers of points contained in fixed (nonrandom) disjoint subsets of the real line are independent random variables. From this, it follows that the number of points in a set is a Poisson random variable with mean depending on a deterministic function of the set (this function being a nonnegative measure which, in the homogeneous case, is a multiple of the Lebesgue measure). From this definition it is not hard to show that the previous one is a theorem. Moreover, this is a definition which extends to higher dimensions and even to infinite dimensions.
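With the point-set definition, the original exercise becomes short. Here is one standard route, sketched under the assumption that the marking and mapping theorems for Poisson processes are available; it is not necessarily the proof intended in class. The notation $\lambda$ for the rate, $F$ for the common law of the $X_n$ and $|B|$ for the Lebesgue measure of a Borel set $B$ is introduced here for illustration. By the marking theorem, the pairs $(S_n, X_n)$ form a Poisson process on the plane with mean measure $\lambda\, ds\, F(dx)$. By the mapping theorem applied to $(s,x) \mapsto s+x$, the displaced points form a Poisson process whose mean measure $\mu$ is computed as follows.

```latex
% Assumed notation (not from the post): \lambda = rate, F = law of X_n,
% |B| = Lebesgue measure of B, B - x = \{ b - x : b \in B \}.
\begin{align*}
\mu(B) &= \lambda \int_{\mathbb{R}} \int_{\mathbb{R}}
          \mathbf{1}\{ s + x \in B \} \, ds \, F(dx) \\
       &= \lambda \int_{\mathbb{R}} |B - x| \, F(dx) \\
       &= \lambda \, |B| \int_{\mathbb{R}} F(dx)
        = \lambda \, |B| .
\end{align*}
```

The second equality is Fubini's theorem and the third is the translation invariance of Lebesgue measure. Nowhere does the argument need to know which displaced point carries which label.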

But getting rid of preconceived ideas is very hard. Especially when teachers insist on a traditional way of approaching things.

One should not underestimate the reluctance of many teachers (as I said, the term "student" includes ourselves) to unlearn something and learn it from a different point of view. This is a major obstacle in the teaching of Mathematics.



6 comments:

  1. I wish I understood the math (I only got 50%), but the principles are still clear. Excellent illustration.

  2. Thanks for the compliment Sabio. I am dead serious about the inefficacy of classical teaching methods. I have met so many individuals who cannot teach ... And often this is correlated with their ability to understand...

    As for the "I only got 50% [in maths]", I heard this before many times. I sympathize! I often claim that what one learns in school is not mathematics...

  3. Well, you've got to admit, "Poisson Distribution" and such is not everyday High School math -- or even mandatory university non-major Math. I only ran into it in Statistics where we don't care about theory ('cause we're too dumb) but how to apply.
    Smile

    I taught graduate school medicine. I told students time and again: "Up to now in your career you have been rewarded for guessing. Stop fucking guessing or you will kill someone. Know what you are doing and how you are doing it -- the answer is secondary."

    So I understand much of what you are doing.

  4. Yes, I admit it is neither high-school math nor non-major math.
    However, it is (should be) taught to math majors.

    The case I describe in this posting refers to post-graduate (PhD) students in mathematics and, in fact, in applications of probability. Why, if they don't understand what I'm talking about, don't you think we've got a problem?

  5. Oooops, you seem to have misunderstood my comment (4 days later). I was agreeing with you and giving you a personal example of my teaching experience that confirms it.

    Or so I thought.

    Blog communications are so dangerous.

  6. Ok Sabio--thanks for clarifying. Yes, blog *and* email communication can be dangerous. We treat them like oral conversation, but without looking at the other person's face, which gives so much more information... (smilies, etc., are not adequate).





THE BOTTOM LINE

What measure theory is about

It's about counting, but when things get too large.
Put otherwise, it's about addition of positive numbers, but when these numbers are far too many.

The principle of dynamic programming

$\max_{x,y} [f(x) + g(x,y)] = \max_x [f(x) + \max_y g(x,y)]$

The bottom line

Nuestras horas son minutos cuando esperamos saber y siglos cuando sabemos lo que se puede aprender.
(Our hours are minutes when we wait to learn and centuries when we know what is to be learnt.) --António Machado

Αγεωμέτρητος μηδείς εισίτω.
(Those who do not know geometry may not enter.) --Plato

Sapere Aude! Habe Muth, dich deines eigenen Verstandes zu bedienen!
(Dare to know! Have courage to use your own reason!) --Kant