neopolitan's philosophical blog: Rectangular Circles - Yet Another Response to Mathematician

Tuesday, 1 December 2015

Rectangular Circles - Yet Another Response to Mathematician

This is yet another response to Mathematician. Here's what he wrote, interspersed with my responses (minor editing for format, and I've excised the final part that I've already addressed back in the comments section which can be read in its entirety here):

> At N=100, the 1/2 method does not have gaps or clumping.

The whole point of my "subintervals in intervals" example, was to show you that the problem of "gaps or clumping" is only a problem if you think it is. In the interval example, if you require that there is no "gaps or clumping" when N is finite, then the only possible answer is 0.

I think we agree that this is not reasonable at all. And that the most obvious way to select a subinterval will give gaps and clumping.

So how do you choose A PRIORI, in which contexts "gaps or clumping" are problematic, and in which contexts they are not?

It seems to me that you are blending the discussion about the disc and the discussion about the interval.

The "1/2 method" that I am talking about refers to the disc (and only the disc). I am pretty certain that you are clear on this, but I want to be as totally certain as I can be.

I'm not as convinced as you are that the only possible proportion of subintervals greater than L/2 is zero when using a method that eliminates gaps and clumping. What I am pretty sure of, however, is that if we looked at the distribution of subintervals and found that they were clustered around the ends of the interval and their lengths clustered around what could be described as "very very short" (much less than L/2), then we'd have reason to doubt how fair this distribution was.

Perhaps there's a good mathematical reason to not care about such clumping (and the implied "gap" between the ends of the interval in which the density of subintervals would dip), but don't you agree that using such a distribution would not meet the general understanding of "at random" - perhaps not even your own understanding of what "at random" would mean in this context?

> if distribution continued towards an as yet unknown value, or whether it still approaches zero

As far as I understand what you are trying to do, I'm pretty sure that it will approach 0.

I'm not sure that what you are doing proves anything at all, but that's another problem.

I wasn't trying to prove anything. I was just pondering the puzzle that you presented.

> Perhaps it was not clear to you, but the "corrected" 1/3 method, ends up being the 1/2 method

No it was clear.

> And, no, I don't agree that it's the same thing

Let me repeat something for sake of clarity:

For any (c,θ) in [-1,1]x[0,pi], there exists a unique chord that is at distance c from the center, in direction θ.

The "1/2 method of selecting a chord", amounts to pick a couple (c,θ) uniformly in the rectangle [-1,1]x[0,pi]. Do we agree on that?

When you draw your picture to show "granularity", what you are doing is that you choose a θ, once and for all, and then you take 100 values of c that are evenly spaced in [-1,1].

What I'm suggesting is that you do the opposite: Choose a c, once and for all, and then take 100 values of θ that are evenly spaced in [0,pi].

In the end, this is exactly the same method, but you're not drawing the same picture. (In mathematical terms, you are just doing a projection on one of the coordinates)

This is possibly where the meat of the issue is.

I agree that for any (c,θ) in [-1,1]x[0,π], there exists a unique chord that is at distance c from the centre, in direction θ. To be absolutely clear, I am interpreting this to mean that you are talking about a chord that is offset from the locus by c at its midpoint and that, therefore, the direction mentioned is the direction from the locus to that midpoint.

This is not what I thought you meant before. I thought you meant to pick a point at c from the locus (direction irrelevant), and then consider the chords that pass through that point with gradients defined by θ. You'd agree that such a scheme, picking a single value of c, won't give you ALL the chords (certainly not if you pick any value of c less than R, being the radius of the disc), right?

However, you seem to misunderstand my intention. I made clear (somewhere, I can dig it up if you insist) that I was notionally selecting a single value of θ (direction from locus to midpoint) only because that single value can represent all possible values of θ. The same applies when selecting a single Point 1 on the circumference in the 1/3 method.

I fully expect that, to get the ALL the chords, you’d have to consider all possible values of θ - in no way was I suggesting that I should "choose a value of θ, once and for all".

So, I understand that if someone foolishly suggested that we select a value of c, "once and for all", and then look at the chords at c given all possible values of θ (as a direction from the locus to the midpoint of a chord), then you'll never get ALL chords. You'll get an infinite number of chords with precisely the same length but different gradients.

Perhaps I am still misunderstanding your point. I think I must be, because I do not believe that you are this foolish (insert smile here to minimise any unintended offense).

I want to step back a bit to your question:

The "1/2 method of selecting a chord", amounts to pick a couple (c,θ) uniformly in the rectangle [-1,1]x[0,pi]. Do we agree on that?

I agree, with a minor reservation. I'm a bit uncomfortable calling [-1,1]x[0,π] a "rectangle": that space represents a circle (hence my little joke in the title of this article). However, I think I get what you mean - it's a useful way to visualise things for the purpose of considering a uniform distribution of values of c and θ.

What occurs to me is that this can be used in association with the 1/4 method.

My "fix" involved selecting a midpoint from this space (precisely like you seem to be suggesting), while the standard 1/4 method involves selecting from a reduced space. I think it might be, notionally, a bit like this (think density rather than direct correspondence with values of θ):

I'm not suggesting that these are accurate representations of the shapes corresponding to the 1/3 and 1/4 methods, I just used a triangle for 1/3 and cut out circular chunks for 1/4 because it was convenient. However, the concept does point towards the notion that the 1/2 and 1/4 methods are missing chords - and where they are missing from.

> because I am not focussed on how we select chords, I am focussed on ensuring that we have ALL chords (and where N is less than infinity, a representative sample of ALL chords).

Can you provide a single example of a chord that you can get with the 1/2 method, but that you cannot get with the 1/3 method?

See above. Of course I can't point to a single example, which you would clearly realise, but I can (at least conceptually) show that there are fewer chords near the locus with the standard 1/3 and 1/4 methods than there are with the 1/2 method.

> Between -R and -R/2 and R/2 and R, there will be a decrease in the proportion of chords greater than sqrt(3)

You are apparently thinking that "c" should be taken in a predefined direction, and then choose another direction θ. It's not what I said. Just fix some c, once and for all, and then choose a bunch of θ, and then draw the chords corresponding to (c,θ).

So, when c is between, R/2 and R (and between -R and -R/2), the proportion of chords greater than sqrt(3) is 0. So the final answer is 1/2. (Which is absolutely not surprising because it's exactly the same method)

See above. I think I've already addressed your "once and for all" objection, perhaps once and for all (but I am not holding my breath).

I don't know how you end up with 1/2 with what you've said here, but I do agree that all my methods - the standard 1/2 method, the "corrected" 1/3 method and the " corrected " 1/4 method - are effectively (and exactly) the same method.

---

I note that there might have been confusion about my use of the word "fixed" when I mean "corrected". When I used "fixed" previously, I did not mean "never to be changed" as in "fixed in stone". I meant "fixed" as in "my keyboard is broken, I am going to have to get it either fixed or replaced".

18 comments:

Mathematician2 December 2015 at 00:56
> It seems to me that you are blending the discussion about the disc and the discussion about the interval.

Yes, that's exactly what I'm doing. They are problems of the same type. If there is "general understanding of what at random means" it should apply to both problems.

> but don't you agree that using such a distribution would not meet the general understanding of "at random"

That's Bertrand's paradox in a nutshell : there is no general understanding of "at random".

> you are talking about a chord that is offset from the locus by c at its midpoint and that, therefore, the direction mentioned is the direction from the locus to that midpoint.

Yes.

> This is not what I thought you meant before.

I thought so too, that's why I explained it again more precisely.

> You'll get an infinite number of chords with precisely the same length but different gradients.

Yes. That's my point. Why would it be foolish ? What makes this method for drawing a representative sample of chords more foolish than your method (which get an infinite amount of chords with precisely the same gradient but different lengths) ?
I honestly cannot see the difference A PRIORI.

> I'm a bit uncomfortable calling [-1,1]x[0,π] a "rectangle"

That's unfortunate. It's standard notation.

> show that there are fewer chords near the locus with the standard 1/3 and 1/4 methods than there are with the 1/2 method.

Yes. And I perfectly agree with this argument. My point is that requiring the probability distribution to give as much chords near the locus than near the boundary is a relatively "arbitrary" requirement.
In the interval example, with the method of selection by uniformly choosing the two endpoints (giving a result of 1/4), there are more subintervals near the middle of the interval.
The only possible "fix", is to choose a completely absurd way of selecting a subinterval : Pick a point P uniformly in the interval, then the selected subinterval is [P,P] (yes, you read that well). And yes, it's the ONLY possible way to avoid getting more subintervals near the middle. I think that we both agree that this is not what "at random" should mean in this context.

So you think that the requirement is natural in the chord problem, only because it gives a sensical result at the end.

As I said, Take a slightly different problem, "pick a straight line segment at random inside the disc (not necessarily a chord)", and tell me A PRIORI if you requirement is sensical in this context.

To sum up :
I agree with mostly all your computations
I'm not very fond of the way you present the results of your computations (saying that you have the "correct" answer to Bertrand paradox, that you "fix" the "wrong" answers, and so on ...).
And I certainly disagree with your apparent opinion that Bertrand's question can be answered unambiguously.

ReplyDelete
Replies
neopolitan2 December 2015 at 01:29
> Yes. And I perfectly agree with this argument. My point is that requiring the probability distribution to give as much chords near the locus than near the boundary is a relatively "arbitrary" requirement.

That wasn't entirely my point. What I mean is that the very fact that there are fewer chords near the locus indicates that chords are missing given my goal to identify ALL chords. I can't point to a precise example of a missing chord, but the results do indicate that a set of chords are missing. It might be more accurate (or not) to say that chords with a certain characteristic (close to locus) are less frequently instantiated with the 1/3 and 1/4 methods than with the 1/2 method. Perhaps you think that chords are being counted multiple times in the 1/2 method, in which case I could reverse your challenge and ask you to identify them.

> And yes, it's the ONLY possible way to avoid getting more subintervals near the middle.

I don't have a problem with more subintervals near the middle of the interval. It's what I would expect. I don't expect chords to cluster around the rim of the disc. I'm fully aware that what I expect doesn't dictate what is and what is not mathematically "real". (I don't know how many times I will have to make painfully obvious statements like this.)

> And I certainly disagree with your apparent opinion that Bertrand's question can be answered unambiguously.

In strict mathematical terms, that would appear not to be the case (because in strict mathematical terms the missing chords don't matter, or they are not missing, or their "missingness" is a meaningless concept) - and what you assert as my opinion is not my opinion, given that caveat.

A large part of my interest in the Bertrand Paradox is how it applies to the real world, and from what you seem to be saying, it doesn't. There are no infinite sets that we deal with in the real world - if we are splitting up an interval in the real world, we'd eventually reach a point at which we have the planck length, below which further division appears to be meaningless. Therefore, if I were running a randomised drug efficacy trial, I would not have to worry overly about the mathematical basis of the randomisation due to any peculiarity of mathematics revealed by the Bertrand Paradox. I'd have to worry about biases in my patient selection process and the purity of control groups and so on, but there should be no worries that I should come to two or more perfectly valid yet different results regarding the effectiveness of the drug being trialled. "Megacorp Pharma announced today that the new cancer drug KillCan is 50% effective, 33% effective and 25% effective, depending on how you read the results and mathematicians confirm that all three results, and in fact any other results you want, are equally valid and correct. Soon to be available from you GP." That just won't happen. Or will it?
ReplyDelete
Replies
Mathematician2 December 2015 at 13:55
> Perhaps you think that chords are being counted multiple times in the 1/2 method, in which case I could reverse your challenge and ask you to identify them.

No individual chord is counted "multiple times" or is "missed" in any of the presented method. The probability that a "random" chord is exactly the same as a given predefined chord is 0. So all three methods could be seen as "uniform" because the probability of each possible outcome is the same (it is 0)

> I don't have a problem with more subintervals near the middle of the interval. It's what I would expect. I don't expect chords to cluster around the rim of the disc.

"madness is doing the same thing and expecting different results" (sorry, that was too tempting)

I know that it's not exactly the same problem, and I don't think that you are mad, but I would really like to understand why you don't expect the same kind of property in both problems ?

And what about the third problem "pick a random segment inside the disc". What do you expect ?

> "Megacorp Pharma announced today that the new cancer drug KillCan is 50% effective, 33% effective and 25% effective, depending on how you read the results and mathematicians confirm that all three results, and in fact any other results you want, are equally valid and correct"

Wait what ? The different anwer to Bertrand question do not come from the fact that I can "read the results" differently. It comes from the fact that I can read the QUESTION differently.

This makes me think of the famous completely unrelated example :

Imagine that a medical test for a rare disease comes back positive. You know that a test will give the correct answer with a 99% probability. What is the probability that you actually have the disease ?

If you assume that this is a well-defined question, then the only possible answer is 99%. Because if this is a well-defined question, then by your beloved principle of indifference, you should assume that the prior probability that you are sick is 1/2, and an easy computation gives the result.

But, in a real situation this is certainly not a well-defined question ! I'm not given the probability P(sick). So I can not answer the question. The actual "real-life" answer depends on the probability distribution of the disease in the population (which has no reason to be uniform).

In Bertrand paradox it's even worse, because there is no obvious notion of uniform on the set of chords.
ReplyDelete
Replies
Mathematician5 December 2015 at 20:42
> It seems to me that you are redefining the problem with the problem that is the Bertrand Paradox, away from "the term 'at random' is ill-defined" or "the term 'at random' is well understood, but the probability density is ill-defined" towards "we don't really know what a chord is"

I'm sorry but I have to be a little bit formal to explain exactly what is, in my humble opinion, Bertrand paradox. And perhaps you will understand that all my arguments are actually going in the same direction ...

Let C be a circle. Denote by Ω(C) the set of chords on the circle C. This is a well-defined set.
I have a random variable X : Ω(C) -> R+ , corresponding to the length of a chord.
Bertrand question is "what is the probability of the event (X >= Rsqrt(3) ) ?"

The question concerns the probability of some event. Which means that I need a probability measure on Ω(C). But nothing in the question tells me which probability measure to take on Ω(C).
So, I could just stop here and answer "I don't have enough information to answer the question".

But perhaps the probability measure is implied by the question ? Perhaps, the person asking the question thought there was an "obvious" probability measure on Ω(C) ?

As far as I can tell, there is no obvious probability measure on Ω(C). It's not a finite set, it's not a subset of a vector space, it's not a manifold, or a quotient of some Lie group, ...
My point is that Ω(C) is not the kind of set where we already have an obvious choice of probability measure (I'm using an authority argument here. As a mathematician, I know for a fact that there is nothing obvious about defining a "nice" probability measure on a given abstract set.)

So this is bad, we don't have an obvious probability measure on Ω(C). So I could just stop here and say "I don't have enough information to answer the question".

But maybe a chord in Ω(C) can be described in an obvious way by some parameters, and then I could take an obvious probability measure on the space of parameters, and this would give me an obvious probability measure on Ω(C) ?

That's great because there is an obvious bijection between Ω(C) and C^2 (chord -> endpoints )
But wait, there is also an obvious bijection between Ω(C) and [0,pi[x[-R,R] (chord -> (θ,c) )
Oh and there is also an obvious bijection between Ω(C) and D, where D is the disc (chord -> midpoint)

Note that a bijection is the same thing as a description (or parametrization), it gives me an unambiguous way to describe a given chord with some parameters. I think it's also what you mean by "identifying a chord"

And fortunately there are also obvious probability measures on [0,pi[x[-R,R], on C^2, and on D.
So you can get "obvious" probability measures on Ω(C), by simply taking the pushforward-measure of these uniform probability measures.

The problem is that different bijections/parametrizations/descriptions will give different probability measures on Ω(C).

For me, that's the point of Bertrand paradox :
Two different ways of describing chords will give two different parameter spaces and hence two different "obvious" probability measures, and in the end it will give two different answers.
ReplyDelete
Replies
Mathematician8 December 2015 at 06:26
> I think I disagree with "endpoints -> chord", but it depends on precisely what you mean by "->".

I mean precisely that we have a function, from the set of pairs of endpoints to the set of chords (the obvious one, which associate to each pair of endpoints, the unique chord between these points). And I claim that this function is a bijection.

Which means that you can get ALL chords from their endpoints. "ALL" as in "all of them". Not a single chord will be missing. It's a complete and perfect bijection (this is a redundantly redundant phrasing) between the set of pairs of endpoints and the set of chords.

If you don't understand that fact or disagree with it, then I'm seriously wondering what you understand of Bertrand paradox. If you really think that the problem of Bertrand paradox is that we are "missing" some curves in the 1/3 method, then you are so far away from understanding, that it's a little bit scary.

> The very fact that we reach different answers depending on our parameterisation seems to suggest that a claim that "the set of (ALL) chords implies the set of (ALL) pairs of endpoints" might be problematic, doesn't it?

It's only problematic if you think that Bertrand's question is well-defined and/or has a unique answer, doesn't it ?

The very fact that you are asking this question seems to suggest that you didn't understand anything at all about the 15 different answers I already gave you. And also that you don't understand that much Bertrand paradox, Jaynes answer and probabilities in general. I'm sorry to be harsh but that's really what I get from your last two answers.

I was trying to give you the benefit of the doubt for too long, but it's a little bit discouraging ... I'm not sure I want to continue this discussion. You obviously don't want to learn the language of probability (or even basic mathematical language), and this makes every argument a nightmare :
When I try to be precise you say that you don't understand (e.g. "random variable"), and when I'm a little bit imprecise then you are nitpicking on my choice of words (the use of "get" in a previous answer) ...

So I quit ...
ReplyDelete
Replies

Add comment

Feel free to comment, but play nicely!

Sadly, the unremitting attention of a spambot means you may have to verify your humanity.