On Jan 16, 9:23 pm, Bjoern Schliessmann <usenet- [EMAIL PROTECTED]> wrote: > Lie wrote: > > [EMAIL PROTECTED]> wrote: > >> I used to systematically use it - like I've always systematically > >> used 'this' in C++ and Java. > > > And that is what reduces readability. > > IMHO not, IOPHO not. This is the nth time (n >> 1) this discussion > comes up here. If I have learned one thing from those very lengthy > discussions, it's that Python's "self" handling is not going to > change.
And ah... yes, I don't wish it go away either. In VB, excessive Me reduces readability, but in Python it doesn't. > > A proficient VB/C/Java programmer would frown upon the extra, > > unneeded garbage as they thought it was clear already that the > > variable refers to a class-level variable. > > C programmers surely have no opinion concerning C because it has no > native classes. C-family, ok? Please stop taking my words to its letters. > Personally, I've seen many C++ programs with complex class designs > where it definitely helps to consistently use "this->". I cannot > remember all local (and global) variables in bigger methods. In that case, you have the _option_ to do it. > > There is one major positive point: convenience and shorter code. > > (isn't that two?) > > Shorter code is not per se positive, neither is it convenient. If it > was, everyone would use perl. Not always, but sometimes it do help not to be obliged to use the class name especially in short, simple programs (as opposed to big projects that requires hundreds of thousands of lines of code) that is changed frequently (being in heavy development). A good real-life example for this would be for new learner of programming or the language, they would write a lot of ten liners that is changed a LOT of times due to their (yet) incomplete understanding of the concepts. > >> it's the first argument of the function - which usually happens to be > >> the current instance when the function is used as a method. > > > And that's the point, self (or anything you name it) is almost always > > the current instance > > # this is a plain function. In this function, > # 'obj' can be whatever that happens to have a (numeric) > # 'stuff' attribute > def func(obj, arg): > return (obj.stuff + arg) / 2.0 > > # this is a class with an instance attribute 'stuff' > class Foo(object): > def __init__(self, bar): > self.stuff = bar + 42 > > # this is another (mostly unrelated) class > # with a class attribute 'stuff' > class Bar(object): > stuff = 42 > > # this is a dummy container class: > class Dummy(object): pass > > # now let's play: > import new > > d = Dummy() > d.stuff = 84 > print func(d, 1) > > d.baaz = new.instancemethod(func, d, type(d)) > print d.baaz(2) > > f = Foo(33) > print func(f, 3) > Foo.baaz = func > f.baaz(4) > > print func(Bar, 5) > Bar.baaz = classmethod(func) > Bar.baaz(6) > > > and that makes it functionally the same as Me and > > this in VB and Java. > > Depends on the context, cf above !-) Please again, stop taking letters to the words, I don't meant them to be exactly the same, rather the same would meant that they generally can be considered equal, exceptions exists of course. And btw, I don't understand what you meant by your example, they seemed to be a completely OK program for me, even though it's a bit confusing to follow[2]. [2] btw, the reason it's a bit confusing to follow is one of my points: It is a Bad Thing(tm) to use the same name for different variables even in a language like Python that enforce explicit naming of classes > >>> Most other languages > >>> 1) automatically assign the containing class' object > >> s/containing class' object/current instance/ > > >>> in a keyword > >>> (Java: this, VB: Me) behind the screen, > > >> That's not very far from what a Python method object does - > >> automatically assign the current instance to something. The difference > >> is that Python uses functions to implement methods (instead of having > >> two distinct contructs), so the only reliable way to "inject" the > >> reference to the current instance is to pass it as an argument to the > >> function (instead of making it pop from pure air). > > > It isn't very far, but Python makes it obvious about the assignment > > (not behind the screen). > > Exactly. And given both the simplicity of the solution and what it let > you do, that's a *very* GoodThing(tm) IMHO. I agree, it's a Good Thing but it doesn't make the point less pointy, the difference between Me/this and self is just the explicit assignment. Other things that is possible because of the explicit assignment is just a "coincidence" of design choice. > >>> and 2) automatically searches > >>> variable name in both the local variable table and the containing > >>> class variable table (so to refer to a class variable named var from a > >>> method inside the class, we only need to write var, not self.var as in > >>> python). > >> This - as you know - cannot work well with Python's scoping rules and > >> dynamicity. Anyway, implicit object reference is definitively a > >> BadThing(tm) wrt/ readbility, specially with multiparadigm languages > >> (like Python or C++). Why do you think soooo many C++ shops impose the > >> m_something naming scheme ? > > > Implicit object reference for the containing class has little harm, if > > a class is so complex that there are more than 10 class-level > > variable, then it is obvious that that class needs to be fragmented to > > smaller classes. > > Not necessarily. There are general rules (like 'keep your classes small > and focused', which I wholefully agree with), there are guidelines (like > 'more than 10 member variables might smell like refactoring), and > there's real life, where one very often faces classes that have much > more than 10 member variables and still are as small and focused as > possible. I'd add "whenever possible" then. I don't consider 10 as a guideline, but rather as a large number I randomly picked for an example, because in most project 10 is A LOT for number of variables, and I'm sure that most well designed program would have less than 10 variables per class (except in cases where it is just impossible to fragment the class any further). > > Remembering less than 10 variable and avoiding naming > > collision among just 10 variable is not hard (and 10 is really too > > many, most classes should only use 2-4 variables), > > I really doubt you'll be able to write any working non-trivial software > trying to strictly follow this rule. In what way have I said that the rule is strict? No rule is ever strict in my book, ALL rules are meant to be bend in exceptional cases and ALL exceptions are meant to be excepted themselves (applying the definition recursively). > > especially if you > > have a good IDE that employs Intellisense-like technology (IDLE has > > it). > > IDLE is certainly not a "good IDE" in my book. What you've just said actually enforce the sentence, even a bad IDE like IDLE have Intellisense-like feature :) I agree that IDLE isn't a good IDE, but it is an example that I could use since it is the only Python IDE I have ever used (although I'm interested to find another, the demand isn't that high ...yet... to _have_ to find another IDE) > > And it is always a Bad Thing(tm) to use the same name for two > > variable in the class and in function (which is the main and only > > source of possible ambiguity) in ANY language, even in Python. > > Ho, yes.... Like, this would be bad ? > > class Person(object): > def __init__(self, firstname, lastname, birthdate, gender): > self.firstname = firstname > self.lastname = lastname > self.birthdate = birthdate > self.gender = gender > > C'mon, be serious. It's often hard enough to come with sensible names, > why would one have to find synonyms too ? Try to come with something > more readable than the above, and let us know. Seriously, this braindead > rule about "not using the same name for an attribute and a local var" > obviously comes from languages where the "this" ref is optional, and > FWIW it's obviously the wrong solution to a real problem (the good > solution being, of course, to use the fully qualified name for > attributes so there's no possible ambiguity). The code fragment you've given way above (about the Foo, Bar, bazz, and func) also suffers from the bad habits of using the same name for different variables. And it's not a "braindead" rule (even though I used the word always, it still isn't a braindead always without exceptions) The example you've given IS the most readable form since the function is _simple_, consider a function that have complex codes, possibly calculations instead of simply assigning initial values I'm sure you'd slip up between the self.* variables and the * variables once or twice, possibly becoming the source of hard-to-find bugs. In other hand in a _complex_ function, if you thought of a new name, like initialfirstname, etc you'd know what is what and what is where immediately And in languages that doesn't enforce explicit naming of classes, when there is the two or more same names, the one with the smallest scope is picked, so in _simple_ functions, the trick of using full qualified names and overloaded local names is still possible and feasible. In complex functions, the trick fails even in Python, because even if Python and our full-concentration-brain is aware of the difference between self.* and *, our spreaded-concentration-brain that is scanning the code for the source of bugs might get stumbled on the confusing use of self.* and *. > >> Anyway, I actually know 3 languages (4 if C# works the same) that has > >> this implicit 'this' (or whatever the name) 'feature', and at least 5 > >> that don't. So I'm not sure that the "most other languages" qualifier > >> really applies to point 2 !-) > > > What's this 5 languages? > > Smalltalk, Python, PHP, javascript, Ruby. I don't remember how Scheme, > CLOS and OCaml handle the case. Among all them, only Javascript is considerably mainstream. Javascript, being virtually the only web-based scripting language that is supported by most browsers. > > Are they a mainstream, high-level languages > > or lesser known, low-level languages? C-family, Java, and Basic are > > the Big Three of high-level programming language. > > None of C, C++, Java nor Basic qualify as "hi-level". C is the lowest > possible level above assembly, C++ is often refered to as an "object > oriented assembler", Java is way too static, crippled, verbose an > unexpressive to qualify as "hi-level" (even if it suffers from some > problems usually associated with higher level languages). I won't even > comment on basic (is that really a language at all ?). Your criteria on being high-level is simply just odd. The rest of the world recognizes C-family, Java, and Basic as high-level languages. I agree C is a lower-level language compared to C++, C#, Basic, and Python, but standing by itself, it still qualifies as high-level language. If I have to say it, Python is actually lower level than Basic. While Java is just below Python and C and C++ is just below Java. Why do I consider Basic the highest-level? Because it is the cleanest to scan (no confusing symbols, i.e. no curly braces, no confusing use of parens (Python uses (), [], and {}, VB only use ()[3]), and that is also the reasons of some of its warts. In what way C++ resembles an assembler? Have you ever programmed in assembly? How hard is it to create a simple program in assembly? And how hard is it to create a complex program in C++ (which AFAIK is used by hundreds of mega projects including CPython)? And have you ever used Basic at all? Some programmers would instantly frown upon Basic, simply because they don't know that Basic is "just another language". Apart from a syntax that is very different from most languages (absence of curly braces, the minimal use of non-alphabet characters in the syntax, minimal use of short keywords, opting for keywords that is (hopefully) understandable to non VBers[4]), and apart from the quality of the implementation, the language itself isn't a bad language. Sure they may have some warts, but I dare to say "Python have no warts". [3] In fact the new VB.NET 2008 adds a syntax that uses curly braces to VB, but it's only for a syntax sugar, in Python, the difference between the different kind of parentheses is significant. [4] The only significant exception to, a (hopefully) understandable keyword names to non-VBers, is probably the most important keyword, "Dim" for variable declaration. > >>> In VB, Me is extremely rarely used, > >> I used to systematically use it - like I've always systematically used > >> 'this' in C++ and Java. > > > And that is what reduces readability. A proficient VB/C/Java > > programmer > > There are quite a few proficient C/C++/Java programmers here. As far as > I'm concerned, I would not pretend being one - I just have a good enough > knowledge of C, Java and (alas) VB to be able to get up to speed in a > reasonnable time frame. > > As a side note, the problem just doesn't exists in C, which has > absolutely no support for OO. When I said C, it might mean C and C-family, so please stop misunderstanding me. As a side note, the conversation we're just having is a real life example of using the name C when it means C-family, your mindset instantly protest that C doesn't have objects, while my mindset says "C" can't have meant C as a language since I don't state C++ and C# too while I should have, and knowing the fact that C in C-family doesn't have objects would make it clear that it is an exception. I'm not saying my mindset is better than yours (it have its positives and negatives), in fact I apologize for getting you confused. > > would frown upon the extra, unneeded garbage as they > > thought it was clear already that the variable refers to a class-level > > variable. > > In C++, the canonical way to make this "clear" is to use the m_name > convention. There must be some reason C++ programmers feel a need for > this "extra, unneeded garbage" ?-) In some cases, an extremely complex class that can't be fragmented any further, the m_ convention is surely useful, but in most cases you could skip them out. And the canonical way to make this "clear" is not the m_ convention, it's the name itself. A well-designed class would choose names that is recognizable instantly from the name itself, even without the pseudo-name appended to it (or prepended). btw you must have been memorizing names braindeadly, because the only way you could stumble on that is by memorizing names braindeadly. Names shouldn't be memorized, it should be inferred and memorized. For example, when you met a variable name firstname and lastname inside a class called Person, you'd immediately realize that it is Class Level variable because you know that the function you're currently working on use the name initialfirstname and initiallastname. > > It is a different story if, like Python, the use of self is > > enforced by the language, the self wouldn't be viewed as extra > > unnecessary garbage. > >>> in Python, self is all > >>> over the place. Well, there is positive and negative to both sides, > >>> convenience in VB, and flexibility in Python. > >> As far as I'm concerned, there's *no* positive point in implicit object > >> reference, and there has never been (and before some paranoid nutcase > >> around accuse me of overzealous biggotry : I already held this very same > >> opinion years before I discovered Python). > > > There is one major positive point: convenience and shorter code. > > (isn't that two?) > > These are not rated as "positive" in my book. That's perhaps why Python > is so far MFL ? Not in _your_ book. Different languages have been devised throughout the world to solve various problems, some of which are convenience in using the language itself. > > As I've pointed out, there is little harm in class-level variable's > > implicit reference. > > Have some working experience on any non-trivial C++ project ? No (you could say I'm a student so I've never "worked"[1]). But I've done some medium-sized projects in other languages. [1] If you understand the irony, you'd realized I was deliberately misunderstanding you > >>> Compare the following codes: > >>> VB.NET: > >>> Public Class A > >>> Dim var > >>> Public Function aFunction() > >>> return var > >> Add three levels of inheritence and a couple globals and you'll find out > >> that readability count !-) > > > It's the mental model that have to be adapted here, if the current > > class is inheriting from another class, you've got to think it as > > names from parent class as it is a native names, so you don't actually > > need to know where the variable comes from > > In C++ (and VB IIRC), it might as well be a global So sorry but yes, I > have to know where it comes from. How many times would you use globals, it is a Bad Thing(tm) to use globals in the first case. In some exceptional cases globals might be unavoidable, but it is trivial to work out that you have to reduce the amount of globals to a minimum, in almost any cases to a number you can use a hand to count with. And applying the hacks mentioned, why don't you use the m_ convention for globals, and retains the convenience of m_-free variables in your class variable. You use class variable much more often than globals, and in most projects class- level variable is used just as often as local-variable. > > since knowing where it > > comes from is breaking the encapsulation > > Nope, it's knowing what you're doing and how the piece of software at > hand is working. And FWIW, while data hiding is one possible mean of > encapsulation, it's by no way a synonym for encapsulation. I agree that knowing an underlying class's implementation is useful (in fact, very useful) but what I'm talking is about should-ness, we shouldn't _need_ to know the underlying implementation, but if we know it, it's fine and it's great, since you can do tricks that you couldn't do otherwise (at your own risks). No it's not synonym, but one way of encapsulation. > > (which, in Python is very > > weakly implemented, which favors flexibility in many cases[1]). > > > [1] In Python, it is impossible to create a completely private > > variable, which is the reason why the mental model of these other > > languages doesn't fit Python. > > Never heard about the infamous '#define private public' hack in C++ ? > And don't worry, there are also ways to get at so called 'private' vars > in Java. No, but it's violating the language's rule. Python OTOH, provides formal ways to got to private vars. > >> In any non-trivial piece of C++ code, and unless the author either used > >> the explicit 'this' reference or the 'm_xxx' naming convention, you'll > >> have hard time figuring out where a given name comes from when browsing > >> a function's code. > > > If you're used to the implicit naming scheme it's easy to know where a > > variable came from, if not the current scope, it's the class' scope > > You forgot the global scope. How many global variables do you have in your projects on average? If you always have a pageful list of globals in your projects, then you should consider unlearning and relearning the basic programming concepts. It's easy to keep track of globals, as you shouldn't have a lot of them even in a huge project. > > As a final note: > > I don't think implicit class reference is superior to explicit class > > reference, neither > > ... I'm sure you don't believe it since I'm talking on implicit's side, but that's the fact, I just pointed you out that implicits do have its positive side (even if you don't consider them positive in _your_ book) but that doesn't meant I believe it is better than the other. To clear things up: As a final note: I don't think implicit class reference is superior to explicit class reference, but I don't think the vice versa is true either. I consider they have their own positive and negative sides and different languages have different "better" definition, for Python it is explicit, for C/Java/Basic it's implicit. -- http://mail.python.org/mailman/listinfo/python-list