Good coding practices

Prepared by:

Darek Łuczyński

Introduction to Clean Code

Introduction

Author: Based on the book by Robert C. Martin (Uncle Bob)

Title: Clean Code - A handbook of Agile Software Craftmanship

Definition of programming and code

Specifying requirements in such detail that a machine can execute them is programming. Such a specification is code.

Ref.: Clean Code, R.C. Martin

Dirty code characteristics

dirty code:

(computing, derogatory) Software code that has had many editors with conflicting styles, making it nearly impossible to maintain. That software has dirty code and we should not use it.

Ref.: https://www.yourdictionary.com/dirty-code

When to write dirty code

There are a few occasions where its okay to write dirty code:

When you’re stuck
When you want to write good code
When you want to make things quickly

Ref.: https://zellwk.com/blog/its-okay-to-write-dirty-code/

Czysty kod pochodzi z brudnego kodu, na takiej zasadzie jak sie przepisuje czystopis z brudnopisu, albo tworzy sie obraz na podstaiw szkicu.

Wedlug powyzszego bloga, sa trzy przypadki gdy mozna pisac brudny kod.

Kiedy zaciales sie, nie wiesz jak isc dalej, i potrzebujesz prob aby rozwiazac problem,
Kiedy chcesz napisac czysty kod,
Kiedy musisz cos szybko napisac,

https://medium.com/@saad.eloulladi/write-clean-code-through-dirty-code-12a8b14b5e96 https://zellwk.com/blog/its-okay-to-write-dirty-code/

When to write dirty code

[…]programming is a craft more than it is a science. To write clean code, you must first write dirty code and then clean it. […] Most freshman programmers don’t follow this advice. They believe that the primary goal is to get the program working. Once it’s “working,” they move on to the next task, leaving the “working” program in whatever state they finally got it to “work.” Most seasoned programmers know that this is professional suicide.

― Robert C. Martin, Clean Code: A Handbook of Agile Software Craftsmanship

Aby pisać czysty kod, musimy na początku napisać brudny, a następnie go oczyścić. Zeby napisac dobry tekst, najpierw pisze się zgrubny szkic, następnie drugi, a potem kilka kolejnych, aż otrzymamy ostateczną wersję.

Większość niedoświadczonych programistów nie stosuje się do tego zalecenia. Uważają oni, że ich podstawowym celem jest napisanie działającego programu. Gdy “już to działa”, to przechodzą do następnego zadania, pozostawiając ten “działający” program w dowolnym stanie. Większość doświadczonych programistów wie, że jest to zawodowe samobójstwo.

Why keep the code clean

“Code is read much more often than it is written. Code should always be written in a way that promotes readability.”

― Guido van Rossum, founder of Python, Python Enhancement Proposal (PEP 8)

# see ZEN of Python in python console
import this

“Indeed, the ratio of time spent reading versus writing is well over 10 to 1. We are constantly reading old code as part of the effort to write new code. …[Therefore,] making it easy to read makes it easier to write.”

― Robert C. Martin, Clean Code: A Handbook of Agile Software Craftsmanship

Why keep the code clean

pie_chart

Ref.: https://dmitripavlutin.com/coding-like-shakespeare-practical-function-naming-conventions/

Why keep the code clean

The Cost of Owning a Mess is that:

any change is not trivial.

We’ve all said we’d go back and clean it up later. Those days we didn’t know LeBlanc’s law:

Later equals never.

Ref.: https://www.quora.com/What-resources-could-I-read-about-Leblancs-law

Clean code characteristics

- elegant,

- efficient,

- complete error handling,

- does one thing well,

- simple and direct,

- well-written prose,

- written by someone who cares,

- reduced duplications (copy & paste),

- high expressiveness (including meaningful names),

- early building of simple abstractions, abstract method or class,

- test driven development (unit tests related),

Code quality measurement

It’s been measured by WTF quantity per minute:

code_qc

The Boy Scout Rule

The Boy Scouts of America have a simple rule that we can apply to our profession:

“Leave the campground cleaner than you found it.“

The cleanup doesn’t have to be something big. Change one variable name for the better, break up one function that’s a little too large, eliminate one small bit of duplication.

Names

Raw	camelCase	PascalCase	kebab-case	snake_case
fruits in basket	fruitsInBasket	FruitsInBasket	fruits-in-basket	fruits_in_basket
has error	hasError	HasError	has-error	has_error
is visible	isVisible	IsVisible	is-visible	is_visible

N6: Avoid Encodings

Names should not be encoded with type or scope information. Prefixes such as ‘m_’, ‘c’, ‘s’ or ‘i’ are useless in today’s environments.

Hungarian Notation (HN)

Majority of languages f.ex. Java, C# including CAPL are strongly typed, and editing environments can detect a type error long before you can run a compile. So nowadays HN and other forms of type encoding are simply handicaps.

Encodings are also a possibility that the encoding system will mislead the reader.

PhoneNumber phoneString;
// name not changed when type changed!

N6. Unikanie kodowania

Unikanie kodowania - wspolczesnie zanieczyszcza kod

Kodowanie w nazwach typu lub informacji o zakresie powoduje zwiększenie nakładu pracy przy odszyfrowaniu.

Fortran wymuszał kodowanie przez użycie pierwszej litery nazwy na kod typu. W czesne wersje języka Basic pozwalały tylko na korzystanie z litery i jednej cyfry.

Notacja węgierska (NW) pozwalała przejść na całkiem nowy poziom. NW była uważana za bardzo ważną w API Windows C, gdy wszystko było uchwytem całkowitym, długim wskaźnikiem lub wskaźnikiem void albo jedną z kilku odmian “string” (o różnych zastosowaniachi atrybutach). W tym czasie kompilator nie sprawdzał typów, więc programiści potrzebowali sposobu na ich zapamiętanie.

W dniu dzisiejszym notacja węgierska jest tylko utrudniniem.

W nowoczesnych językach mamy znacznie bogatszy system typów, a kompilator pamięta i wymusza typy. Co więcej, istnieje trend tworzenia mniejszych klas i krótszych funkcji, aby ludzie najczęściej widzieli punkt deklaracji każdej używanej zmiennej.

G25: Replace Magic Numbers with Named Constants

This is probably one of the oldest rules in software development. In general it is a bad idea to have raw numbers in your code. Shall be hidden behind well-named constants.

Some constants are so easy to recognize that they don’t always need a named constant to hide behind so long as they are used in conjunction with very self-explanatory code.

double milesWalked = feetWalked/5280.0; //well known, easy to recognize... maybe for an american
int dailyPay = hourlyRate * 8;
double circumference = radius * Math.PI * 2;

G25. Zamiana magicznych liczb na stałe nazwane

W przypadku mPADa, magiczne liczby najczęściej są używane jako “exit codes” z funkcji. Uważam, że to jest zła praktyka.

Jest to prawdopodobnie jedna z najstarszych zasad programowania.Zwykle złym pomysłem jest korzystanie z surowych liczb w kodzie. Niektóre stałe są tak proste do rozpoznania, że nie zawsze wymagają stałej nazwanej, o ile są połączone z bardzo dobrze opisującym się kodem

Są też przypadki, wktórychy stosować liczby.

Czy faktycznie w powyższych przykładach potrzebujemy stałych FEET_PER_MILE, WORK_HOURS_PER_DAY oraz TWO? Istnieją jednak wyrażenia, w których stałe są po prostu lepsze niż surowe liczby. Można mieć wątpliwości co do stałej WORK_HOURS_PER_DAY, ponieważ prawo i konwencje mogą się zmieniać. Z drugiej strony, wyrażenie to czyta się tak dobrze z cyfrą, że byłbym ostrożny przed dodawaniem do niego 17 znaków.

W przypadku FEET_PER_MILE liczba 5280 jest tak dobrze znana i unikatowa, że czytelnik rozpozna ją, nawet gdyby znajdowała się sama na stronie, bez żadnego kontekstu.

Stałe, takie jak 3,141592653589793, są również bardzo dobrze znane i łatwo rozpoznawalne. Jednak szansa popełnienia błędu jest zbyt duża, aby pozostawić ją w surowej postaci. Za każdym razem, gdy ktoś zobaczy 3,1415927535890793, będzie wiedział, że jest to pi, i nie będzie jej sprawdzał (czy zauważyłeś błąd w jednej cyfrze?). Nie chcemy również, aby używane były wartości 3,14, 3,14159, 3,142 i tak dalej. Dlatego dobrze, że mamy już zdefiniowaną stałą Math. PI.

G25: Replace Magic Numbers with Named Constants

The term “Magic Number” does not apply only to numbers. It applies to any token that has a value that is not self-describing.

assertEquals(7777, Employee.find(“John Doe”).employeeNumber());

There are two magic numbers in this assertion. The first is obviously “7777”. The second magic number is “John Doe”.

It also turns out that “John Doe” represents hourly paid, test employee in that test database. So this test should really read:

assertEquals(HOURLY_EMPLOYEE_ID, Employee.find(HOURLY_EMPLOYEE_NAME).employeeNumber());

G25. Zamiana magicznych liczb na stałe nazwane

Termin „magiczna liczba” nie odnosi się wyłącznie do liczb. Odnosi się do dowolnej wartości, która nie jest rozpoznawalna na pierwszy rzut oka. Na przykład:

W tej asercji mamy dwie magiczne liczby. Pierwszą jest 7777, ponieważ jej znaczenie nie jest oczywiste. Drugą magiczną liczbą jest “John Doe”, ponieważ i w tym przypadku intencje nie są jasne. Wygląda to tak, jakby “John Doe” był nazwą pracownika numer 7777 w testowej bazie danych. Każdy po podłączeniu do tej bazy będzie miał dostęp do danych kilku pracowników mających znane wartości i atrybuty. “John Doe” reprezentuje jedynego pracownika opłacanego godzinowo w tej testowej bazie danych.

N1: Choose Descriptive Names

Class Names

Classes, structures and objects should have noun or noun phrase names like: Customer, WikiPage, Account, and AddressParser. A class name should not be a verb.

Method Names

Methods should have verb or verb phrase names like: postPayment, deletePage, or save. Accessors, mutators, predicates should be prefixed with ‘get, set, is‘.

Choosing good names for one argument function can go together with the arguments. Function and argument should form a very nice verb/noun pair.

//is very evocative. Whatever this “name” thing is, it is being “written.”
write(name) 
//even better name might be below, which tells us that the “name” thing is a “field.”
writeField(name)

N1: Wybór opisowych nazw

Nazwy Klas

Klasy, struktury i obiekty powinny być rzeczownikami lub wyrażeniami rzeczownikowymi, takimi jak: Customer, WikiPage, Account czy też AddressParser. Należy unikać w nazwach klas słów takich jak: Manager, Processor, Data lub Info. Nazwy klas nie powinny być czasownikami.

Nazwy metod

Metody należy opatrywać nazwami będącymi czasownikami lub wyrażeniami czasownikowymi, takimi jak postPayment, de1etePage lub save. Akcesory, mutatory być poprzedzone przedrostkiem get, set lub is.

Funkcja i argument mogą tworzyć użyteczną parę czasownik-rzeczownik, co poprawia czytelmpść kodu.

Na przykład zapis write (name) jest oczywisty. Niezależnie od tego, czym jest name (nazwa), jest to poddawane operacji write (zapis). Jeszcze lepszą nazwą może być write Field (name), z której jednoznacznie wynika, że zapisywana jest nazwa pola.

N1: Choose Descriptive Names

Names in software are 90% of what make software readable. You need to take the time to choose them wisely and keep them relevant.

public int x() {
    int q = 0;
    int z = 0;
    for (int kk = 0; kk < 10; kk++) {
        if (l[z] == 10)
        {
            q += 10 + (1[z + 1] + 1[z + 2]);
            z += 1;
        }
        else if (l[z] + l[z + 1] == 10)
        {
            q += 10 + 1 [z + 2];
            z += 2;
        } else {
            q += l[z] + l[z + 1];
            z += 2;
        }
    }
    return q
}

N1: Wybór opisowych nazw

W 90 procentach nazwy w kodzie decydują o jego czytelności. Pamiętaj, że znaczenie nazwy, dryfuje w czasie, nie wahaj się zmieniać nazwy, którą wybrałeś.

Nie podejmuj szybko decyzji na temat danej nazwy. Nazwa powinna być stosowna oraz isotna. Bądź pewien, że nazwa jest wystarczjąco opisowa.

Poniżej mamy ten sam kod zapisany we właściwy sposób. Magiczne liczby przestały być magiczne, a struktura algorytmu jest zachęcająco opisowa.

private boolean isStrikeiint frame) {
return rolls[frame] == 10;
}

N1: Choose Descriptive Names

Use Intention-Revealing Names

The name of a variable, function, or class, should tell you why it exists, what it does, and how it is used. So take care with your names.

public int score() {
    int score = 0;
    int frame = 0;
    for (int frameNumber = 0; frameNumber < 10; frameNumber++) {
        if (isStrike(frame)) {
            score += 10 + nextTwoBallsForStrikeiframe);
            frame += 1;
        } else if (isSpareiframe)) {
            score += 10 + nextBallForSpareiframe);
            frame += 2;
        } else {
            score += twoBallsInFrame(frame);
            frame += 2;
        }
    }    
    return score;
}

N3: Use Standard Nomenclature Where Possible

Names are easier to understand if they are based on existing convention or usage. In Java, for example, functions that convert objects to string representations are often named:

toString();

Teams will often invent their own standard system of names for a particular project. Your code should use the terms from this language extensively.

More you can use names that are overloaded with special meanings that are relevant to your project, the easier it will be for readers to know what your code is talking about.

N3. Korzystanie ze standardowej nomenklatury tam,gdzie jest to możliwe

Nazwy są łatwiejsze do zrozumienia, jeżeli bazują na istniejących konwencjach lub zastosowaniach. W języku Java funkcje konwertujące obiekty na postać znakową są często nazywane toString. Lepiej zachować zgodność z tego typu konwencjami, niż wymyślać własne.

Zespoły często wymyślają własne standardy nazewnictwa dla określonych projektów.

Mówiąc krótko, im więcej używamy nazw przeładowanych znaczeniami odnoszącymi się do naszego projektu, tym łatwiej będzie czytelnikowi zrozumieć, do czego służy dany kod.

N4: Unambiguous Names

Choose names that make the workings of a function or variable unambiguous. Names may seem long, however it’s explanatory value outweighs the length cost.

Avoid Disinformation

Do not refer to specific phrases, unless you know it’s truly represents it.

// The word list means something specific to programmers.
// account list is actually a dictionary, but by the name indicates on the list
var accountList = new Dictionary<string, int>();
//  account list is truely a list
var accountList = new List<int>();

Beware of using names which vary in small ways.

// It is not ewasy to distinguish both variables
var yzControllerForEfficientHandlingOfStrings = new YZControllerForEfficientHandlingOfStrings();
var xyzControllerForEfficientStorageOfStrings = new XYZControllerForEfficientStorageOfStrings();

N4. Jednoznaczne nazwy

Należy wybierać nazwy, które nie pozostawiają niejednoznaczności co do działania funkcji lub zmiennej. Nazwy mogą wydawać się długie, niemniej ich wartość objaśniająca przewyższa koszt ich długości.

Unikanie dezinformacji
- Unikaj używania nazw, które mogą być nazwami własnymi. Może to prowadzić do fałszywych wniosków.
- Należy unikać nazw, które nieznacznie się od siebie różnią.

N4: Unambiguous Names

Example of disinformative names would be the use of lower-case l or uppercase O as variable names, especially in combination. The problem is that they look almost entirely like the constants one and zero.

int a = l;
if ( O == l )
    a = O1;
else
    l = 01;

N4: Unambiguous Names

Use Pronounceable Names. This matters because programming is a social activity. Hungarian notation and prefixes make names unpronouceable.

// bad naming
int XPstn = 1;
// better naming
int xPosition = 1;

Avoid Mental Mapping. This is a problem with single-letter variable names.

// good loop counter
customers[i];
customers[j];
customers[k];
// bad loop counter
// Seems like number '1'
customers[l];

N4. Jednoznaczne nazwy

Tworzenie nazw, które można wymówić

Znaczna część naszego mózgu jest stworzona do interpretacji słów. Słowa z definicji dają się wymówić. Jeżeli nie będziemy mogli ich wymówić, nie będzie można swobodnie dyskutować o kodzie. Ma to znaczenie, ponieważ programowanie jest aktywnością społeczną. Prefixy i notacja wegierska utrudniaja wymowienie nazwy.

Unikanie odwzorowania mentalnego

Czytający kod nie powinni mentalnie przekształcać naszych nazw na inne.

N4: Unambiguous Name

Don’t Be Cute

Choose clarity over entertainment value.

// bad naming
void hollyHandGranade();
// good naming
void deleteAllItems();

Pick One Word per Concept Pick one word for one abstract concept and stick with it. For instance, it’s confusing to have fetch, retrieve, and get as equivalent methods of different classes.

N4. Jednoznaczne nazwy

Nie bądź dowcipny

Jeżeli nazwy są dowcipnymi szaradami, mogą być zapamiętane wyłącznie przez osoby z identycznym co autor poczuciem humoru, i tylko dopóty, dopóki będą one pamiętać dany żart słowny. Czy wszyscy będą wiedzieli, do czego służy funkcja o nazwie HolyHandGrenade? Czytelność jest zawsze ważniejsza od wrażeń artystycznych

Wybieraj jedno słowo na pojęcie

Należy stosować zasadę jedno słowo na jedno abstrakcyjne pojęcie i trzymać się jej. Na przykład mylące jest stosowanie nazw fetch, retrieve i get do analogicznych metod w różnych klasach.

N4: Unambiguous Name

If names must be different, then they should also mean something different.

Make Meaningful Distinctions
- Number-series naming (a1, a2, .. aN)
  It is the opposite of intentional naming. Such names are noninformative.
- Noise words like Info, Data, the, a, an, variable Noise words are redundant. The word variable should never appear in a variable name.

N4. Jednoznaczne nazwy

Jeżeli nazwy muszą być różne, to powinny one również znaczyć coś innego.

Tworzenie wyraźnych różnic
- Nazwy korzystające z numerów kolejnych (al, a2, … aN) są przeciwieństwem nazw znaczących. Nazwy te nie są dezinformujące. Nie niosą ze sobą żadnej informacji o intencjach autora.
- Szum słowny. Słowa Info, Data, the,a, an, variable, są równie mało znaczącymi nazwami. Nazwy powinny różnić się w taki sposób, aby czytelnik wiedział, na czym polega różnica. Jeżeli utworzymy inną klasę, o nazwie ProductInfo lub ProduetData, otrzymamy różne nazwy, ale nie spowodujemy, że będą one znaczyły coś innego.

N5: Use Long Names

Use Long Names for Long Scopes

The length of a name should be related to the length of the scope. You can use very short variable names for tiny scopes, but for big scopes you should use longer names.

def make_md_from_from_sfa_tokens(tokens) -> str:
md = '' # short scope
for token in tokens:
    local_md = make_md_description_from_sfa_token(token)
    md += local_md + '\r\n'
return md

Use Searchable Names

Single-letter names and numeric constants have a particular problem in that they are not easy to locate across a body of text.

Użycie długich nazw dla długich zakresów

Długość nazwy powinna być związana z długością zakresu. Można używać bardzo krótkich nazw dla niewielkich zakresów, ale dla dużych zakresów powinny być używane długie nazwy.

Korzystanie z nazw łatwych do wyszukania

Nazwy jednoliterowe i stałe numeryczne, są szczególnie niewygodne, ponieważ nie można ich łatwo wyszukać w tekście.

Podobnie jest z nazwą ‘e’ . Jest złym wyborem, jeżeli jest używana do zmiennej, którą programista będzie chciał kiedykolwiek znaleźć. Jest to najczęściej występująca litera w języku angielskim i najprawdopodobniej zostanie wyszukana w każdym fragmencie tekstu. Pod tym względem dłuższe nazwy przewyższają krótsze.

Code excercises

Functions

G30: Functions Should Do One Thing

Single responsibility rule

FUNCTIONS SHOULD DO ONE THING

THEY SHOULD DO IT WELL

THEY SHOULD DO IT ONLY

Sections within Functions

// Bad example, fuction pay() does 3 operations
public void pay() {
    for (Employee e : employees) { // review data of each emploee
        if (e.isPayday()) { // checks if emploee is allowd to pay
            Money pay = e.calculatePay(); // then finally calculates the paymen and pays
            e.deliverPay(pay);
        }
    }
}

G30: Functions Should Do One Thing

Functions that do one thing cannot be reasonably divided into sections.

// Better example
public void pay() {
    for (Employee e : employees){
        paylfNecessary(e);
    }
}
private void payIfNecessary(Employee e) {
    if (e.isPaydayi)){
        calculateAndDeliverPay(e);
    }   
}
private void calculateAndDeliverPay(Employee e) {
    Money pay = e .calculatePayi);
    e.deliverPay(pay);
}

Small Functions

Functions should not be 100 lines long. Functions should hardly ever be 20 lines long.

G20: Function Names Should Say What They Do

Would you expect this to add five days to the date? Or is it weeks, or hours? You can’t tell from the call what the function does.

// bad example
// Does it add 5 days, weeks or hours to the date?
Date newDate = date.add(5);

If the function adds five days to the date and changes the date:

//better solution, if it incraeses by 5 days the better solution coold be
date.increaseByDays(5);

G5: Duplication

This is one of the most important rules in this book, it’s been called DRY principle (Don’t Repeat Yourself).

OO itself is a strategy for organizing modules and eliminating duplication.
Find and eliminate duplication wherever you can.

More subtle duplications are the modules that have similar algorithms, but that don’t share similar lines of code.

F1: Too Many Arguments

No argument is best, followed by one, two, and three. More than three is very questionable and should be avoided. They take a lot of conceptual power. Arguments are even harder from a testing point of view, because demands to write many test cases to ensure that all the various combinations of arguments work properly. If there are no arguments, this is trivial.

When a function seems to need more than two or three arguments, it is likely that some of those arguments ought to be wrapped into a class of their own.

 Circle makeCircle(double x, double y, double radius);
Circle makeCircle(Point center, double radius);

Reducing the number of arguments by creating objects out of them may seem like cheating, but it’s not. When groups of variables are passed together, they are likely part of a concept.

F1. Nadmiar argumentów

Funkcje powinny mieć małą liczbę argumentów. Najlepszym przypadkiem jest… brak argumentów, następnie mogą być: jeden, dwa lub trzy argumenty. Większa liczba argumentów stanowi potencjalne źródło problemów. Argumenty są kłopotliwe. Wymagają one użycia sporej ilości energii koncepcyjnej.

Argumenty są jeszcze bardziej kłopotliwe z punktu widzenia testowania. Trudno napisać wszystkie testy jednostkowe zapewniające, że wszystkie kombinacje argumentów będą działały prawidłowo. Jeżeli nie ma argumentów, jest to bardzo proste.

Gdy wydaje się, że funkcja wymaga więcej niż dwóch lub trzech argumentów, najprawdopodobniej niektóre z nich powinny być umieszczone w osobnej klasie.

Zmniejszenie liczby argumentów przez utworzenie z nich obiektów może wydawać się oszustwem, ale tak nie jest. Gdy grupy zmiennych są przesyłane wspólnie, tak jak x oraz y w powyższym przykładzie, najprawdopodobniej są częścią obiektu, który zasługuje na własną nazwę.

G10: Vertical Separation

Variables and function should be defined close to where they are used. Local variables should be declared just above their first usage and should have a small vertical scope.
Dependent functions should be defined just below their first usage. If one function calls another, they should be vertically close, and the caller should be above the callee. This gives the program a naturalflow.

G10. Separacja pionowa

Zmienne i funkcje powinny być definiowane blisko miejsca, gdzie są użyte. Zmienne lokalne powinny być deklarowane bezpośrednio ponad pierwszym użyciem i powinny mieć mały zakres pionowy. Nie chcemy zmiennych lokalnych deklarowanych setki wierszy od ich zastosowania.
Funkcje zależne. Jeżeli jedna funkcja wywołuje inną, powinny być one położone blisko siebie, a funkcja wywołująca powinna być umieszczona powyżej wywoływanej, o ile jest to możliwe. Pozwala to osiągnąć naturalny przebieg programu. Jeżeli konwencja ta jest stosowana konsekwentnie, czytelnik będzie mógł zaufać nam, że definicja funkcji znajduje się zaraz po jej wywołaniu.

G10: Vertical Separation

Reading Code from Top to Bottom: The Stepdown Rule

The code should be read like a top-down, every function to be followed by those at the next level of abstraction so that we can read the program, descending one level of abstraction at a time as we read down the list of functions. I call this The Stepdown Rule.

Czytanie kodu od góry do dołu - zasada zstępująca

Chcemy, aby kod można było czytać od góry do dołu. Chcemy, aby po każdej funkcji znajdowała się kolejna, na następnym poziomie abstrakcji, dzięki czemu można czytać program, schodząc o jeden poziom abstrakcji niżej wraz z przejściem do kolejnej funkcji w pliku. Nazywam to zasadą zstępującą.

Inaczej mówiąc, chcemy, aby można było czytać program tak, jakby był zbiorem akapitów, z których każdy opisuje bieżący poziom abstrakcji odwołujących się do kolejnych akapitówna następnym poziomie.

Praktyka pokazuje, że programiści mają problemy z przyswojeniem sobie tej reguły i pisaniem funkcji działających na jednym poziomie abstrakcji. Programista, który chce być profesjonalistą, powinien jednak wypracować sobie nawyk tworzenia krótkich funkcji, które na pewno wykonują “jedną operację”.

F4: Dead Function / G9: Dead Code

Dead code is code that isn’t executed. The problem with dead code is that after awhile it starts to smell. This is because dead code is not completely updated when designs change.

It still compiles, but it does not follow newer conventions or rules. It was written at a time when the system was different. When you find dead code, delete it from the system. Do not afraid, your source code control system still remembers it.

F4. Martwe funkcje / Martwy kod

Martwy kod to taki, który nie jest wykonywany. Problem z martwym kodem jest taki, że szybko zaczyna wydzielać zapach. Im jest starszy, tym ten zapach staje się silniejszy i kwaśniejszy. Dzieje się tak, ponieważ martwy kod nie jest w pełni aktualizowany w czasie zmian projektu.

Nadal kompiluje się, ale nie jest zgodny z najnowszymi konwencjami i zasadami. Był on pisany w czasie, gdy system był inny.

Gdy znajdziemy martwy kod, możemy zrobić tylko jedną właściwą rzecz, usunąć go. Nie trzeba się obawiać usuwania funkcji — system kontroli wersji nadal ją pamięta.

G29: Avoid Negative Conditionals

Negatives are harder to understand than positives. So, when possible, conditionals should be expressed as positives.

// Is worse, negative conditional
if (!buffer.shouldNotCompact())
// Is better, positive conditional
if (buffer.shouldCompact())

G35: Keep Configurable Data at High Levels

Constant such as a default or configuration value that is known and expected at a high level of abstraction, do not bury it in a low-level function. Expose it as an argument to that low-level function called from the high-level function.

Comments

Komentarze

Nie są one "czystym dobrem". W rzeczywistości komentarze są w najlepszym przypadku koniecznym złem. Prawidłowe zastosowanie komentarzy jest kompensowaniem naszych błędów przy tworzeniu kodu (wysławianiu się w kodzie). Obecność komentarzy zawsze sygnalizuje nieporadność programisty. Musimy korzystać z nich, ponieważ nie zawsze wiemy, jak wyrazić nasze intencje bez ich użycia, ale ich obecność nie jest powodem do świętowania. Gdy uznamy, że konieczne jest napisanie komentarza, należy pomyśleć, czy nie istnieje sposób na wyrażenie tego samego w kodzie. Dlaczego jestem tak przeciwny komentarzom? Ponieważ one kłamią. Nie zawsze, nie rozmyślnie, ale nader często. Im starsze są komentarze, tym większe prawdopodobieństwo, że są po prostu błędne. Powód jest prosty. Programiści nie są w stanie utrzymywać ich aktualności. Złe komentarze
Do tej kategorii należy większość komentarzy. Zwykle są to podpory złego kodu lub wymówki albo uzasadnienie niewystarczających decyzji znaczące niewiele więcej niż dyskusja programisty ze sobą.

C3: Redundant Comment

A comment is redundant if it describes something that adequately describes itself. Comments should say things that the code cannot say for itself.

i++;// increment i`

C3: Redundant Comment

Journal Comments

Sometimes people add a comment to the start of a module every time they edit it. These comments accumulate as a kind of journal, or log, of every change that has ever been made. Nowadays, we have source code control systems, these long journals make harder to read the code.

/*
* Changes (from 11-Oct-2001)
* --------------------------
* 11-Oct-2001 : Re-organised the class and moved it to new package
* com.jrefinery.date (DG);
* 05-Nov-2001 : Added a getDescription() method, and eliminated NotableDate
* class (DG);
* 12-Nov-2001 : IBD requires setDescription() method, now that NotableDate
* class is gone (DG); Changed getPreviousDayOfWeek(),
* getFollowingDayOfWeek() and getNearestDayOfWeek() to correct
* bugs (DG);
* 05-Dec-2001 : Fixed bug in SpreadsheetDate class (DG);
* 29-May-2002 : Moved the month constants into a separate interface
* (MonthConstants) (DG);
* 27-Aug-2002 : Fixed bug in addMonths() method, thanks to Nlevka Petr (DG);
* 03-Oct-2002 : Fixed errors reported by Checkstyle (DG);
* 13-Mar-2003 : Implemented Serializable (DG);
* 29-May-2003 : Fixed bug in addMonths method (DG);
* 04-Sep-2003 : Implemented Comparable. Updated the isInRange javadocs (DG);
* 05-Jan-2005 : Fixed bug in addYears() method (1096282) (DG);
*/

C3: Redundant Comment

Noise Comments

Sometimes you see comments that are nothing but noise. They restate the obvious and provide no new information.

/**
* Default constructor.
*/
protected AnnualDateRule() {
}

/** The day of the month. */
private int dayOfMonth;

/**
* Returns the day of the month.
*
* @return the day of the month.
*/
//Give me a break!
public int getDayOfMonth() {
return dayOfMonth;
}

These comments are so noisy that we learn to ignore them, our eyes tend to skip them over.

C3: Redundant Comment

Don’t Use a Comment When You Can Use a Function or a Variable

Consider the following stretch of code:

// does the module from the global list /mod/ depend on the
// subsystem we are part of
if (smodule
        .getDependSubsystems()
        .contains(subSysMod.getSubSystem())
    ){}

This could be rephrased without the comment as:

ArrayList moduleDependees = smodule.getDependSubsystems();
String ourSubSystem = subSysMod.getSubSystem();
if (moduleDependees
        .contains(ourSubSystem)
    ){}

Sometimes cleanerCode != smarterCode

C3: Redundant Comment

Position Markers

Sometimes programmers like to mark a particular position in a source file.

// Actions //////////////////////////////////

Right after eliminating long tail of slashes, sometimes it makes sense to use such marker to gather certain functions together. But in general they are clutter that should be eliminated. Use them occasionally, and only when the benefit is significant. If too many, our eyes learn to ignore them.

C3. Nadmiarowe komentarze

Znaczniki pozycji

Czasami programiści lubią zaznaczać określone miejsca w pliku źródłowym.

Zaraz po wyeliminowaniu tego ogona slashy, czsem ma sen stosowanie tego typu markerów jeżeli chcemy zgromadzić okreslone funkcje razem. Niemniej najeczęściej tego typu znaczniki tworzą bałagan, który powinien być wyelimonowany. Tak więc warto używać ich oszczędnie i tylko wtedy, gdy ich zalety są wyraźne. Jeżeli zbyt często używamy tych transparentów, zaczynają być traktowane jako szum tła i ignorowane.

C3: Redundant Comment

Closing Brace Comments

Although this might make sense for long functions with deeply nested structures, it serves only to clutter the kind of small and encapsulated functions that we prefer. So if you find yourself wanting to mark your closing braces, try to shorten your functions instead.

while ((line = in.readLine()) != null) {
    lineCount++;
    charCount += line.length();
    String words[] = line.split("\\W");
    wordCount += words.length;
} //while

C3: Redundant Comment

Attributions and Bylines

/* Added by Rick */

Source code control systems are very good at remembering who added what, when. There is no need to pollute the code with little bylines.

C3. Nadmiarowe komentarze

Atrybuty i dopiski

Systemy kontroli wersji świetnie nadają się do zapamiętywania, kto (i kiedy) dodał określony fragment. Nie ma potrzeby zaśmiecania kodu tymi małymi dopiskami. Można uważać, że tego typu komentarze będą przydatne do sprawdzenia, z kim można porozmawiać na temat danego fragmentu kodu. Rzeczywistość jest inna - zwykle zostają tam przez lata, tracąc na dokładności i użyteczności. Pamiętajmy - systemy kontroli wersji są lepszym miejscem dla tego rodzaju informacji

C3: Redundant Comment

Too Much Information

The comment below was extracted from a module designed to test that a function could encode and decode base64 coding.

It is good enough to have RFC number, someone reading this code has no need for the arcane information contained in the comment.

/*
RFC 2045 - Multipurpose Internet Mail Extensions (MIME)
Part One: Format of Internet Message Bodies
section 6.8. Base64 Content-Transfer-Encoding
The encoding process represents 24-bit groups of input bits as output
strings of 4 encoded characters. Proceeding from left to right, a
24-bit input group is formed by concatenating 3 8-bit input groups.
These 24 bits are then treated as 4 concatenated 6-bit groups, each
of which is translated into a single digit in the base64 alphabet.
When encoding a bit stream via the base64 encoding, the bit stream
must be presumed to be ordered with the most-significant-bit first.
That is, the first bit in the stream will be the high-order bit in
the first 8-bit byte, and the eighth bit will be the low-order bit in
the first 8-bit byte, and so on.
*/

C3. Nadmiarowe komentarze

Nadmiar informacji

Nie należy umieszczać w komentarzach interesujących z punktu widzenia historii dyskusji lub luźnych opisów szczegółów. Komentarz zamieszczony poniżej został pobrany z modułu mającego za zadanie sprawdzić, czy funkcja może kodować i dekodować zgodnie ze standardem base64. Osoba czytająca ten kod numer referencyjny do standardu RFC powinien być wystarczający, tam jest pełna informacja na temat standardu kodowania base64, stąd komentarz jest zbędny.

C3: Redundant Comment

Mandated Comments

It is just plain silly to have a rule that says that every function must have a doc, or every variable must have a comment. Comments like this just clutter up the code, propagate lies.

/**
*
* @param title The title of the CD
* @param author The author of the CD
* @param tracks The number of tracks on the CD
* @param durationInMinutes The duration of the CD in minutes
*/
public void addCD(String title, String author, int tracks, int durationInMinutes) {
    CD cd = new CD();
    cd.title = title;
    cd.author = author;
    cd.tracks = tracks;
    cd.duration = duration;
    cdList.add(cd);
}

C4: Poorly Written Comment

Inobvious Connection

The connection between a comment and the code it describes should be obvious. If you are going to the trouble to write a comment, then at least you’d like the reader to be able to look at the comment and the code and understand what the comment is talking about.

/*
* start with an array that is big enough to hold all the pixels
* (plus filter bytes), and an extra 200 bytes for header info
*/
this.pngBytes = new byte[((this.width + 1) * this.height * 3) + 200];

What is filter, what is its the relation to ‘+ 1‘ or ‘* 3‘ or both of them? Is the pixel a ‘byte‘? Why header info takes ‘200 bytes‘?

C5: Commented-Out Code

Commented-Out Code

Don’t do this! Others who see that commented-out code won’t have the courage to delete it. They’ll think it is there for a reason and is too important to delete. That code sits there and rots, getting less and less relevant with every passing day. It calls functions that no longer exist. It uses variables whose names have changed.

InputStreamResponse response = new InputStreamResponse();
response.setBody(formatter.getResultStream(), formatter.getByteCount());
// InputStream resultsStream = formatter.getResultStream();
// StreamReader reader = new StreamReader(resultsStream);
// response.setContent(reader.read(formatter.getByteCount()));

Source code control systems will remember the code for us. We don’t have to comment it out any more. Just delete the code.

Code excercises

Foramtting

Blocks and Indenting

public class FitNesseServer implements SocketServer { private FitNesseContext
context; public FitNesseServerCFitNesseContext context) { this.context =
context; } public void serveCSocket s) { serveCs, 10000); } public void
serveCSocket s, long requestTimeout) { try { FitNesseExpediter sender = new
FitNesseExpediterCs, context); sender.setRequestParsingTimeLimitCrequestTimeout); sender.startC); }
catchCException e) { e.printStackTraceC); } } }

Code formatting is about communication

A source file is a hierarchy rather like an outline. To make this hierarchy of scopes visible, we indent the lines of source code in proportion to their position in the hiearchy. This allows them to quickly hop over scopes. Without indentation, programs would be virtually unreadable by humans. However it is good enough css/js code for the minification!

The indent level of a function should not be greater than one or two, which makes the functions easier to read and understand.

Bloki i Wcięcia

Plik źródłowy stanowi pewnego rodzaju hierarchię, a nie szablon. Znajdują się w nim informacje odnoszące się do pliku jako całości, do poszczególnych klas w pliku, do metod w klasach, do bloków w metodach oraz, rekurencyjnie, do bloków w blokach. Każdy poziom hierarchii jest zakresem, w którym są zadeklarowane nazwy oraz w którym są interpretowane deklaracje i instrukcje wykonywalne. Aby hierarchia ta była widoczna, wcinamy wiersze kodu źródłowego odpowiednio do pozycji w hierarchii.

Vertical Formatting

Comparing 7 different projects, files that are typically ~40 lines long, with an upper limit of ~500. Small files are usually easier to understand than large files are. Desirable is building an app having from 200 to 500 lines per file, however, this should not be a hard rule.

vertical_box_plot

Figure: File length distributions LOG scale (box height = sigma)

EKS. I. Rozkład wielkości plików w skali logarytmicznej (wysokość prostokąta = sigma)

Porównując ze sobą pliki siedmiu projektów - Junit, FitNesse, testNG, Time and Money, JDepend, Ant oraz Tomcat, okazuje się, żeśrednia znajduje się w pomiędzy ~40 a ~500 linii.

Linie przechodzące przez prostokąty pokazują minimalną i maksymalną wielkość pliku w każdym projekcie. Prostokąt pokazuje średnio jedną trzecią (jedno standardowe odchyleniel) liczby plików. Środek prostokąta oznacza średnią.

Wydaje się, że możliwe jest zbudowanie znaczącego systemu plików mających przeciętnie 200 wierszy, a nieprzekraczających 500 wierszy. Choć nie musi to być sztywna reguła, to jednak powinna być bardzo pożądana. Małe pliki są zwykle łatwiejsze do zrozumienia niż duże.

Vertical Openness Between Concepts

Nearly all code is read left to right and top to bottom. Each line represents an expression or a clause, and each group of lines represents a complete thought. Those thoughts should be separated from each other with blank lines.

def is_str_instance(v):
    return isinstance(v, str)


def hex_str_to_int(input: str) -> int:
    return int(input, 16)


def int_to_hex(input: int) -> hex:
    return hex(int)

Horizontal Formatting

Below figure suggests that we should strive to keep lines short. The old Hollerith limit was 80, currently lines edging out to 100 or even 120 are fine. Beyond that is probably just careless. But monitors are too wide for that nowadays, and programmers can shrink the font so small that they can get 200 characters across the screen. Don’t do that.

line_width

Figure. Java line width distribution

Przedstawiony jest rozkład długości wierszy wszystkich siedmiu projektów. Wiersze krótsze niż 10 znaków stanowią 30% całej populacji, natomiast wiersze o średniej długości od 20 do 60 znaków stanowią 40% całej populacji, co daje już w sumie 70%. Linie z ilością znaków do 120 zdecydowanie przekraczają 80% całej populacji (zasada pareto). Programiści preferują krótkie wiersze. A zatem powinniśmy tworzyć krótkie wiersze. Stare ograniczenie Holleritha do 80 znaków jest zbyt restrykcyjne i osobiście nie mam nic przeciwko wierszom mającym 100, a nawet 120 znaków. Jednak wartości powyżej tej granicy są zbyt duże.

Obecnie monitory są bardzo szerokie, a młodsi programiści mogą tak zmniejszyć czcionkę, że uzyskują wiersze o długości 200 znaków. Nie jest to dobra praktyka. Ja staram się nie przekraczać długości 120 znaków.

Horizontal Openness and Density

We use horizontal white space to associate things that are strongly related and disassociate things that are more weakly related.

private void measureLine(String line) {
    lineCount++;
    int lineSize = line.length();
    totalChars += lineSize;
    lineWidthHistogram.addLine(lineSize, lineCount);
    recordWidestLine(lineSize);
}

Spaces between assignment operators
No spaces between function name and opening parenthesis

Poziome odstępy i gęstość

Odstępy poziome służą do kojarzenia elementów ściśle powiązanych i rozłączania elementów, które są ze sobą luźniej związane.

Operatory przypisania otoczyłem odstępami w celu ich wyróżnienia. Instrukcje przypisania mają dwa osobne główne elementy: lewą i prawą stronę. Odstępy powodują, że rozdzielenie to jest oczywiste. Z drugiej strony, nie dodaję odstępów pomiędzy nazwami funkcji a otwierającymi nawiasami. Funkcje i argumenty są ze sobą bardzo ściśle związane. Rozdzielenie ich spowodowałoby, że wyglądałyby na rozłączone, a nie złączone. Rozdzielam argumenty wywołania funkcji, aby zaznaczyć przecinek i pokazać, że argumenty są osobne.

Horizontal Alignment

Horizontal alignment was common for assembly language programmers. That this kind of alignment is not useful. The alignment seems to emphasize the wrong things and leads my eye away from the true intent. You are tempted to read down the list of variable names without looking at their types.

public class FitNesseExpediter implements ResponseSender
{
    private   Socket          socket;
    private   InputStream     input;
    private   OutputStream    output;
    private   Request         request;
    private   Response        response;
    private   FitNesseContext context;
    protected long            requestParsingTimeLimit;
    private   long            requestProgress;
    private   long            requestParsingDeadline;
    private   boolean         hasError;
    public    FitNesseExpediter(Socket          s, 
                                FitNesseContext context) throws Exception
    {
        this.context =            context;
        socket =                  s;
        input =                   s.getInputStream();
        output =                  s.getOutputStream();
        requestParsingTimeLimit = 10000;
    }
}

Automatic reformatting tools usually eliminate this kind of alignment.

Rozmieszczenie poziome

Programując w asemblerze, stosowało się rozmieszczenie poziome w celu wyróżniania określonych struktur.

Niemniej tego typu wyrównywanie nie jest użyteczne. Wyrównanie takie powoduje skupienie uwagi na niewłaściwych elementach i odciąga wzrok od właściwych. Na przykład w przedstawionej powyżej liście deklaracji mamy pokusę czytania listy zmiennych bez zwracania uwagi na ich typy. Podobnie w instrukcjach przypisania zwykle patrzymy na listę wartości, bez zwracania uwagi na operator przypisania. Co gorsza, automatyczne narzędzia formatujące zwykle eliminują tego rodzaju wyrównanie.

Code excercises

Thank you for your attention.

Good coding practices

Prepared by:

Introduction to Clean Code

Introduction

Definition of programming and code

Dirty code characteristics

When to write dirty code

When to write dirty code

Why keep the code clean

Why keep the code clean

Why keep the code clean

Clean code characteristics

Code quality measurement

The Boy Scout Rule

Names

Naming Conventions

N6: Avoid Encodings

G25: Replace Magic Numbers with Named Constants

G25: Replace Magic Numbers with Named Constants

N1: Choose Descriptive Names

N1: Choose Descriptive Names

N1: Choose Descriptive Names

N3: Use Standard Nomenclature Where Possible

N4: Unambiguous Names

N4: Unambiguous Names

N4: Unambiguous Names

N4: Unambiguous Name

N4: Unambiguous Name

N5: Use Long Names

Code excercises

Code excercises

Functions

G30: Functions Should Do One Thing

G30: Functions Should Do One Thing

Small Functions

G20: Function Names Should Say What They Do

G5: Duplication

F1: Too Many Arguments

G10: Vertical Separation

G10: Vertical Separation

F4: Dead Function / G9: Dead Code

G29: Avoid Negative Conditionals

G35: Keep Configurable Data at High Levels

Comments

C3: Redundant Comment

C3: Redundant Comment

C3: Redundant Comment

C3: Redundant Comment

C3: Redundant Comment

C3: Redundant Comment

C3: Redundant Comment

C3: Redundant Comment

C3: Redundant Comment

C4: Poorly Written Comment

C5: Commented-Out Code

Code excercises

Foramtting

Blocks and Indenting

Vertical Formatting

Vertical Openness Between Concepts

Horizontal Formatting

Horizontal Openness and Density

Horizontal Alignment

Code excercises

Thank you for your attention.