All information at species level is hooked to scientific names. It is crucial to have the list of species very quickly at the beginning of the project, if possible from electronic lists.
It is also crucial to identify taxonomic references to validate the choice of the current accepted names, and to link the names to these references when available on the web.
The lists are extracted in the following decreasing order:
Unless synonyms are under electronic format, they are not entered as a priority; the only synonyms encoded are those that are used in sources of other information.
The source is always recorded, as well as the type to allow the user to assess the reliability of the name.
The common names in English and other languages are entered only when available in electronic format or from compilations. However, some groups are prioritized when common names are well known such as, e.g., in marine mammals.
Some common names were already entered in Species 2000 by a FishBase team member.
The lists are extracted in the following decreasing priority:
In addition to country, the state/provincial levels are considered. A geographic standard was established for the marine areas on the same model as the TDWG geographic standard for the terrestrial areas.
Distribution by country and subdivisions: from printed compilations and monographs (FAO publications first), from published distribution maps, from country lists, in that order.
Distribution by FAO areas: from FAO publications, from published distribution maps, from distribution by country above, in that order.
Distribution by ecosystem: from distribution by FAO area and country above and from printed compilations. Distribution by Large Marine Ecosystems will be by oceans and then by principal seas.
Distribution by depth: from printed compilations and monographs (FAO publications first).
This data is a crucial key point for biodiversity and ecosystem studies, but rarely available from electronic sources.
This information is extracted on opportunistic basis mainly from FAO publications and printed monographs. Targeted species searches were performed for important species, e.g., threatened, invasive and commercially important species.
This information is rarely available from electronic sources. Moreover, various standards are used, and may depend on the taxonomic group. The FishBase standard is used after it is reassessed and completed for invertebrates.
The phylum- to class-group levels are classified as small, medium or large groups.
This encoding strategy has changed since the completion of all the small and many of the medium groups in 2007. Each encoder’s weekly programme now consists of encoding data for 50% of the remaining groups and encoding life history parameters for targeted species groups. In addition, the 5 remaining encoders are each in charge of special topics, viz.: faunal lists, life-history parameters, ecological parameters, pictures and targeted reference-searches.
Note that the rapid completion and availability of results/data on small and medium groups were psychologically important to the encoder and the donor alike as they measure and assess the achievements of the project. Moving forward from encoding scientific names to encoding, e.g., life-history parameters, gave the encoders a sense of accomplishment in spite of the huge tasks still ahead. Our short-term doable targets and their completion provide us with milestones with which we measure our accomplishments. This strategy has so far been proven useful.
Some indicators of data encoding progress were established at the beginning of the project reflecting the completion of data encoding by taxonomic group, and the advancement of the project relatively to the expected number of species.
It is important to consider these indicators at various taxonomic levels from phylum to species as we can show rapid completion at phylum to family levels, whereas genus and species level are a long-term goal.