Editing WikiDB/Tables

From TestWiki
Jump to: navigation, search

Warning: You are not logged in. Your IP address will be publicly visible if you make any edits. If you log in or create an account, your edits will be attributed to your username, along with other benefits.

The edit can be undone. Please check the comparison below to verify that this is what you want to do, and then save the changes below to finish undoing the edit.
Latest revision Your text
Line 1: Line 1:
There are three additional tables that WikiDB will add to the database in order to store the data required by the extension. In all cases, the data stored in these tables is derived from data elsewhere on your wiki, therefore it is always safe to delete and rebuild these tables, if necessary.
+
There are three new tables in the database that store the WikiDB data. If you have any suggestions about how these could be further optimised, please discuss them on the [[Talk:WikiDB/Tables|talk page]].
 
+
If you have any suggestions about how these tables could be further optimised, please discuss them on the [[Talk:WikiDB/Tables|talk page]].
+
 
+
 
+
----
+
'''IMPORTANT NOTES - READ THESE BEFORE YOU CONTINUE!'''
+
 
+
# WikiDB comes with scripts to add these tables to the database, and to populate them with data (or to reset/re-populate them if necessary, e.g. because your configuration has changed).  The scripts will also allow you to upgrade existing installations if the schema has changed.  Therefore, so long as you have command-line access to run the [[WikiDB/Maintenance scripts|maintenance scripts]] then you should never need to manually modify the database.
+
# The schema may change as the extension develops and changes may be incompatible with the current version.  It may not be obvious how to change from one version to another, and if the incorrect version is used, the behaviour is undefined.  Therefore, take care if you are unable to run the command-line scripts, and so need to modify the tables manually.
+
# If a new release makes changes to the schema, then the major version of the WikiDB software will be incremented, so it should be easy to tell if you need to update your schema.  See [[WikiDB/Versioning]] for more details.
+
# ''The table definitions on this page are live'', and are pulled from the DB that is powering this wiki.  Any changes made to the live DB will automatically be reflected here (which is achieved using my [[Debugging tools]] extension).  This has two implications.  Occasionally I upgrade the wiki before a stable release is made available, for testing purposes.  In this case the schema shown here may be more recent than the latest released version.  Check [[Special:Version]] to see what version of WikiDB this wiki is running.  Secondly, because these are updated live and are not shown in the page history, there is no easy way to view the schema for old versions of WikiDB.  If this is a problem, you should check the SQL in <code>~/maintenance/sql/tables.sql</code>, though this is not in a format that is ready to be run directly on your database.
+
# I cannot be held responsible for any problems or damage that may arise from using the software, deploying with the wrong schema or running the SQL on this page... standard disclaimer, etc. :)
+
 
+
----
+
  
 +
''(Note: The table definitions on this page are live.  Changes to the DB will automatically be reflected here.  This is achieved using my [[Debugging tools]] extension.)''
  
 
== wikidb_tables ==
 
== wikidb_tables ==
Line 22: Line 9:
  
 
Purpose:
 
Purpose:
* To stop us having to parse the table's article text whenever data is requested.
+
* To stop us having to parse the article text whenever data is requested.
  
 
Needs to be updated:
 
Needs to be updated:
 
* Whenever a page in a DB namespace is created/saved.
 
* Whenever a page in a DB namespace is created/saved.
 
* Whenever a page in a DB namespace is moved.
 
* Whenever a page in a DB namespace is moved.
* Whenever a page in a DB namespace is deleted/undeleted.
+
* Whenever a page in a DB namespace is deleted.
* Whenever the [[WikiDB/Configuration settings#$wgWikiDBNamespaces|$wgWikiDBNamespaces configuration setting]] is changed.
+
  
 
Considerations:
 
Considerations:
* If you remove a table namespace redundant records will be left in this table.  This may affect data display.
+
* What happens if a table namespace is deleted?  Hopefully it should all continue working as normally (with some redundant records in the DB), but this needs checking.
* If you convert an existing namespace to a table namespace, this table will not be updated.
+
* Both of the above issues can be resolved by running the [[WikiDB/Maintenance scripts#UpdateTableNS.php|UpdateTableNS maintenance script]].
+
  
  
Line 47: Line 31:
 
* Whenever any page is created/saved.
 
* Whenever any page is created/saved.
 
* Whenever any page is moved.
 
* Whenever any page is moved.
* Whenever any page is deleted/undeleted.
+
* Whenever any page is deleted.
* If we decide to cache formatted versions of the field data, whenever wikidb_tables is updated.
+
* If we decide to cache formatted versions of the fields, whenever wikidb_tables is updated.
  
  
Line 54: Line 38:
 
<describe_table table="wikidb_fielddata"></describe_table>
 
<describe_table table="wikidb_fielddata"></describe_table>
  
Contains a record for each field in each row in each database. Where a field has [[WikiDB/Defining data#Multi-value fields|multiple values within a single record]], there is a separate row for each value.
+
Contains a record for each field in each row in each database.
  
 
Purpose:
 
Purpose:
Line 62: Line 46:
 
Needs to be updated:
 
Needs to be updated:
 
* Whenever wikidb_rowdata is updated.
 
* Whenever wikidb_rowdata is updated.
* Whenever wikidb_tables is updated, as this affects the way we store the data in this table.
+
* If we need to cache formatted versions of the fields (which we probably do), whenever wikidb_tables is updated.
* Whenever [[WikiDB/Tutorial: Creating your own data type|data types]] are added, removed or have their data-handling behaviour modified.
+
  
 
Considerations:
 
Considerations:
 
* All values are in the same text field.
 
* All values are in the same text field.
* Sorting via this field should always give the correct order (whether the values are numeric or alphabetic).
+
* Sorting via this field should always give the correct order (whether numeric or alphabetic).
** This is handled by ensuring that all data is stored in a format that ensures an appropriate order when a text sort is used, including when a field contains more than one data typeFor example, numeric values are padded with spaces, so <code>9.5</code> becomes <code>&nbsp;p&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;9.5&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;</code>, where the string starts with a space (so numbers sort before strings) followed by a character to indicate whether it's positive or negative, and then uses spaces to ensure the decimal point always aligns.
+
** One solution is for numeric values to be stored zero-paddedE.g. 9.5 becomes 00000009.50000000
** An alternative approach, which will probably improve performance (at the expense of disk space) would be to have multiple columns, one per supported data type, and store the data in multiple representations.
+
** Another solution is to cast the data in the query, based on the type of the field.  However this may not be possible for various reasons.
** A third option would be to CAST() or otherwise manipulate the stored data at the point the queries are run, but this feels like it would give too large a performance hit, would be harder to make portable across DB engines and may not be possible for all data types.  However, it would remove the need for the deferred updates, so table changes have immediate effect.
+
* When a table definition changes, all rows in the affected table are marked as stale and are regenerated slowly over time.  This is to avoid performance issues when a high-usage table is modified.  The [[WikiDB/Maintenance scripts#RefreshStaleData.php|RefreshStaleData maintenance script]] can be used to force an update more quickly.
+

Please note that all contributions to TestWiki may be edited, altered, or removed by other contributors. If you do not want your writing to be edited mercilessly, then do not submit it here.
You are also promising us that you wrote this yourself, or copied it from a public domain or similar free resource (see TestWiki:Copyrights for details). Do not submit copyrighted work without permission!

To edit this page, please answer the question that appears below (more info):

Cancel | Editing help (opens in new window)