Django: How do I get every table and all of that table's columns in a project?

I'm creating a set of SQL full database copy scripts using MySQL's INTO OUTFILE and LOAD DATA LOCAL INFILE.

Specifically:

SELECT {columns} FROM {table} INTO OUTFILE '{table}.csv'

LOAD DATA LOCAL INFILE '{table}.csv' REPLACE INTO TABLE {table} ({columns})

Because of this, I don't need just the tables; I also need the columns for each table.
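
Roughly, once I have a (table, columns) pair for every table, the per-table generation I have in mind looks something like this sketch (copy_statements and the template names are just illustrative):

# Templates for the dump and load statements, filled in once per table.
SELECT_SQL = "SELECT {columns} FROM {table} INTO OUTFILE '{table}.csv'"
LOAD_SQL = "LOAD DATA LOCAL INFILE '{table}.csv' REPLACE INTO TABLE {table} ({columns})"

def copy_statements(table, columns):
    # Join the column names so they drop into both templates unchanged.
    cols = ', '.join(columns)
    return (SELECT_SQL.format(table=table, columns=cols),
            LOAD_SQL.format(table=table, columns=cols))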

I can get all of the tables and columns, but this doesn't include m2m tables:

from django.db.models import get_models
for model in get_models():
    table = model._meta.db_table
    columns = [field.column for field in model._meta.fields]

I can also get all of the tables, but this doesn't give me access to the columns:

from django.db import connection
tables = connection.introspection.table_names()

How do you get every table and every corresponding column on that table for a Django project?

More details:

I'm doing this on a reasonably large dataset (>1GB) so using the flat file method seems to be the only reasonable way to make this large of a copy in MySQL. I already have the schema copied over (using ./manage.py syncdb --migrate) and the issue I'm having is specifically with copying the data, which requires me to have the tables and columns to create proper SQL statements. Also, the reason I can't use default column ordering is because the production database I'm copying from has different column ordering than what is created with a fresh syncdb (due to many months worth of migrations and schema changes).


Have you taken a look at manage.py?

You can get boatloads of SQL information. For example, to get all of the CREATE TABLE statements for an app within your project, you can do:

python manage.py sqlall <appname>

If you type:

python manage.py help

You can see a ton of other features.


I dug into the source to find this solution. I feel like there's probably a better way, but this does the trick.

This first block gets all of the normal (non-m2m) tables and their columns

from django.db import connection
from django.apps import apps

table_info = []
tables = connection.introspection.table_names()

for model in apps.get_models():
    # Proxy models share a table with their concrete model, so skip them.
    if model._meta.proxy:
        continue

    table = model._meta.db_table
    # Skip models whose table doesn't actually exist in the database.
    if table not in tables:
        continue

    columns = [field.column for field in model._meta.fields]
    table_info.append((table, columns))

This next block was the tricky part. It gets all the m2m field tables and their columns.

for model in apps.get_models():
    for field in model._meta.local_many_to_many:
        # Only auto-created join tables need handling here; explicit "through"
        # models are already picked up as regular models in the loop above.
        if not field.remote_field.through._meta.auto_created:
            continue

        table = field.m2m_db_table()
        if table not in tables:
            continue

        # Auto-created m2m tables always have an id column plus the two FK columns.
        columns = ['id', field.m2m_column_name(), field.m2m_reverse_name()]
        table_info.append((table, columns))
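
If querying the database directly is acceptable, here is a rougher sketch (untested, and assuming get_table_description behaves the same across backends) that skips the model layer entirely and asks the backend for each table's columns. It covers the m2m tables too, and should return columns in the table's own order:

from django.db import connection

table_info = []
with connection.cursor() as cursor:
    for table in connection.introspection.table_names(cursor):
        # Each entry describes one column; the first element is the column name.
        description = connection.introspection.get_table_description(cursor, table)
        table_info.append((table, [col[0] for col in description]))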


Have you looked into "manage.py dumpdata" and "manage.py loaddata"? They dump and load in JSON format. I use them to dump data from one site and overwrite another site's database. dumpdata doesn't have an "every database" option, but you can call it in a loop on the results of a "manage.py dbshell" command.
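
For reference, the basic round trip looks something like this (dump.json is just a placeholder filename):

python manage.py dumpdata > dump.json
python manage.py loaddata dump.json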
