Welcome 微信登录

首页 / 数据库 / MySQL / MySQL和PostgreSQL 导入数据对比

在虚拟机上测评了下MySQL 和 PostgreSQL 的各种LOAD FILE方式以及时间。 因为是虚拟机上的测评,所以时间只做参考,不要太较真, 看看就好了。MySQL 工具:1. 自带mysqlimport工具。2. 命令行 load data infile ...3. 利用mysql-connector-python Driver来写的脚本。PostgreSQL 工具:1. pgloader 第三方工具。2. 命令行 copy ... from ...3. 利用psycopg2写的python 脚本。测试表结构:mysql> desc t1;
+----------+-----------+------+-----+-------------------+-------+
| Field    | Type      | Null | Key | Default          | Extra |
+----------+-----------+------+-----+-------------------+-------+
| id      | int(11)  | NO  | PRI | NULL              |      |
| rank    | int(11)  | NO  |    | NULL              |      |
| log_time | timestamp | YES  |    | CURRENT_TIMESTAMP |      |
+----------+-----------+------+-----+-------------------+-------+
3 rows in set (0.00 sec)
mysql> select count(*) from t1;
+----------+
| count(*) |
+----------+
|  1000000 |
+----------+
1 row in set (6.80 sec)测试CSV文件:t1.csvMySQL 自身的loader: (时间24妙)mysql> load data infile "/tmp/t1.csv" into table t1 fields terminated by "," enclosed by """ lines terminated by " ";
Query OK, 1000000 rows affected (24.21 sec)
Records: 1000000  Deleted: 0  Skipped: 0  Warnings: 0MySQL python 脚本:(时间23秒)
>>>
Running 23.289 SecondsMySQL 自带mysqlimport:(时间23秒)[root@mysql56-master ~]# time mysqlimport t_girl "/tmp/t1.csv" --fields-terminated-by="," --fields-enclosed-by=""" --lines-terminated-by=" " --use-threads=2 -uroot -proot
t_girl.t1: Records: 1000000  Deleted: 0  Skipped: 0  Warnings: 0
real    0m23.664s
user    0m0.016s
sys    0m0.037sPostgreSQL 自身COPY:(时间7秒)t_girl=# copy t1 from "/tmp/t1.csv" with delimiter ",";
COPY 1000000
Time: 7700.332 msPsycopg2 驱动copy_to方法:(时间6秒)[root@postgresql-instance scripts]# python load_data.py
Running 5.969 Seconds.Pgloader 导入CSV:(时间33秒)[root@postgresql-instance ytt]# pgloader commands.load     
                    table name      read  imported    errors            time
                        ytt.t1    1000000    1000000          0        33.514s
------------------------------  ---------  ---------  ---------  --------------
------------------------------  ---------  ---------  ---------  --------------
            Total import time    1000000    1000000          0        33.514sPgloader 直接从MySQL 拉数据:(时间51秒)[root@postgresql-instance ytt]# pgloader commands.mysql
                    table name      read  imported    errors            time
              fetch meta data          2          2          0          0.138s
------------------------------  ---------  ---------  ---------  --------------
                            t1    1000000    1000000          0        51.136s
------------------------------  ---------  ---------  ---------  --------------
------------------------------  ---------  ---------  ---------  --------------
------------------------------  ---------  ---------  ---------  --------------
            Total import time    1000000    1000000          0        51.274s附上commands.load和commands.mysqlcommands.load:
LOAD CSV 
  FROM "/tmp/ytt.csv" WITH ENCODING UTF-8
        ( 
          id, rank, log_time 
        ) 
  INTO postgresql://t_girl:t_girl@127.0.0.1:5432/t_girl?ytt.t1
  WITH skip header = 0, 
        fields optionally enclosed by """, 
        fields escaped by backslash-quote, 
        fields terminated by "," 
 
    SET work_mem to "32 MB", maintenance_work_mem to "64 MB";
commands.mysql:
LOAD DATABASE 
    FROM mysql://python_user:python_user@192.168.1.131:3306/t_girl?t1
    INTO postgresql://t_girl:t_girl@127.0.0.1:5432/t_girl?ytt.t1
 
 with data only
  SET maintenance_work_mem to "64MB", 
      work_mem to "3MB", 
      search_path to "ytt";
附pgloader 手册:
http://pgloader.io/howto/pgloader.1.html------------------------------------华丽丽的分割线------------------------------------CentOS 6.3环境下yum安装PostgreSQL 9.3 http://www.linuxidc.com/Linux/2014-05/101787.htmPostgreSQL缓存详述 http://www.linuxidc.com/Linux/2013-07/87778.htmWindows平台编译 PostgreSQL http://www.linuxidc.com/Linux/2013-05/85114.htmUbuntu下LAPP(Linux+Apache+PostgreSQL+PHP)环境的配置与安装 http://www.linuxidc.com/Linux/2013-04/83564.htmUbuntu上的phppgAdmin安装及配置 http://www.linuxidc.com/Linux/2011-08/40520.htmCentOS平台下安装PostgreSQL9.3 http://www.linuxidc.com/Linux/2014-05/101723.htmPostgreSQL配置Streaming Replication集群 http://www.linuxidc.com/Linux/2014-05/101724.htm------------------------------------华丽丽的分割线------------------------------------PostgreSQL 的详细介绍:请点这里
PostgreSQL 的下载地址:请点这里本文永久更新链接地址