MUMPS_5.1.1.tar.gz_MUMPS_MUMPSC++_MUMPS installation resource - CSDN Library


mumps

Rated 5 stars (above 95% of resources) · 162 views · uploaded 2022-09-14 23:34:13 · 3.19 MB · GZ

536 files in total

f: 386

h: 40

c: 36

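MUMPS 5.1.1: an efficient parallel solver library for use from C++

MUMPS ("MUltifrontal Massively Parallel Solver") is a powerful open-source parallel solver for large sparse systems of linear equations whose matrix is unsymmetric, symmetric positive definite, or general symmetric. The archive "MUMPS_5.1.1.tar.gz" contains version 5.1.1. The library is written mainly in Fortran and ships a C interface, which is the interface C and C++ projects would normally use.

MUMPS is built for large sparse matrix problems and runs on distributed-memory parallel machines. It targets the large systems that are common in scientific and engineering computing, such as finite element analysis and fluid dynamics simulation, and it handles the unsymmetric systems that arise in practice. The archive contains everything needed to compile the library and link it into your own project: the source code of the core algorithms, the header files, and the build scripts. MUMPS is a direct solver: it computes a multifrontal factorization (LU, or LDL^T for symmetric matrices) and then obtains the solution by forward and backward elimination. It is not an iterative (indirect) solver.

Key features of MUMPS:

1. **Parallelism**: MUMPS uses MPI (Message Passing Interface) for distributed-memory parallel computing and can be combined with multithreading.
2. **Adaptability**: the analysis phase chooses an ordering and a mapping of the computation for the matrix at hand. The defaults usually work, and they can be overridden through the control parameters (ICNTL and CNTL).
3. **Performance**: besides parallel factorization, the package offers an out-of-core facility and, from this release series on, Block Low-Rank (BLR) compression.
4. **Ease of use**: Fortran 95 and C interfaces are provided, and the C interface can be used to integrate MUMPS into C++ code.
5. **Flexibility**: the input matrix can be given in centralized assembled, distributed assembled, or elemental format. Scilab and MATLAB/Octave interfaces are available for sequential execution, and third-party packages such as PETSc and Trilinos provide their own interfaces to MUMPS.

When using MUMPS, developers should pay attention to the following points:

- **Configuration and compilation**: set the MPI compiler and linker options so that MUMPS builds correctly against the MPI implementation on your system.
- **Parallel strategy**: choose the number of processes and the matrix distribution to match the size of the problem and the available computing resources.
- **Error handling**: check the error diagnostics that MUMPS reports when debugging and tuning your code.
- **Performance monitoring**: use profiling tools to monitor the computation and locate bottlenecks.

"MUMPS_5.1.1.tar.gz" is a useful tool for C++ developers who need to solve large linear systems. Understanding MUMPS well can noticeably improve the efficiency and accuracy of scientific computing software.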


MUMPS_5.1.1.tar.gz_MUMPS_MUMPS C++ (536 files)

gelim.c 40KB

tree.c 31KB

ddcreate.c 30KB

mumps_io_basic.c 30KB

ddbisect.c 29KB

interface.c 27KB

mumpsmex.c 26KB

gbipart.c 21KB

intmumpsc.c 20KB

mumps_io.c 20KB

mumps_c.c 19KB

symbfac.c 18KB

mumps_io_thread.c 18KB

gbisect.c 17KB

minpriority.c 17KB

graph.c 16KB

multisector.c 11KB

nestdiss.c 11KB

mumps_pord.c 10KB

bucket.c 8KB

sort.c 6KB

mumps_metis64.c 4KB

mumps_io_err.c 4KB

mumps_metis.c 4KB

c_example.c 2KB

mumps_common.c 2KB

mumps_scotch.c 1KB

mumps_scotch64.c 1KB

mumps_metis_int.c 1KB

elapse.c 855B

mumps_scotch_int.c 710B

mpic.c 633B

mumps_save_restore_C.c 612B

mumps_size.c 563B

mumps_numa.c 413B

mumps_thread.c 375B

ChangeLog 26KB

CREDITS 2KB

manrev.dtd 3KB

ana_orderings.F 414KB

zsol_driver.F 215KB

dsol_driver.F 215KB

csol_driver.F 215KB

ssol_driver.F 215KB

cmumps_load.F 214KB

smumps_load.F 214KB

zmumps_load.F 214KB

dmumps_load.F 214KB

mumps_static_mapping.F 168KB

zana_driver.F 159KB

cana_driver.F 159KB

dana_driver.F 159KB

sana_driver.F 159KB

dfac_driver.F 124KB

zfac_driver.F 124KB

sfac_driver.F 124KB

cfac_driver.F 124KB

dmumps_ooc.F 121KB

zmumps_ooc.F 121KB

cmumps_ooc.F 120KB

smumps_ooc.F 120KB

dmumps_comm_buffer.F 120KB

zmumps_comm_buffer.F 120KB

cmumps_comm_buffer.F 119KB

smumps_comm_buffer.F 119KB

zana_aux.F 115KB

dana_aux.F 115KB

sana_aux.F 115KB

cana_aux.F 115KB

cana_aux_par.F 95KB

zana_aux_par.F 95KB

sana_aux_par.F 95KB

dana_aux_par.F 95KB

dsol_c.F 90KB

zsol_c.F 90KB

csol_c.F 89KB

ssol_c.F 89KB

dmumps_driver.F 87KB

zmumps_driver.F 87KB

cmumps_driver.F 87KB

smumps_driver.F 87KB

dfac_front_aux.F 66KB

zfac_front_aux.F 66KB

cfac_front_aux.F 65KB

sfac_front_aux.F 65KB

zfac_asm_master_m.F 61KB

dfac_asm_master_m.F 61KB

cfac_asm_master_m.F 61KB

sfac_asm_master_m.F 61KB

zfac_asm_master_ELT_m.F 58KB

cfac_asm_master_ELT_m.F 58KB

dfac_asm_master_ELT_m.F 58KB

sfac_asm_master_ELT_m.F 58KB

mpi.f 55KB

zsol_fwd_aux.F 50KB

dsol_fwd_aux.F 50KB

csol_fwd_aux.F 50KB

ssol_fwd_aux.F 49KB

dfac_process_maprow.F 49KB

zfac_process_maprow.F 49KB


MUltifrontal Massively Parallel Solver

(MUMPS 5.1.1)

Users’ guide ∗

March 20, 2017

Abstract

This document describes the Fortran 95 and C user interfaces to MUMPS 5.1.1. We describe in detail the data structures, parameters, calling sequences, and error diagnostics. Basic example programs using MUMPS are also provided.

First improvements regarding low-rank compression (Block-Low Rank, or BLR) format are now available in public releases of MUMPS.

∗ Information on how to obtain updated copies of MUMPS can be obtained from the Web page http://mumps-solver.org/

Contents

1 Introduction
2 Notes for users of previous versions of MUMPS
2.1 ChangeLog
2.2 Binary compatibility
2.3 Upgrading between minor releases
2.4 Upgrading from MUMPS 5.0.2 to MUMPS 5.1.1
2.4.1 Changes on installation issues
2.4.2 Selective 64-bit integer feature
2.5 Upgrading from MUMPS 4.10.0 to MUMPS 5.0.2
2.5.1 Interface with the Metis and ParMetis orderings
2.5.2 Interface with the SCOTCH and PT-SCOTCH orderings
2.5.3 ICNTL(10): iterative refinement
2.5.4 ICNTL(11): error analysis
2.5.5 ICNTL(20): sparse right-hand sides
2.5.6 ICNTL(4): Control of the level of printing
2.5.7 License
3 Main functionalities of MUMPS 5.1.1
3.1 Input matrix format
3.2 Preprocessing
3.3 Post-processing facilities
3.3.1 Iterative refinement
3.3.2 Error analysis and statistics
3.4 Null pivot detection
3.5 Computation of a solution of a deficient matrix and of a null space basis
3.6 Solving the transposed system
3.7 Forward elimination during factorization
3.8 Arithmetic versions
3.9 Numerical pivoting
3.10 The working host processor
3.11 MPI-free version
3.12 Combining MPI and multithreaded parallelism
3.13 Out-of-core facility
3.14 Determinant
3.15 Computing selected entries of $A^{-1}$
3.16 Reduce/condense a problem on an interface (Schur complement and reduced/condensed right-hand side)
3.17 Block Low Rank (BLR) multifrontal factorization
4 User interface and available routines
5 Application Program Interface
5.1 General
5.1.1 Initialization, Analysis, Factorization, Solve, Termination (JOB)
5.1.2 Version number
5.1.3 Control of parallelism (COMM, PAR)
5.2 Input Matrix
5.2.1 Matrix type (SYM)
5.2.2 Matrix format
5.2.2.1 Centralized assembled matrix (ICNTL(5)=0 and ICNTL(18)=0)
5.2.2.2 Distributed assembled matrix (ICNTL(5)=0 and ICNTL(18)=1,2,3)
5.2.2.3 Elemental matrix (ICNTL(5)=1 and ICNTL(18)=0)
5.2.3 Writing the input matrix to a file
5.3 Preprocessing: permutation to zero-free diagonal and scaling
5.3.1 Permutation to a zero-free diagonal (ICNTL(6))
5.3.2 Scaling (ICNTL(6) or ICNTL(8))
5.4 Preprocessing: symmetric permutations
5.4.1 Symmetric permutation vector (ICNTL(7) and ICNTL(29))
5.4.2 Given ordering (ICNTL(7)=1 and ICNTL(28)=1)
5.5 Post-processing: iterative refinement
5.6 Post-processing: error analysis
5.7 Out-of-core (ICNTL(22))
5.8 Workspace parameters (ICNTL(14) and ICNTL(23)) and user workspace
5.9 Null pivot row detection
5.10 Discard matrix factors (ICNTL(31))
5.11 Computation of the determinant (ICNTL(33))
5.12 Forward elimination during factorization (ICNTL(32))
5.13 Right-hand side and solution vectors/matrices
5.13.1 Dense right-hand side (ICNTL(20)=0)
5.13.2 Sparse right-hand side (ICNTL(20)=1,2,3)
5.13.3 A particular case of sparse right-hand side: computing entries of $A^{-1}$ (ICNTL(30)=1)
5.13.4 Centralized solution (ICNTL(21)=0)
5.13.5 Distributed solution (ICNTL(21)=1)
5.14 Schur complement with reduced or condensed right-hand side (ICNTL(19) and ICNTL(26))
5.14.1 Centralized Schur complement stored by rows (ICNTL(19)=1)
5.14.2 Distributed Schur complement (ICNTL(19)=2 or 3)
5.14.3 Centralized Schur complement stored by columns (ICNTL(19)=2 or 3)
5.14.4 Using partial factorization during solution phase (ICNTL(26)= 0, 1 or 2)
5.15 Block Low Rank (BLR) factorization (ICNTL(35) and CNTL(7))
5.15.1 Enabling the BLR functionality at installation
5.15.2 Application Program Interface
5.15.3 BLR output: statistics
6 Control parameters
6.1 Integer control parameters
6.2 Real/complex control parameters
6.3 Compatibility between options
7 Information parameters
7.1 Information local to each processor
7.2 Information available on all processors
8 Error diagnostics
9 Calling MUMPS from C
9.1 Array indices
9.2 Issues related to the C and Fortran communicators
9.3 Fortran I/O
9.4 Runtime libraries
9.5 Integer, real and complex datatypes in C and Fortran
9.6 Sequential version
10 Scilab and MATLAB/Octave interfaces
11 Examples of use of MUMPS
11.1 An assembled problem
11.2 An elemental problem
11.3 An example of calling MUMPS from C
12 License
13 Credits

1 Introduction

MUMPS (“MUltifrontal Massively Parallel Solver”) is a package for solving systems of linear equations of the form $Ax = b$, where $A$ is a square sparse matrix that can be either unsymmetric, symmetric positive definite, or general symmetric, on distributed memory computers. MUMPS implements a direct method based on a multifrontal approach which performs a Gaussian factorization

$$A = LU \tag{1}$$

where $L$ is a lower triangular matrix and $U$ an upper triangular matrix. If the matrix is symmetric then the factorization

$$A = LDL^T \tag{2}$$

where $D$ is a block diagonal matrix with blocks of order 1 or 2 on the diagonal is performed. We refer the reader to the papers [9, 10, 13, 28, 29, 33, 40, 31, 32, 18, 1, 45, 15, 41, 3, 35, 47, 19, 36, 44] for full details of the techniques used, algorithms and related research.
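As a small illustration of factorizations (1) and (2), consider a $2 \times 2$ symmetric matrix chosen here purely as an example (it does not come from the guide). Assuming no preprocessing and only $1 \times 1$ pivots, both factorizations share the same unit lower triangular $L$, and $U = DL^T$:

```latex
\[
A =
\begin{pmatrix} 4 & 2 \\ 2 & 3 \end{pmatrix}
=
\underbrace{\begin{pmatrix} 1 & 0 \\ \tfrac{1}{2} & 1 \end{pmatrix}}_{L}
\underbrace{\begin{pmatrix} 4 & 2 \\ 0 & 2 \end{pmatrix}}_{U}
=
\underbrace{\begin{pmatrix} 1 & 0 \\ \tfrac{1}{2} & 1 \end{pmatrix}}_{L}
\underbrace{\begin{pmatrix} 4 & 0 \\ 0 & 2 \end{pmatrix}}_{D}
\underbrace{\begin{pmatrix} 1 & \tfrac{1}{2} \\ 0 & 1 \end{pmatrix}}_{L^T}
\]
```

Here $D$ contains only blocks of order 1. Blocks of order 2 only come into play when a $1 \times 1$ pivot is not acceptable, which this small example does not exercise.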

The system $Ax = b$ is solved in three main steps:

1. Analysis.
During analysis, preprocessing (see Subsection 3.2), including an ordering based on the symmetrized pattern $A + A^T$, and a symbolic factorization are performed. During the symbolic factorization, a mapping of the multifrontal computational graph, the so-called elimination tree [38], is computed and used to estimate the number of operations and memory necessary for factorization and solution. Both parallel and sequential implementations of the analysis phase are available. Let $A_{\mathrm{pre}}$ denote the preprocessed matrix (further defined in Subsection 3.2).

2. Factorization.
During factorization $A_{\mathrm{pre}} = LU$ or $A_{\mathrm{pre}} = LDL^T$, depending on the symmetry of the preprocessed matrix, is computed. The original matrix is first distributed (or redistributed) onto the processors depending on the mapping computed during the analysis. The numerical factorization is then a sequence of dense factorizations on so-called frontal matrices. In addition to standard threshold pivoting and two-by-two pivoting (not so standard in distributed memory codes) there is an option to perform static pivoting. The elimination tree also expresses independency between tasks and enables multiple fronts to be processed simultaneously. This approach is called the multifrontal approach. After factorization, the factor matrices are kept distributed (in-core memory or on disk); they will be used at the solution phase.

3. Solution.
The solution $x_{\mathrm{pre}}$ of $LUx_{\mathrm{pre}} = b_{\mathrm{pre}}$ or $LDL^T x_{\mathrm{pre}} = b_{\mathrm{pre}}$, where $x_{\mathrm{pre}}$ and $b_{\mathrm{pre}}$ are respectively the transformed solution $x$ and right-hand side $b$ associated to the preprocessed matrix $A_{\mathrm{pre}}$, is obtained through a forward elimination step

$$Ly = b_{\mathrm{pre}} \quad \text{or} \quad LDy = b_{\mathrm{pre}}, \tag{3}$$

followed by a backward elimination step

$$Ux_{\mathrm{pre}} = y \quad \text{or} \quad L^T x_{\mathrm{pre}} = y. \tag{4}$$

The solution $x_{\mathrm{pre}}$ is finally postprocessed to obtain the solution $x$ of the original system $Ax = b$, where $x$ is either assembled on an identified processor (the host) or kept distributed on the working processors. Iterative refinement and backward error analysis are also postprocessing options of the solution phase.
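The factorization and solution steps above can be sketched on a small dense system. The following is only a minimal illustration in pure Python, not MUMPS code: it assumes a dense matrix, no preprocessing (so the preprocessed matrix is the original one) and no pivoting, whereas MUMPS works on sparse frontal matrices with pivoting. The function names and the test system are hypothetical, chosen for this example.

```python
def lu_factor(a):
    """Return (L, U) with A = L U and L unit lower triangular (no pivoting)."""
    n = len(a)
    lower = [[1.0 if i == j else 0.0 for j in range(n)] for i in range(n)]
    upper = [[float(v) for v in row] for row in a]
    for k in range(n):
        if upper[k][k] == 0.0:
            raise ZeroDivisionError("zero pivot: this sketch does no pivoting")
        for i in range(k + 1, n):
            lower[i][k] = upper[i][k] / upper[k][k]
            for j in range(k, n):
                upper[i][j] -= lower[i][k] * upper[k][j]
    return lower, upper


def forward_elimination(lower, b):
    """Solve L y = b, the unsymmetric case of equation (3)."""
    y = []
    for i, row in enumerate(lower):
        partial = sum(row[j] * y[j] for j in range(i))
        y.append((b[i] - partial) / row[i])
    return y


def backward_elimination(upper, y):
    """Solve U x = y, the unsymmetric case of equation (4)."""
    n = len(upper)
    x = [0.0] * n
    for i in reversed(range(n)):
        partial = sum(upper[i][j] * x[j] for j in range(i + 1, n))
        x[i] = (y[i] - partial) / upper[i][i]
    return x


# Hypothetical 3x3 test system, chosen only for illustration.
A = [[4.0, 3.0, 0.0],
     [3.0, 4.0, -1.0],
     [0.0, -1.0, 4.0]]
b = [24.0, 30.0, -24.0]

L, U = lu_factor(A)              # factorization step
y = forward_elimination(L, b)    # forward elimination, equation (3)
x = backward_elimination(U, y)   # backward elimination, equation (4)
print(x)
```

Keeping the factorization separate from the two triangular solves mirrors the structure described above: once `L` and `U` exist, further right-hand sides only need the two cheap elimination steps.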

Each of these 3 phases can be called separately (see Subsection 5.1.1). A special case is the one where the forward elimination step is performed during factorization (see Subsection 3.7), instead of during the solve phase. This allows accessing the $L$ factors right after they have been computed, with a better locality, and can avoid writing the $L$ factors to disk in an out-of-core context. In this case (forward elimination during factorization), only the backward elimination is performed during the solution phase.
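The special case described above can be illustrated with a dense sketch in pure Python. It is an assumption-laden toy, not how MUMPS implements the feature: the matrix is dense, there is no pivoting, and the function names and test system are hypothetical. Each multiplier (an entry of $L$) is applied to the right-hand side as soon as it is computed and is then discarded, so $L$ is never stored and only the backward elimination remains.

```python
def factor_with_forward_elimination(a, b):
    """Eliminate on A and b together; return (U, y) such that U x = y.

    The multipliers (entries of L) are used immediately and never stored.
    """
    n = len(a)
    upper = [[float(v) for v in row] for row in a]
    y = [float(v) for v in b]
    for k in range(n):
        if upper[k][k] == 0.0:
            raise ZeroDivisionError("zero pivot: this sketch does no pivoting")
        for i in range(k + 1, n):
            multiplier = upper[i][k] / upper[k][k]  # would be L[i][k]
            for j in range(k, n):
                upper[i][j] -= multiplier * upper[k][j]
            y[i] -= multiplier * y[k]  # forward elimination on the fly
    return upper, y


def backward_elimination(upper, y):
    """Solve U x = y: the only step left for the solution phase."""
    n = len(upper)
    x = [0.0] * n
    for i in reversed(range(n)):
        partial = sum(upper[i][j] * x[j] for j in range(i + 1, n))
        x[i] = (y[i] - partial) / upper[i][i]
    return x


# Hypothetical 3x3 test system, chosen only for illustration.
A = [[4.0, 3.0, 0.0],
     [3.0, 4.0, -1.0],
     [0.0, -1.0, 4.0]]
b = [24.0, 30.0, -24.0]

U, y = factor_with_forward_elimination(A, b)
x = backward_elimination(U, y)
print(x)
```

The trade-off in this sketch is that the right-hand side must be known at factorization time, and a new right-hand side cannot reuse the discarded multipliers.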

The software is mainly written in Fortran 95 although a C interface is available (see Section 9). Scilab and MATLAB/Octave interfaces are also available in the case of sequential executions. The parallel


Uploaded by: 朱moyimi
