vba 从 Word 文档中获取标题
声明:本页面是StackOverFlow热门问题的中英对照翻译,遵循CC BY-SA 4.0协议,如果您需要使用它,必须同样遵循CC BY-SA许可,注明原文地址和作者信息,同时你必须将它归于原作者(不是我):StackOverFlow
原文地址: http://stackoverflow.com/questions/274814/
Warning: these are provided under cc-by-sa 4.0 license. You are free to use/share it, But you must attribute it to the original authors (not me):
StackOverFlow
Getting the headings from a Word document
提问by user35762
How do I get a list of all the headings in a word document by using VBA?
如何使用 VBA 获取 Word 文档中所有标题的列表?
采纳答案by VonC
You mean like this createOutlinefunction (which actually copy all headings from a source word document into a new word document):
你的意思是这样的createOutline函数(它实际上将源 Word 文档中的所有标题复制到一个新的 Word 文档中):
(I believe the astrHeadings = _docSource.GetCrossReferenceItems(wdRefTypeHeading)function is the key in this program, and should allow you to retrieve what you are asking for)
(我相信 astrHeadings =函数是这个程序的关键,应该允许你检索你所要求的)_docSource.GetCrossReferenceItems(wdRefTypeHeading)
Public Sub CreateOutline()
Dim docOutline As Word.Document
Dim docSource As Word.Document
Dim rng As Word.Range
Dim astrHeadings As Variant
Dim strText As String
Dim intLevel As Integer
Dim intItem As Integer
Set docSource = ActiveDocument
Set docOutline = Documents.Add
' Content returns only the main body of the document, not the headers/footer.
Set rng = docOutline.Content
' GetCrossReferenceItems(wdRefTypeHeading) returns an array with references to all headings in the document
astrHeadings = docSource.GetCrossReferenceItems(wdRefTypeHeading)
For intItem = LBound(astrHeadings) To UBound(astrHeadings)
' Get the text and the level.
strText = Trim$(astrHeadings(intItem))
intLevel = GetLevel(CStr(astrHeadings(intItem)))
' Add the text to the document.
rng.InsertAfter strText & vbNewLine
' Set the style of the selected range and
' then collapse the range for the next entry.
rng.Style = "Heading " & intLevel
rng.Collapse wdCollapseEnd
Next intItem
End Sub
Private Function GetLevel(strItem As String) As Integer
' Return the heading level of a header from the
' array returned by Word.
' The number of leading spaces indicates the
' outline level (2 spaces per level: H1 has
' 0 spaces, H2 has 2 spaces, H3 has 4 spaces.
Dim strTemp As String
Dim strOriginal As String
Dim intDiff As Integer
' Get rid of all trailing spaces.
strOriginal = RTrim$(strItem)
' Trim leading spaces, and then compare with
' the original.
strTemp = LTrim$(strOriginal)
' Subtract to find the number of
' leading spaces in the original string.
intDiff = Len(strOriginal) - Len(strTemp)
GetLevel = (intDiff / 2) + 1
End Function
UPDATE by @kol on March 6, 2018
@kol 于 2018 年 3 月 6 日更新
Although astrHeadingsis an array (IsArrayreturns True, and TypeNamereturns String()) I get a type mismatcherror when I try to access its elements in VBScript (v5.8.16384 on Windows 10 Pro 1709 16299.248). This must be a VBScript-specific problem, because I can access the elements if I run the same code in Word's VBA editor. I ended up iterating the lines of the TOC, because it works even from VBScript:
尽管astrHeadings是一个数组(IsArray返回True和TypeName返回String()),type mismatch但当我尝试在 VBScript(Windows 10 Pro 1709 16299.248 上的 v5.8.16384)中访问其元素时出现错误。这一定是 VBScript 特有的问题,因为如果我在 Word 的 VBA 编辑器中运行相同的代码,我就可以访问这些元素。我最终迭代了 TOC 的行,因为它甚至可以从 VBScript 中运行:
For Each Paragraph In Doc.TablesOfContents(1).Range.Paragraphs
WScript.Echo Paragraph.Range.Text
Next
回答by JonnyGold
The easiest way to get a list of headings, is to loop through the paragraphs in the document, for example:
获取标题列表的最简单方法是遍历文档中的段落,例如:
Sub ReadPara()
Dim DocPara As Paragraph
For Each DocPara In ActiveDocument.Paragraphs
If Left(DocPara.Range.Style, Len("Heading")) = "Heading" Then
Debug.Print DocPara.Range.Text
End If
Next
End Sub
By the way, I find it is a good idea to remove the final character of the paragraph range. Otherwise, if you send the string to a message box or a document, Word displays an extra control character. For example:
顺便说一句,我发现删除段落范围的最后一个字符是个好主意。否则,如果您将字符串发送到消息框或文档,Word 将显示一个额外的控制字符。例如:
Left(DocPara.Range.Text, len(DocPara.Range.Text)-1)
回答by joshoff
This macro worked beautifully for me (Word 2010). I've extended the functionality slightly: now it prompts the user to enter a minimum level, and supresses subheadings below that level.
这个宏对我来说效果很好(Word 2010)。我稍微扩展了该功能:现在它会提示用户输入最低级别,并隐藏低于该级别的副标题。
Public Sub CreateOutline()
' from http://stackoverflow.com/questions/274814/getting-the-headings-from-a-word-document
Dim docOutline As Word.Document
Dim docSource As Word.Document
Dim rng As Word.Range
Dim astrHeadings As Variant
Dim strText As String
Dim intLevel As Integer
Dim intItem As Integer
Dim minLevel As Integer
Set docSource = ActiveDocument
Set docOutline = Documents.Add
minLevel = 1 'levels above this value won't be copied.
minLevel = CInt(InputBox("This macro will generate a new document that contains only the headers from the existing document. What is the lowest level heading you want?", "2"))
' Content returns only the
' main body of the document, not
' the headers and footer.
Set rng = docOutline.Content
astrHeadings = _
docSource.GetCrossReferenceItems(wdRefTypeHeading)
For intItem = LBound(astrHeadings) To UBound(astrHeadings)
' Get the text and the level.
strText = Trim$(astrHeadings(intItem))
intLevel = GetLevel(CStr(astrHeadings(intItem)))
If intLevel <= minLevel Then
' Add the text to the document.
rng.InsertAfter strText & vbNewLine
' Set the style of the selected range and
' then collapse the range for the next entry.
rng.Style = "Heading " & intLevel
rng.Collapse wdCollapseEnd
End If
Next intItem
End Sub
Private Function GetLevel(strItem As String) As Integer
' from http://stackoverflow.com/questions/274814/getting-the-headings-from-a-word-document
' Return the heading level of a header from the
' array returned by Word.
' The number of leading spaces indicates the
' outline level (2 spaces per level: H1 has
' 0 spaces, H2 has 2 spaces, H3 has 4 spaces.
Dim strTemp As String
Dim strOriginal As String
Dim intDiff As Integer
' Get rid of all trailing spaces.
strOriginal = RTrim$(strItem)
' Trim leading spaces, and then compare with
' the original.
strTemp = LTrim$(strOriginal)
' Subtract to find the number of
' leading spaces in the original string.
intDiff = Len(strOriginal) - Len(strTemp)
GetLevel = (intDiff / 2) + 1
End Function
回答by dxc
Fastest method for extracting of all headings (to LEVEL5).
提取所有标题的最快方法(到 LEVEL5)。
Sub EXTRACT_HDNGS()
Dim WDApp As Word.Application 'WORD APP
Dim WDDoc As Word.Document 'WORD DOC
Set WDApp = Word.Application
Set WDDoc = WDApp.ActiveDocument
For Head_n = 1 To 5
Head = ("Heading " & Head_n)
WDApp.Selection.HomeKey wdStory, wdMove
Do
With WDApp.selection
.MoveStart Unit:=wdLine, Count:=1
.Collapse Direction:=wdCollapseEnd
End with
With WDApp.Selection.Find
.ClearFormatting: .text = "":
.MatchWildcards = False: .Forward = True
.Style = WDDoc.Styles(Head)
If .Execute = False Then GoTo Level_exit
.ClearFormatting
End With
Heading_txt = RemoveSpecialChar(WDApp.Selection.Range.text, 1): Debug.Print Heading_txt
Heading_lvl = WDApp.Selection.Range.ListFormat.ListLevelNumber: Debug.Print Heading_lvl
Heading_lne = WDDoc.Range(0, WDApp.Selection.Range.End).Paragraphs.Count: Debug.Print Heading_lne
Heading_pge = WDApp.Selection.Information(wdActiveEndPageNumber): Debug.Print Heading_pge
If Wdapp.Selection.Style = "Heading 1" Then GoTo Level_exit
Wdapp.Selection.Collapse Direction:=wdCollapseStart
Loop
Level_exit:
Next Head_n
End Sub
回答by MagTun
Following Wikis comment on VonC answer, here is the code that worked for me. It makes the function faster.
遵循维基对 VonC 答案的评论,这是对我有用的代码。它使功能更快。
Public Sub CopyHeadingsInNewDoc()
Dim docOutline As Word.Document
Dim docSource As Word.Document
Dim rng As Word.Range
Dim astrHeadings As Variant
Dim strText As String
Dim longLevel As Integer
Dim longItem As Integer
Set docSource = ActiveDocument
Set docOutline = Documents.Add
' Content returns only the
' main body of the document, not
' the headers and footer.
Set rng = docOutline.Content
astrHeadings = _
docSource.GetCrossReferenceItems(wdRefTypeHeading)
For intItem = LBound(astrHeadings) To UBound(astrHeadings)
' Get the text and the level.
strText = Trim$(astrHeadings(intItem))
intLevel = GetLevel(CStr(astrHeadings(intItem)))
' Add the text to the document.
rng.InsertAfter strText & vbNewLine
' Set the style of the selected range and
' then collapse the range for the next entry.
rng.Style = "Heading " & intLevel
rng.Collapse wdCollapseEnd
Next intItem
End Sub
Private Function GetLevel(strItem As String) As Integer
' Return the heading level of a header from the
' array returned by Word.
' The number of leading spaces indicates the
' outline level (2 spaces per level: H1 has
' 0 spaces, H2 has 2 spaces, H3 has 4 spaces.
Dim strTemp As String
Dim strOriginal As String
Dim longDiff As Integer
' Get rid of all trailing spaces.
strOriginal = RTrim$(strItem)
' Trim leading spaces, and then compare with
' the original.
strTemp = LTrim$(strOriginal)
' Subtract to find the number of
' leading spaces in the original string.
longDiff = Len(strOriginal) - Len(strTemp)
GetLevel = (longDiff / 2) + 1
End Function
回答by jumpHyman
Why reinventing the wheel so many times?!?
为什么要多次重新发明轮子?!?
A "list of all headings" is just the standard Word index of document!
“所有标题的列表”只是文档的标准 Word 索引!
This is what I got by recording a macro while adding index to the document:
这是我在向文档添加索引的同时录制宏得到的结果:
Sub Macro1()
ActiveDocument.TablesOfContents.Add Range:=Selection.Range, _
RightAlignPageNumbers:=True, _
UseHeadingStyles:=True, _
UpperHeadingLevel:=1, _
LowerHeadingLevel:=5, _
IncludePageNumbers:=True, _
AddedStyles:="", _
UseHyperlinks:=True, _
HidePageNumbersInWeb:=True, _
UseOutlineLevels:=True
End Sub
回答by JoeK
You can also create a Table of Contents in the doc and copy that. This separates out the para ref from the title, which is handy if you need to present that in another context. If you do not want the ToC in your doc, just delete that after the Copy n Paste. JK.
您还可以在文档中创建一个目录并复制它。这将 para ref 与标题分开,如果您需要在另一个上下文中呈现它,这很方便。如果您不想在您的文档中使用 ToC,只需在 Copy n Paste 之后将其删除即可。JK。

