Raw H264 frames in mpegts container using libavcodec
Disclaimer: this page is a translation of a popular StackOverflow question, provided under the CC BY-SA 4.0 license. If you reuse it, you must follow the same CC BY-SA license, link to the original, and attribute it to the original authors (not me): StackOverflow
Original question: http://stackoverflow.com/questions/5964142/
Asked by Ferenc Deak
I would really appreciate some help with the following issue:
I have a gadget with a camera, producing H264 compressed video frames, these frames are being sent to my application. These frames are not in a container, just raw data.
I want to use ffmpeg and libav functions to create a video file, which can be used later.
If I decode the frames and then re-encode them, everything works fine and I get a valid video file. (The decode/encode steps are the usual libav calls, nothing fancy here; I took them from the almighty internet and they are rock solid.) However, I waste a lot of time decoding and encoding, so I would like to skip this step and put the frames directly into the output stream. Now, the problems come.
Here is the code I came up with for producing the encoding:
AVFrame* picture = avcodec_alloc_frame(); // must be allocated before avpicture_fill
avpicture_fill( (AVPicture*) picture, (uint8_t*)frameData,
                codecContext->pix_fmt, codecContext->width,
                codecContext->height );

int outSize = avcodec_encode_video( codecContext, videoOutBuf,
                                    sizeof( videoOutBuf ), picture );
if ( outSize > 0 )
{
    AVPacket packet;
    av_init_packet( &packet );

    packet.pts = av_rescale_q( codecContext->coded_frame->pts,
                               codecContext->time_base, videoStream->time_base );
    if ( codecContext->coded_frame->key_frame )
    {
        packet.flags |= PKT_FLAG_KEY;
    }

    packet.stream_index = videoStream->index;
    packet.data = videoOutBuf;
    packet.size = outSize;

    av_interleaved_write_frame( context, &packet );
    put_flush_packet( context->pb );
}
where the variables are:
frameData is the decoded frame data that came from the camera (it was decoded in a previous step), and videoOutBuf is a plain uint8_t buffer for holding the encoded data.
I have modified the application so that it does not decode the frames, but simply passes the data through, like:
AVPacket packet;
av_init_packet( &packet );

packet.stream_index = videoStream->index;
packet.data = (uint8_t*)frameData;
packet.size = currentFrameSize;

av_interleaved_write_frame( context, &packet );
put_flush_packet( context->pb );
where frameData is the raw H264 frame, and currentFrameSize is the size of the raw H264 frame, i.e. the number of bytes I get from the gadget for every frame.
And suddenly the application is not working correctly anymore; the produced video is unplayable. This is obviously because I was not setting a correct PTS for the packet. What I did was the following (I'm desperate, as you can see from this approach :) )
packet.pts = timestamps[timestamp_counter ++];
where timestamps is actually a list of PTSs produced by the working code above and written to a file (yes, you read that correctly: I logged all the PTSs for a 10-minute session and wanted to use them).
The application still does not work.
Now, here I am without any clue what to do, so here is the question:
I would like to create an "mpegts" video stream using libav functions, insert in the stream already encoded video frames and create a video file with it. How do I do it?
Thanks, f.
Accepted answer by bob2
I believe if you set the following, you will see video playback.
packet.flags |= AV_PKT_FLAG_KEY;
packet.pts = packet.dts = 0;
You should really set packet.flags according to the H264 packet headers. You might try this fellow Stack Overflowian's suggestion for extracting directly from the stream.
If you are also adding audio, then pts/dts is going to be more important. I suggest you study this tutorial.
EDIT
I found time to extract what is working for me from my test app. For some reason, dts/pts values of zero work for me, but values other than 0 or AV_NOPTS_VALUE do not. I wonder if we have different versions of ffmpeg. I have the latest from git://git.videolan.org/ffmpeg.git.
fftest.cpp
#include <string>
#include <stdio.h>   // printf, sprintf, fopen
#include <string.h>  // strcpy, memset

#ifndef INT64_C
#define INT64_C(c) (c ## LL)
#define UINT64_C(c) (c ## ULL)
#endif

//#define _M
#define _M printf( "%s(%d) : MARKER\n", __FILE__, __LINE__ )

extern "C"
{
#include "libavcodec/avcodec.h"
#include "libavformat/avformat.h"
};

AVFormatContext *fc = 0;
int vi = -1, waitkey = 1;

// < 0 = error
//   0 = I-Frame
//   1 = P-Frame
//   2 = B-Frame
//   3 = S-Frame
int getVopType( const void *p, int len )
{
    if ( !p || 6 >= len )
        return -1;

    unsigned char *b = (unsigned char*)p;

    // Verify NAL marker
    if ( b[ 0 ] || b[ 1 ] || 0x01 != b[ 2 ] )
    {
        b++;
        if ( b[ 0 ] || b[ 1 ] || 0x01 != b[ 2 ] )
            return -1;
    } // end if

    b += 3;

    // Verify VOP id
    if ( 0xb6 == *b )
    {
        b++;
        return ( *b & 0xc0 ) >> 6;
    } // end if

    switch ( *b )
    {
        case 0x65 : return 0;
        case 0x61 : return 1;
        case 0x01 : return 2;
    } // end switch

    return -1;
}

void write_frame( const void* p, int len )
{
    if ( 0 > vi )
        return;

    AVStream *pst = fc->streams[ vi ];

    // Init packet
    AVPacket pkt;
    av_init_packet( &pkt );
    pkt.flags |= ( 0 >= getVopType( p, len ) ) ? AV_PKT_FLAG_KEY : 0;
    pkt.stream_index = pst->index;
    pkt.data = (uint8_t*)p;
    pkt.size = len;

    // Wait for key frame
    if ( waitkey )
    {
        if ( 0 == ( pkt.flags & AV_PKT_FLAG_KEY ) )
            return;
        else
            waitkey = 0;
    }

    pkt.dts = AV_NOPTS_VALUE;
    pkt.pts = AV_NOPTS_VALUE;

    // av_write_frame( fc, &pkt );
    av_interleaved_write_frame( fc, &pkt );
}

void destroy()
{
    waitkey = 1;
    vi = -1;

    if ( !fc )
        return;

    _M; av_write_trailer( fc );

    if ( fc->oformat && !( fc->oformat->flags & AVFMT_NOFILE ) && fc->pb )
        avio_close( fc->pb );

    // Free the stream
    _M; av_free( fc );

    fc = 0;
    _M;
}

int get_nal_type( void *p, int len )
{
    if ( !p || 5 >= len )
        return -1;

    unsigned char *b = (unsigned char*)p;

    // Verify NAL marker
    if ( b[ 0 ] || b[ 1 ] || 0x01 != b[ 2 ] )
    {
        b++;
        if ( b[ 0 ] || b[ 1 ] || 0x01 != b[ 2 ] )
            return -1;
    } // end if

    b += 3;

    return *b;
}

int create( void *p, int len )
{
    if ( 0x67 != get_nal_type( p, len ) )
        return -1;

    destroy();

    const char *file = "test.avi";
    CodecID codec_id = CODEC_ID_H264;
    // CodecID codec_id = CODEC_ID_MPEG4;
    int br = 1000000;
    int w = 480;
    int h = 354;
    int fps = 15;

    // Create container
    _M; AVOutputFormat *of = av_guess_format( 0, file, 0 );
    fc = avformat_alloc_context();
    fc->oformat = of;
    strcpy( fc->filename, file );

    // Add video stream
    _M; AVStream *pst = av_new_stream( fc, 0 );
    vi = pst->index;

    AVCodecContext *pcc = pst->codec;
    _M; avcodec_get_context_defaults2( pcc, AVMEDIA_TYPE_VIDEO );
    pcc->codec_type = AVMEDIA_TYPE_VIDEO;
    pcc->codec_id = codec_id;
    pcc->bit_rate = br;
    pcc->width = w;
    pcc->height = h;
    pcc->time_base.num = 1;
    pcc->time_base.den = fps;

    // Init container
    _M; av_set_parameters( fc, 0 );

    if ( !( fc->oformat->flags & AVFMT_NOFILE ) )
        avio_open( &fc->pb, fc->filename, URL_WRONLY );

    _M; av_write_header( fc );

    _M; return 1;
}

int main( int argc, char** argv )
{
    int f = 0, sz = 0;
    char fname[ 256 ] = { 0 };
    char buf[ 128 * 1024 ];

    av_log_set_level( AV_LOG_ERROR );
    av_register_all();

    do
    {
        // Raw frames in v0.raw, v1.raw, v2.raw, ...
        // sprintf( fname, "rawvideo/v%d.raw", f++ );
        sprintf( fname, "frames/frame%d.bin", f++ );
        printf( "%s\n", fname );

        FILE *fd = fopen( fname, "rb" );
        if ( !fd )
            sz = 0;
        else
        {
            sz = fread( buf, 1, sizeof( buf ) - FF_INPUT_BUFFER_PADDING_SIZE, fd );
            if ( 0 < sz )
            {
                memset( &buf[ sz ], 0, FF_INPUT_BUFFER_PADDING_SIZE );

                if ( !fc )
                    create( buf, sz );

                if ( fc )
                    write_frame( buf, sz );

            } // end if

            fclose( fd );

        } // end else

    } while ( 0 < sz );

    destroy();
}
Answered by user347594
You can create a process to call ffmpeg from console.
Example command line for processing files like 000001.jpg, 000002.jpg, 000003.jpg, ...:
ffmpeg -i c:\frames\%06d.jpg -r 16 -vcodec mpeg4 -an -y c:\video\some_video.avi

